ModuleNotFoundError: No module named 'cgi' #86

@TopCoder2K

I'm trying to load the factru dataset following the documentation:

from corus import load_factru

factru_dataset = load_factru("data/factru", ["testset"])

but I get:

---------------------------------------------------------------------------
ModuleNotFoundError                       Traceback (most recent call last)
Cell In[3], line 1
----> 1 from corus import load_factru
      3 factru_dataset = load_factru("data/factru", ["testset"])

http://localhost:8888/ad_hoc_analysis/src/ner/.venv/lib/python3.13/site-packages/corus/__init__.py#line=1
----> 2 from .sources import *  # noqa

http://localhost:8888/ad_hoc_analysis/src/ner/.venv/lib/python3.13/site-packages/corus/sources/__init__.py#line=12
     11 from .buriy import *  # noqa
     12 from .mokoron import *  # noqa
---> 13 from .wiki import load_wiki  # noqa
     14 from .ods import *  # noqa
     15 from .ria import *  # noqa

http://localhost:8888/ad_hoc_analysis/src/ner/.venv/lib/python3.13/site-packages/corus/sources/wiki.py#line=6
      5 from corus.record import Record
      6 from corus.io import load_bz2_lines
----> 7 from corus.third.WikiExtractor import (
      8     options,
      9     pages_from,
     10     Extractor
     11 )
     14 options.write_json = True
     17 class WikiRecord(Record):

http://localhost:8888/ad_hoc_analysis/src/ner/.venv/lib/python3.13/site-packages/corus/third/WikiExtractor.py#line=62
     61 import bz2
     62 import codecs
---> 63 import cgi
     64 import fileinput
     65 import logging

ModuleNotFoundError: No module named 'cgi'

I also get a lot of SyntaxWarning: invalid escape sequence messages.
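
For reference, here is a minimal sketch of how this kind of warning arises. It assumes the warnings come from ordinary (non-raw) string literals containing backslash sequences such as \d; I have not checked which files in the package actually trigger them. The `source` string and the `<example>` filename are made up for illustration.

```python
import warnings

# Source text whose string literal contains the unrecognized escape \d.
# The doubled backslash here yields a single backslash in the compiled text.
source = 'pattern = "\\d+"'

with warnings.catch_warnings(record=True) as caught:
    warnings.simplefilter("always")  # make sure no warning is suppressed
    compile(source, "<example>", "exec")

# Collect the categories of the warnings raised during compilation.
categories = [w.category for w in caught]
for category in categories:
    print(category.__name__)
```

Writing the literal as a raw string (r"\d+") avoids the warning, since raw strings do not process backslash escapes.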

Does this mean the package was installed incorrectly? My setup:

uv 0.10.5
Python 3.13.9
corus 0.10.0
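
A quick check that may help narrow this down, assuming the failure is related to the cgi module no longer being part of the standard library as of Python 3.13 (it was deprecated by PEP 594). The helper name `cgi_importable` is my own and not part of corus.

```python
import importlib.util
import sys


def cgi_importable() -> bool:
    """Return True if the import system can locate a module named 'cgi'."""
    return importlib.util.find_spec("cgi") is not None


if __name__ == "__main__":
    major, minor = sys.version_info[:2]
    print(f"Python {major}.{minor}: cgi importable = {cgi_importable()}")
```

If this reports False on 3.13 but True on an older interpreter, the error would come from the Python version rather than from how the package was installed.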
