I'm trying to load the factru dataset following the documentation:
from corus import load_factru
factru_dataset = load_factru("data/factru", ["testset"])
but I get:
---------------------------------------------------------------------------
ModuleNotFoundError Traceback (most recent call last)
Cell In[3], line 1
----> 1 from corus import load_factru
3 factru_dataset = load_factru("data/factru", ["testset"])
http://localhost:8888/ad_hoc_analysis/src/ner/.venv/lib/python3.13/site-packages/corus/__init__.py#line=1
----> 2 from .sources import * # noqa
http://localhost:8888/ad_hoc_analysis/src/ner/.venv/lib/python3.13/site-packages/corus/sources/__init__.py#line=12
11 from .buriy import * # noqa
12 from .mokoron import * # noqa
---> 13 from .wiki import load_wiki # noqa
14 from .ods import * # noqa
15 from .ria import * # noqa
http://localhost:8888/ad_hoc_analysis/src/ner/.venv/lib/python3.13/site-packages/corus/sources/wiki.py#line=6
5 from corus.record import Record
6 from corus.io import load_bz2_lines
----> 7 from corus.third.WikiExtractor import (
8 options,
9 pages_from,
10 Extractor
11 )
14 options.write_json = True
17 class WikiRecord(Record):
http://localhost:8888/ad_hoc_analysis/src/ner/.venv/lib/python3.13/site-packages/corus/third/WikiExtractor.py#line=62
61 import bz2
62 import codecs
---> 63 import cgi
64 import fileinput
65 import logging
ModuleNotFoundError: No module named 'cgi'
I also get a lot of SyntaxWarning: invalid escape sequence messages.
Does this mean the package was installed incorrectly? My setup:
uv 0.10.5
Python 3.13.9
corus 0.10.0