Package Insights
((week_daily_avg - month_daily_avg) / month_daily_avg) * 100Weekly Downloads
GitHub Stars
Downloads by OS
Python Versions
Top Countries
Dependencies
- beautifulsoup4 >=4.14.2
- certifi >=2026.2.25
- charset-normalizer >=3.4.0
- docx2txt >=0.8
- faust-cchardet ==2.1.19
- gielladetect ==1.0.3
- html5lib ==1.1
- idna ==3.12
- joblib >=1.5.3
- lxml ==6.1.0
- numpy >=2.2.6
- psycopg2-binary >=2.9.10
- pygtrie >=2.5.0
- pymupdf ==1.23.26
- python-dateutil ==2.9.0.post0
- python-docx ==1.2.0
- python-dotenv >=1.0.1
- pytz >=2026.1.post1
- pyyaml >=6.0.2
- requests >=2.33.1
- simhash >=2.1.2
- six ==1.17.0
- soupsieve ==2.8.3
- tqdm >=4.67.1
- typing-extensions >=4.12.2
- tzdata >=2026.1
- urllib3 <3.0,>=2.5.0
- warcio >=1.7.4
- webencodings ==0.5.1
3 optional dependencies
- fasttext[glotlid]
- htmldate[htmldate]
- huggingface-hub[glotlid]