PyPI Stats
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About
Home

Search Packages

Find Python packages by name, description, GitHub topic, or filter by metrics
macbre
mediawiki-dump

Python package for working with MediaWiki XML content dumps

643 25 4
akb89
witokit

A Python toolkit to generate a tokenized dump of Wikipedia for NLP

326 11 1
omarkamali
wikisets

Flexible Wikipedia dataset builder with sampling and pretraining support. Built on top of wikipedia-monthly, providing fresh, clean Wikipedia dumps updated monthly.

256 4 0
bfontaine
wpydumps

Read Wikipedia dumps

212 1 0
jon-edward
wiki-data-dump

A library that assists in traversing and downloading from Wikimedia Data Dumps and their mirrors.

147 11 1
    • Data from PyPI, GitHub, ClickHouse, and BigQuery