19 dependents
Package Description Downloads/month
Unicode Standard tokenization routines and orthography profile segmentation 852K
python package to read and write CLDF datasets 26K
Python library for quantitative tasks in historical linguistics 15K
Python API to access glottolog/glottolog 8K
Tooling to create CLDF datasets from existing data 5K
Handling Interlinear Glossed Text in python 5K
Programmatic access to linguistic literature 4K
Development tools and dependencies for use in the csvcubed tooling. 920
A cldfbench plugin to curate D-PLACE datasets 624
PDF processing pipeline: remove headers/footers, convert to markdown, and genera... 414
Programmatic curation of Glottography datasets 393
Package for curating dictionaries for the Dictionaria project 381
Provides functionality necessary to transform RDF Data Cube style CSV-Ws into a ... 329
Python package for the NoRaRe collection 321
Programmatic access to data in ASJP's text format 271
Convert a CSVW document (CSV metadata) to a DuckDB query to load a CSV file into... 135
a PyQt6 port of the excellent autiobooks project found at https://github.com/plu... 127
TuLaR curation library 79
Benchmark Datasets for Computational Historical Linguistics 73