51 dependents
Package Description Downloads/month
A tiny library for Python text normalisation. Useful for ad-hoc text processing. 214K
Utility library to turn country names into ISO two-letter codes 38K
Information system for leisure time centers 8K
Sherpa Consolidation processor 6K
An attempt towards a followthemoney query DSL. 6K
Tool for creating, modifying and validating CycloneDX SBOMs. 5K
Open Source search based on OpenStreetMap data 4K
Open Source search based on OpenStreetMap data 4K
Library for building and parsing Connexions' EPUBs. 3K
Create segments from annotations 3K
Annotator based on Huggingface transformers zero-shot classification pipeline 2K
A multi-platform app for writing metadata to digital comics 2K
Formatter based on Huggingface transformers summarization pipeline 2K
Python tool to check font files for language/character set support 2K
Open source Church presentation and lyrics projection application. 1K
API for OpenSanctions with support for entity search and bulk matching of data c... 1K
A simple interface written in Python for reproducible i/o workflows around tabul... 1K
Application for accepting publication requests to the Connexions Archive. 1K
Creating fish movement heat-trails 846
Classifier based on Huggingface Text Classification pipeline 753
701
A distributed crawler for getting info about DNS domains and services attached t... 631
Transliterate Russian Cyrillic and slugify phrases 623
565
Mathics3 Language and Translation Toolkit module via PyICU 564
Un-fairseq: UnFormers (Universal Transformers) — config-driven enc-dec chassis c... 527
A library for counting stop words in HTML pages. 475
A better date and time API for Python 458
Deterministic Latin and IPA transliteration for Kazakh, Kyrgyz, Uzbek, Turkish, ... 422
pypa af
Af format tools 419
Column store implementation for ftm data based on clickhouse 377
Text tokenizers optimized for sparse retrieval. 355
Library that turns comment chains into Ace Attorney scenes, used in several bots 354
... 261
deepl translate for free, based on pyppeteer 258
dualtext alignment making use of a remote API for embedding 253
Deterministic Latin and IPA transliteration for Kazakh, Kyrgyz, plus tokenizer/g... 249
Python and command-line utility for converting scripture references between form... 230
A good mix of Norwegian dictionaries 210
The XML-to-OCDS parser for the TEDective project based on lxml 192
Annotator based on Huggingface transformers zero-shot classification pipeline 192
Fast Thai word tokenization 192
A package for anchored decoding 176
Sanskrit metre : miscellaneous code and data. 167
My personal porting of libretranslate 146
A CLDR-compliant slugify function using PyICU 96
Polyglot is a natural language pipeline that supports massive multilingual appli... 80
LangDive is a library for measuring the level of linguistic diversity in multili... 75
70
Clacks framework 49