51 dependents
| Package | Description | Downloads/month |
|---|---|---|
| A tiny library for Python text normalisation. Useful for ad-hoc text processing. | 214K | |
| Utility library to turn country names into ISO two-letter codes | 38K | |
| Informační systém pro střediska volného času | 8K | |
| Sherpa Consolidation processor | 6K | |
| An attempt towards a followthemoney query dsl. | 6K | |
| Tool for creating, modifying and validating CycloneDX SBOMs. | 5K | |
| Open Source search based on OpenStreetMap data | 4K | |
| Open Source search based on OpenStreetMap data | 4K | |
| Library for building and paring Connexions' EPUBs. | 3K | |
| Create segments from annotations | 3K | |
| Annotator based on Huggingface transformers zero-shot classification pipeline | 2K | |
| A multi-platform app for writing metadata to digital comics | 2K | |
| Formatter based on Huggingface transformers summarization pipeline | 2K | |
| Python tool to check font files for language/character set support | 2K | |
| Open source Church presentation and lyrics projection application. | 1K | |
| API for OpenSanctions with support for entity search and bulk matching of data c... | 1K | |
| A simple interface written in python for reproducible i/o workflows around tabul... | 1K | |
| Application for accepting publication requests to the Connexions Archive. | 1K | |
| Creating fish movement heat-trails | 846 | |
| Classifier based on Huggingface Text Classification pipeline | 753 | |
| 701 | ||
| A distributed crawler for getting info about DNS domains and services attached t... | 631 | |
| Translit russian cyrillic and slugify phrases | 623 | |
| 565 | ||
| Mathics3 Language and Translation Toolkit module via PyICU | 564 | |
| Un-fairseq: UnFormers (Universal Transformers) — config-driven enc-dec chassis c... | 527 | |
| A library for counting stop words in HTML pages. | 475 | |
| A better date and time API for Python | 458 | |
| Deterministic Latin and IPA transliteration for Kazakh, Kyrgyz, Uzbek, Turkish, ... | 422 | |
| Af format tools | 419 | |
| Column store implementation for ftm data based on clickhouse | 377 | |
| Text tokenizers optimized for sparse retrieval. | 355 | |
| Library that turns comment chains into Ace Attorney scenes, used in several bots | 354 | |
| ... | 261 | |
| deepl translate for free, based on pyppeteer | 258 | |
| dualtext alignment making use of a remote API for embedding | 253 | |
| Deterministic Latin and IPA transliteration for Kazakh, Kyrgyz, plus tokenizer/g... | 249 | |
| Python and command-line utility for converting scripture references between form... | 230 | |
| A good mix of Norwegian dictionaries | 210 | |
| The XML-to-OCDS parser for the TEDective project based on lxml | 192 | |
| Annotator based on Huggingface transformers zero-shot classification pipeline | 192 | |
| A fast Thai word tokenization | 192 | |
| A package for anchored decoding | 176 | |
| Sanskrit metre : miscellaneous code and data. | 167 | |
| My personal porting of libretranslate | 146 | |
| A CLDR-compliant slugify function using PyICU | 96 | |
| Polyglot is a natural language pipeline that supports massive multilingual appli... | 80 | |
| LangDive is a library for measuring the level of linguistic diversity in multili... | 75 | |
| 70 | ||
| Clacks framework | 49 |