PyPI Stats
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About
Home

Search Packages

Find Python packages by name, description, GitHub topic, or filter by metrics
nvidia
nemo-text-processing

NeMo text processing for ASR and TTS

102K 455 159
ikegami-yukino
neologdn

Japanese text normalizer for mecab-neologd

43K 289 20
vinhdq842
soe-vinorm

Soe Vinorm: An Effective Text Normalization Toolkit for converting Vietnamese text to its spoken form.

4K 19 8
rhnfzl
squeakycleantext

Text preprocessing and PII anonymisation for NLP/ML. ONNX NER ensemble, language detection, stopword removal. Built for statistical ML and language models.

2K 8 0
pgolo
sic

Utility for string normalization

1K 2 0
curegit
unicodecheck

A simple tool to check if Unicode text files are Unicode-normalized

561 1 1
yeiichi
smith-utils

Essential Python utilities for robust text normalization, strict date handling, and numeric parsing.

558 0 0
seanghay
tha

A Khmer Text Normalization and Verbalization Toolkit.

362 9 1
jasminsternkopf
english-text-normalization

Command-line interface (CLI) and library to normalize English texts.

201 4 1
devjerry0
sane-contractions

Enhanced fork of contractions library - Expands English contractions with improved performance and new features

164 5 0
pszemraj
rehuman

Python bindings for rehuman: Unicode-safe text cleaning & normalization

145 0 0
    • Data from PyPI, GitHub, ClickHouse, and BigQuery