48 dependents
| Package | Description | Downloads/month |
|---|---|---|
| :us: a python library for parsing unstructured United States address strings int... | 6.4M | |
| scikit-learn inspired API for CRFsuite | 578K | |
| :family: a python library for parsing unstructured western names into name compo... | 196K | |
| A tokenizer, text cleaner, and phonemizer for many human languages. | 121K | |
| A tool to parse recipe ingredients into structured data | 30K | |
| Open-source low code data preparation library in python. Collect, clean and visu... | 11K | |
| Open source tools for Estonian natural language processing | 11K | |
| Persian NLP Toolkit | 9K | |
| :bookmark: A toolkit for making domain-specific probabilistic parsers | 8K | |
| Простой расстановщик ударений с обработкой омографов | 6K | |
| test processing | 5K | |
| 一个轻量且功能全面的中文分词器,帮助学生了解分词器的工作原理。MicroTokenizer: A lightweight Chinese tokenizer d... | 4K | |
| helix.personmatching | 3K | |
| PyThaiNLP For spaCy | 2K | |
| Python library for Pyidaungsu Myanmar languages | 2K | |
| geNomad: Identification of mobile genetic elements | 2K | |
| mySpellChecker | 2K | |
| LEKCut (เล็ก คัด) is a Thai tokenization library that ports the deep learning mo... | 1K | |
| AI for Thai Python Package | 959 | |
| ClowdFlows natural language processing module | 804 | |
| The 10-K Report Item Segmentation Tool | 781 | |
| F5-TTS: Text-to-Speech (TTS) ภาษาไทย — เครื่องมือสร้างเสียงพูดจากข้อความด้วยเทคน... | 749 | |
| Library for parsing unstructured FR addresses strings into address components | 716 | |
| A Library for using in our CRM | 665 | |
| Domain Adaptation of Thai Word Segmentation Models using Stacked Ensemble (EMNLP... | 577 | |
| X-Voice | 571 | |
| search for addresses in the text | 535 | |
| A toolkit for extracting chemical information from the scientific literature. | 468 | |
| 442 | ||
| NLP framework in python for entity recognition and relationship extraction | 399 | |
| CRF tagger | 380 | |
| Extract body text from Japanese business emails | 380 | |
| A (fast) Khmer word segmentation toolkit. | 357 | |
| Morphological Analyzer for Russian 💬 | 309 | |
| UK address utility based on machine learning and optimised search to parse, stan... | 304 | |
| TextFlows taggers module | 285 | |
| Python library for building custom AI chatbot with just one line of code. | 225 | |
| Python package for keyphrase labeling. | 205 | |
| Pycrfsuite를 이용한 띄어쓰기 교정기 | 204 | |
| chatbot_ner: Named Entity Recognition for chatbots. | 197 | |
| 150 | ||
| Comprehensive tokenization library for Myanmar language | 143 | |
| Deep Learning systems for training and testing disfluency detection and related ... | 113 | |
| 📜Probabilistic parser for tagging data that references the Illinois Compiled Sta... | 80 | |
| Project for Vietnamese nlp | 79 | |
| This is a package to translate a chinese sentence into bopomofo letters | 78 | |
| fetch, munge, and parse résumés and job postings | 75 | |
| Enhanced parser for medical professional names with advanced NLP methods, suppor... | 61 |