48 dependents
Package Description Downloads/month
:us: a python library for parsing unstructured United States address strings int... 6.4M
scikit-learn inspired API for CRFsuite 578K
:family: a python library for parsing unstructured western names into name compo... 196K
A tokenizer, text cleaner, and phonemizer for many human languages. 121K
A tool to parse recipe ingredients into structured data 30K
Open-source low code data preparation library in python. Collect, clean and visu... 11K
Open source tools for Estonian natural language processing 11K
Persian NLP Toolkit 9K
:bookmark: A toolkit for making domain-specific probabilistic parsers 8K
Простой расстановщик ударений с обработкой омографов 6K
test processing 5K
一个轻量且功能全面的中文分词器,帮助学生了解分词器的工作原理。MicroTokenizer: A lightweight Chinese tokenizer d... 4K
helix.personmatching 3K
PyThaiNLP For spaCy 2K
Python library for Pyidaungsu Myanmar languages 2K
geNomad: Identification of mobile genetic elements 2K
mySpellChecker 2K
LEKCut (เล็ก คัด) is a Thai tokenization library that ports the deep learning mo... 1K
AI for Thai Python Package 959
ClowdFlows natural language processing module 804
The 10-K Report Item Segmentation Tool 781
F5-TTS: Text-to-Speech (TTS) ภาษาไทย — เครื่องมือสร้างเสียงพูดจากข้อความด้วยเทคน... 749
Library for parsing unstructured FR addresses strings into address components 716
A Library for using in our CRM 665
Domain Adaptation of Thai Word Segmentation Models using Stacked Ensemble (EMNLP... 577
X-Voice 571
search for addresses in the text 535
A toolkit for extracting chemical information from the scientific literature. 468
442
NLP framework in python for entity recognition and relationship extraction 399
CRF tagger 380
Extract body text from Japanese business emails 380
A (fast) Khmer word segmentation toolkit. 357
Morphological Analyzer for Russian 💬 309
UK address utility based on machine learning and optimised search to parse, stan... 304
TextFlows taggers module 285
Python library for building custom AI chatbot with just one line of code. 225
Python package for keyphrase labeling. 205
Pycrfsuite를 이용한 띄어쓰기 교정기 204
chatbot_ner: Named Entity Recognition for chatbots. 197
150
Comprehensive tokenization library for Myanmar language 143
Deep Learning systems for training and testing disfluency detection and related ... 113
📜Probabilistic parser for tagging data that references the Illinois Compiled Sta... 80
Project for Vietnamese nlp 79
This is a package to translate a chinese sentence into bopomofo letters 78
fetch, munge, and parse résumés and job postings 75
Enhanced parser for medical professional names with advanced NLP methods, suppor... 61