PyPI Stats
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About
Home

Search Packages

Find Python packages by name, description, GitHub topic, or filter by metrics
WorksApplications
sudachipy

Sudachi in Rust 🦀 and new generation of SudachiPy

1.9M 442 50
WorksApplications
sudachidict-core

A lexicon for Sudachi

1.8M 293 20
WorksApplications
sudachidict-full

A lexicon for Sudachi

600K 293 20
bab2min
kiwipiepy

Python API for Kiwi

252K 375 33
bab2min
kiwipiepy-model

Python API for Kiwi

149K 375 33
pysal
momepy

Urban Morphology Measuring Toolkit

132K 605 69
WorksApplications
sudachidict-small

A lexicon for Sudachi

97K 293 20
adbar
simplemma

Simple multilingual lemmatizer for Python, especially useful for speed and efficiency

90K 195 15
nlpub
pymystem3

A Python wrapper of the Yandex Mystem 3.1 morphological analyzer (http://api.yandex.ru/mystem). The original tool is shipped as a binary and this library makes it easy to integrate it in Python projects. Let us know in the issues if you would like to be involved into the developments or maintenance of this project. If you have any fix or suggestion, please make a pull request. We are very open to accepting any contributions.

38K 293 43
CAMeL-Lab
camel-tools

A suite of Arabic natural language processing tools developed by the CAMeL Lab at New York University Abu Dhabi.

24K 548 89
zentrum-lexikographie
sfst-transduce

Python bindings for SFST focusing on transducer usage

6K 4 0
fergusq
kfst-rs

The accelerated companion library to kfst

4K 7 1
huspacy
huspacy

HuSpaCy: industrial-strength Hungarian natural language processing

3K 182 18
StarlangSoftware
nlptoolkit-morphologicalanalysis

Turkish Morphological Analysis

3K 23 19
obulat
zeyrek

Python morphological analyzer for Turkish language. Partial port of ZemberekNLP.

3K 65 9
huspacy
huspacy-nightly

HuSpaCy: industrial-strength Hungarian natural language processing

3K 182 18
daac-tools
vaporetto

🛥 Vaporetto is a fast and lightweight pointwise prediction based tokenizer. This is a Python wrapper for Vaporetto.

2K 21 1
TheWelcomer
morphseg

An efficient and easy-to-use morpheme segmentation library

2K 2 0
mikahama
uralicnlp

An NLP library for Uralic languages such as Finnish and Sami. Also supports Spanish, Arabic, Russian etc.

2K 94 7
fergusq
kfst

Pure-Python Finite State Transducers – monorepo for KFST, PyOmorfi, and PyVoikko

1K 7 1
yro7
panini-lang

a linguistic analysis/feature extraction framework, LLM-powered

1K 0 0
zentrum-lexikographie
dwdsmor

SFST/SMOR/DWDS-based German Morphology

1K 21 1
vngrs-ai
vngrs-nlp

Turkish NLP Tools developed by VNGRS.

1K 287 17
giakou4
pyfeats

[GitHub 2021] Open source software for image feature extraction.

1K 245 30
    • Data from PyPI, GitHub, ClickHouse, and BigQuery