PyPI Stats
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About
Home

Search Packages

Find Python packages by name, description, GitHub topic, or filter by metrics
wikimedia
sentencex

A sentence segmentation library with wide language support optimized for speed and utility.

151K 124 15
natasha
razdel

Rule-based token, sentence segmentation for Russian language

99K 281 34
natasha
natasha

Solves basic Russian NLP tasks, API for lower level Natasha projects

49K 1K 116
segment-any-text
wtpsplit

Toolkit to segment text into sentences or other semantic units in a robust, efficient and adaptable way.

31K 1K 83
superlinear-ai
wtpsplit-lite

✂️ Sentence segmentation with wtpsplit's state-of-the-art Segment any Text (SaT) models

15K 38 4
StarlangSoftware
nlptoolkit-corpus

Corpus processing library

2K 3 9
zaemyung
sentsplit

A flexible sentence segmentation library using CRF model and regex rules

2K 31 10
craigtrim
fast-sentence-segment

Fast and Efficient Sentence Segmentation

2K 2 0
nlp-uoregon
trankit

Trankit is a Light-Weight Transformer-based Python Toolkit for Multilingual Natural Language Processing

2K 795 108
mkartawijaya
hasami

A tool to perform sentence segmentation on Japanese text

1K 6 0
hellonlp
hellonlp

NLP tools, word segmentation, sentence segmentation, New-Word-Discovery,新词发现

741 27 9
StarlangSoftware
nlptoolkit-corpus-cy

Corpus Processing Library

728 0 0
veldica
prose-tokenizer

High-precision prose and Markdown tokenization for natural language processing.

686 1 0
mawo-ru
mawo-razdel

Продвинутая токенизация для русского языка с SynTagRus паттернами

501 11 0
Okramjimmy
meitei-senter

Neural sentence boundary detection for Meitei Mayek (Manipuri) using SentencePiece tokenization and a CNN-based spaCy pipeline.

292 0 0
bureaucratic-labs
b-labs-models

Pre-trained models for tokenization, sentence segmentation and so on

281 15 5
seanghay
khmerpunctuate

Punctuation Restoration for Khmer language

207 5 1
tc64
spacyss

sentence segmenters for spacy2.0+

155 9 1
mkartawijaya
py-hasami

Sentence segmentation for japanese text

79 6 0
eaklykova
syntaxcomp

A package for extracting syntactic complexity measures from CoNLL-U annotations.

65 4 2
segment-any-text
wtpsplit-triton

Toolkit to segment text into sentences or other semantic units in a robust, efficient and adaptable way.

64 1K 83
    • Data from PyPI, GitHub, ClickHouse, and BigQuery