PyPI Stats

Sentence Segmentation Python Packages

Python packages with the GitHub topic sentence-segmentation. Sorted by relevance, with stars and monthly downloads.
wikimedia
sentencex

A sentence segmentation library with wide language support optimized for speed and utility.

165K 124 15
natasha
razdel

Rule-based token and sentence segmentation for the Russian language

99K 281 34
natasha
natasha

Solves basic Russian NLP tasks, API for lower level Natasha projects

49K 1K 116
segment-any-text
wtpsplit

Toolkit to segment text into sentences or other semantic units in a robust, efficient and adaptable way.

31K 1K 83
superlinear-ai
wtpsplit-lite

✂️ Sentence segmentation with wtpsplit's state-of-the-art Segment any Text (SaT) models

16K 38 4
StarlangSoftware
nlptoolkit-corpus

Corpus processing library

2K 3 9
zaemyung
sentsplit

A flexible sentence segmentation library using CRF model and regex rules

2K 31 10
craigtrim
fast-sentence-segment

Fast and Efficient Sentence Segmentation

2K 2 0
nlp-uoregon
trankit

Trankit is a Light-Weight Transformer-based Python Toolkit for Multilingual Natural Language Processing

2K 795 108
mkartawijaya
hasami

A tool to perform sentence segmentation on Japanese text

1K 6 0
StarlangSoftware
nlptoolkit-corpus-cy

Corpus Processing Library

897 0 0
hellonlp
hellonlp

NLP tools: word segmentation, sentence segmentation, and new-word discovery

821 27 9
veldica
prose-tokenizer

High-precision prose and Markdown tokenization for natural language processing.

756 1 0
mawo-ru
mawo-razdel

Advanced tokenization for Russian with SynTagRus patterns

498 11 0
bureaucratic-labs
b-labs-models

Pre-trained models for tokenization, sentence segmentation and so on

317 15 5
Okramjimmy
meitei-senter

Neural sentence boundary detection for Meitei Mayek (Manipuri) using SentencePiece tokenization and a CNN-based spaCy pipeline.

294 0 0
seanghay
khmerpunctuate

Punctuation Restoration for Khmer language

206 5 1
tc64
spacyss

Sentence segmenters for spaCy 2.0+

171 9 1
mkartawijaya
py-hasami

Sentence segmentation for Japanese text

82 6 0
segment-any-text
wtpsplit-triton

Toolkit to segment text into sentences or other semantic units in a robust, efficient and adaptable way.

76 1K 83
eaklykova
syntaxcomp

A package for extracting syntactic complexity measures from CoNLL-U annotations.

67 4 2
Data from PyPI, GitHub, ClickHouse, and BigQuery