PyPI Stats
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About
Home

Search Packages

Find Python packages by name, description, GitHub topic, or filter by metrics
brianrisk
simphile

Python Text Similarity NLP Libray

7K 37 6
shibing624
text2vec

text2vec, text to vector. 文本向量表征工具,把文本转化为向量矩阵,实现了Word2Vec、RankBM25、Sentence-BERT、CoSENT等文本表征、文本相似度计算模型,开箱即用。

4K 5K 427
izikeros
sentence-plagiarism

Compare sentences from input document with all sentences from reference documents - find very similar ones.

597 3 0
lonePatient
torchblocks-chen

A PyTorch-based toolkit for natural language processing

555 160 27
Lipairui
textgo

Let's go and play with text!

451 45 3
sheriff1max
recs-searcher

Search engine and registry error corrector

429 2 0
easonanalytica
company-name-matcher

A library for matching and comparing company names using a fine-tuned sentence transformer model

419 9 1
justinbt1
akin

Python library for detecting near duplicate texts in a corpus at scale.

354 9 0
giacbrd
dandelion-eu

A python client for connecting to all the services provided by https://dandelion.eu

255 35 15
IDEA-CCNL
gts-engine

GTS Engine: A powerful NLU Training System。GTS引擎(GTS-Engine)是一款开箱即用且性能强大的自然语言理解引擎,聚焦于小样本任务,能够仅用小样本就能自动化生产NLP模型。

214 93 10
bstoilov
pysemantics

Free Python client, that utilizes the digitalowl.org NLP API.

213 9 2
lonePatient
torchblocks

A PyTorch-based toolkit for natural language processing

148 160 27
yongzhuo
near-synonym

near-synonym, 中文反义词/近义词(antonym/synonym)工具包.

129 31 3
yongzhuo
char-similar

字符相似度, 汉字字形/拼音/语义相似度(单字, 可用于数据增强, CSC错别字检测识别任务(构建混淆集)) Chinese character font/pinyin/semantic similarity (single character, can be used for data augmentation, CSC misclassified character detection and recognition tasks (building confusion sets))

95 22 3
kiwirafe
xiangsi

中文文本相似度计算器

83 170 23
chigwell
compario

A new package that uses large language models and pattern matching to perform structured similarity comparisons between textual content based on normalized compression distance. Users provide multiple

80 1 0
VietHoang1512
qs-kpa

Quantitative Summarization – Key Point Analysis

63 12 1
adhaamehab
textblob-ar-mk

Arabic language extension for TextBlob.

63 86 24
kiwirafe
xiangshi

中文文本相似度计算器

13 170 23
    • Data from PyPI, GitHub, ClickHouse, and BigQuery