PyPI Stats
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About
Home

Search Packages

Find Python packages by name, description, GitHub topic, or filter by metrics
RaRe-Technologies
gensim

Topic Modelling for Humans

5M 16K 4K
vi3k6i5
flashtext

Extract Keywords from sentence or Replace keywords in sentences.

2.3M 6K 598
natasha
navec

Compact high quality word embeddings for Russian language

51K 218 19
JasonKessler
scattertext

Beautiful visualizations of how language differs among document types.

20K 2K 286
pdrm83
sent2vec

How to encode sentences in a high-dimensional vector space, a.k.a., sentence embedding.

12K 135 12
vector-ai
vectorhub-nightly

One liner to encode data into vectors with state-of-the-art models using tensorflow, pytorch and other open source libraries. Word2Vec, Image2Vec, BERT, etc

7K 561 55
explosion
sense2vec

🦆 Contextually-keyed word vectors

4K 2K 237
shibing624
text2vec

text2vec, text to vector. 文本向量表征工具,把文本转化为向量矩阵,实现了Word2Vec、RankBM25、Sentence-BERT、CoSENT等文本表征、文本相似度计算模型,开箱即用。

4K 5K 427
stephantul
reach

Load embeddings and featurize your sentences.

3K 31 7
dsfsi
textaugment

TextAugment: Text Augmentation Library

2K 435 61
danielfrg
word2vec

Wrapper for Google word2vec

2K 3K 620
iomega
spec2vec

Word2Vec based similarity measure of mass spectrometry data.

2K 84 20
vector-ai
vectorhub

Vector Hub - Library for easy discovery, and consumption of State-of-the-art models to turn data into vectors. (text2vec, image2vec, video2vec, graph2vec, bert, inception, etc)

1K 561 55
amansrivastava17
embedding-as-service

embedding-as-service: one-stop solution to encode sentence to vectors using various embedding methods

1K 210 32
alibaba
pyalink

Alink is the Machine Learning algorithm platform based on Flink, developed by the PAI team of Alibaba computing platform.

1K 4K 788
ddbourgin
numpy-ml

Machine learning, in numpy

1K 16K 4K
vngrs-ai
vngrs-nlp

Turkish NLP Tools developed by VNGRS.

1K 287 17
nickduran
align

Python library for extracting quantitative, reproducible metrics of multi-level alignment between speakers in naturalistic language corpora.

1K 54 17
src-d
ast2vec

Part of source{d}'s stack for machine learning on source code. Provides API and tools to train and use models based on source code identifiers extracted from Babelfish's UASTs.

1K 141 44
bab2min
chronogram

Diachronic Word Embedding Model based on Word2vec Skip-gram with Chebyshev approximation

1K 13 0
alibaba
pyalink-flink-1-11

Alink is the Machine Learning algorithm platform based on Flink, developed by the PAI team of Alibaba computing platform.

849 4K 788
eggplants
aovec

Make Word2Vec from aozorabunko/aozorabunko

846 3 0
src-d
sourced-ml

sourced.ml is a library and command line tools to build and apply machine learning models on top of Universal Abstract Syntax Trees

834 141 44
smilelight
lightnlp

基于Pytorch和torchtext的自然语言处理深度学习框架。

805 835 209
    • Data from PyPI, GitHub, ClickHouse, and BigQuery