Topic Modeling Python Packages

Python packages tagged with the GitHub topic "topic-modeling", sorted by relevance and shown with stars and monthly downloads.
RaRe-Technologies
gensim

Topic Modelling for Humans

5.1M 16K 4K
MaartenGr
bertopic

Leveraging BERT and c-TF-IDF to create easily interpretable topics.

388K 8K 893
nomic-ai
nomic

Nomic Developer API SDK

51K 2K 197
JasonKessler
scattertext

Beautiful visualizations of how language differs among document types.

20K 2K 286
bab2min
tomotopy

Python package of Tomoto, the Topic Modeling Tool

14K 594 65
ddangelov
top2vec

Top2Vec learns jointly embedded topic, document and word vectors.

8K 3K 377
gregversteeg
corextopic

Hierarchical unsupervised and semi-supervised topic models for sparse count data with CorEx

3K 640 118
MIND-LAB
octis

OCTIS: Comparing Topic Models is Simple! A python package to optimize and evaluate topic models (accepted at EACL2021 demo track)

3K 800 118
MilaNLProc
contextualized-topic-models

A python package to run contextualized topic modeling. CTMs combine contextualized embeddings (e.g., BERT) with topic models to get coherent topics. Published at EACL and ACL 2021 (Bianchi et al.).

3K 1K 151
stephenhky
shorttext

Various Algorithms for Short Text Mining

2K 471 74
bobxwu
topmost

A Topic Modeling System Toolkit (ACL 2024 Demo)

2K 288 26
ContextLab
hypertools

A python package for visualizing and manipulating high-dimensional data

2K 2K 162
demetrius-mp
sesg

SeSG (Search String Generator) python package repository.

2K 1 0
mortazavilab
topyfic

Topyfic: Reproducible latent dirichlet allocation (LDA) using leiden clustering and harmony for single cell epigenomics data

2K 11 1
bobxwu
fastopic

A Fast, Adaptive, Stable, and Transferable Topic Model (NeurIPS 2024)

2K 153 13
bigartm
bigartm

Fast topic modeling platform

1K 673 121
raphschlatt
ads-bib

Pipeline for querying and turning NASA's ADS publications metadata into curated, analysis-ready datasets, topic maps, and citation networks.

1K 1 0
ddbourgin
numpy-ml

Machine learning, in numpy

1K 16K 4K
drob-xx
topicmodeltuner

HDBSCAN Tuning for BERTopic Models

1K 52 3
maximtrp
bitermplus

Biterm Topic Model (BTM): modeling topics in short texts

1K 85 15
charlesdedampierre
bunkatopics

Bunkatopics is a Topic Modeling package and Exploration Module

977 199 21
FedericoCinus
womg-core

WoMG: Word of Mouth Generator

925 2 0
emirkyz
manta-topic-modelling

Comprehensive topic modeling system using Non-negative Matrix Factorization (NMF)

918 3 1
yaniv-shulman
chunkey-bert

Modification of the KeyBERT method to extract keywords and keyphrases using chunks. This provides better results, especially when handling long documents.

899 1 0
Data from PyPI, GitHub, ClickHouse, and BigQuery