Dependents of fasttext

75 dependents

Package	Description	Downloads/month
string2string	String-to-String Algorithms for Natural Language Processing	30K
pptagent	An Agentic Framework for Reflective PowerPoint Generation	7K
utilities-nlp		7K
open-dataflow	Modern Data Centric AI system for Large Language Models	3K
activetigger	ActiveTigger in Python	2K
ovos-lang-detector-fasttext-plugin	average plugin classifications for language detection	2K
mc-providers	Search Providers package for Mediacloud	2K
citeindex	Ingest sources with proper citation — PDF, URL, media, Office, DJVU	1K
mmf	mmf: a modular framework for vision and language multimodal research.	1K
aamraz	Aamraz which is written "ئامراز" in kurdish script means "instrument". This proj...	1K
wenbi	A simple tool to make the video, audio, subtitle and video-url (especially youtu...	1K
carte-ai	Repository for CARTE: Context-Aware Representation of Table Entries	929
nlu-inference	nlu模型推理服务	899
x-voice	X-Voice	571
nlp-automl	AutoML library for solving only text -> label task'	548
ir-axioms	Axiomatic constraints for information retrieval and retrieval-augmented generati...	534
tozatext	TozaText is a cleaning library for preprocessing raw Uzbek and multilingual text...	518
freestylo	Stylistic Device Detection Tool	488
pandas-nlp	Pandas extension with NLP functionalities	455
izihawa-nlptools		437
xtranslator	A package for translating text and detecting languages	417
babelvec	Position-aware, cross-lingually aligned word embeddings built on FastText	414
orange3-nlp	A collection of Orange3 widgets to perform natural language processing	394
chatmemorydb		387
sister	SISTER (SImple SenTence EmbeddeR)	380
cite-extractor	Extracts citations from PDF, URLs and local media files in CSL-JSON.	369
quantize-fasttext	量化fasttext并测试其性能	363
extractor-api-lib	Template for AI chatbots & document management using Retrieval-Augmented Generat...	356
tarte-ai	Repository for TARTE: Transformer Augmented Representation of Table Entries	352
anyclassifier	One Line To Build Any Classifier Without Data	323
fasttext-shop	FastText_Shop是一个基于FastText和结巴分词的短文本分类工具，特点是高效易用，同时支持中文和英文语料。基本使用方法、灵感来自TextGroce...	322
pat-cli		305
discriminative-lexicon-model	Python-implementation of Discriminative Lexicon Model / Linear Discriminative Le...	278
multilang-probe	A Python package for analyzing multilingual text.	272
nlpclean	Utilities for cleaning up text corpus	255
tinybee	dualtext alignment making use of a remote API for embedding	253
wechsel	Code for WECHSEL: Effective initialization of subword embeddings for cross-lingu...	235
deepfocus	Offcial Python implementation of "FOCUS: Effective Embedding Initialization for ...	231
data-modori	LMOps Tool for Korean	223
onomancer	Lookup and/or predict gender of given first name.	213
language-detector-api		211
document-classification	Awesome document classifcation - Implementation of major techniques	202
whatlangid	This project is build on top of whatthelang and langid	194
intelli3text	Ingestion (web/PDF/DOCX/TXT), cleaning, paragraph-level LID (PT/EN/ES), and spaC...	193
invisible-rabbit	Scalable Data Preprocessing Tool for Training Large Language Models	185
d3lta	A library for detecting verbatim-duplicated contents within a vast amount of doc...	179
ftlid	A small and fast language identification model powered by fastText	178
tailors-fast		172
vettavista-backend	Browser-integrated LinkedIn companion offering intelligent job filtering alongsi...	161
sygra	SyGra - Graph-oriented Synthetic data generation Pipeline	158