1,506 dependents
Package Description Downloads/month
Convert documents to structured data effortlessly. Unstructured is open-source E... 5.2M
An open-source framework for detecting, redacting, masking, and anonymizing sens... 4.5M
The fastai deep learning library 994K
A grading component for keyword-based scoring for resumes 548K
[DEPRECATED] Library to predict info types for DataHub 409K
spaCy pipelines for pre-trained BERT and other transformers 277K
NL to Gherkin format translation tool 268K
coqui-ai tts
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and p... 202K
Audiocraft is a library for audio processing and generation with deep learning. ... 122K
Set of vectorizers that extract keyphrases with part-of-speech patterns from a c... 85K
A modular graph-based Retrieval-Augmented Generation (RAG) system 82K
An open-source NLP research library, built on PyTorch. 72K
An open experiment: does developer sentiment with Claude Code vary by time of da... 72K
a big lib with many usefull tools and it are not only os and sys tools... 66K
A full spaCy pipeline and models for scientific/biomedical documents. 55K
init 45K
spaCy pipeline object for negating concepts in text 44K
Rust-accelerated Cypher query validator, generator, and NL-to-Cypher pipeline vi... 43K
A TextBlob sentiment analysis pipeline component for spaCy. 43K
Language detection using Spacy and Fasttext 43K
Generate karaoke videos, by downloading audio and lyrics, separating instrumenta... 31K
Algorithms for explaining machine learning models 30K
An open-source framework for detecting, redacting, masking, and anonymizing sens... 30K
A structured OCR pipeline designed for **layout-aware text extraction from compl... 24K
Robotic Process Automation by running BPMN diagram flows. 23K
Open Source Neural Machine Translation and (Large) Language Models in PyTorch 23K
✔️Contextual word checker for better suggestions (not actively maintained) 22K
LDaCA Web App - FastAPI backend with bundled production frontend for the Languag... 21K
ChatterBot is a machine learning, conversational dialog engine for creating chat... 21K
A simple FastAPI Server to run XTTSv2 20K
Beautiful visualizations of how language differs among document types. 20K
A package for detail image caption evaluation. 19K
Detect and extract locations from text or URL page 19K
Modular, fast NLP framework, compatible with Pytorch and spaCy, offering tailore... 19K
The runtime engine for MUXI formations. Runs locally, in the cloud, or embedded ... 17K
Fuzzy matching and more functionality for spaCy. 17K
An open-source framework for detecting, redacting, masking, and anonymizing sens... 14K
Library for clinical NLP with spaCy. 13K
A Python multilingual toolkit for Sentiment Analysis and Social NLP tasks 13K
Optimizing inference proxy for LLMs 12K
LLMOps Observability SDK: decorators + SQS dispatch with compression 12K
AI Operating System - Build your own AI using ontologies as the unifying field c... 11K
Agentic AI memory with Ebbinghaus forgetting curve decay. +16pp better recall th... 11K
A python interface to text-based adventure games. 11K
Extract IOCs from text. 10K
PixlStash is a Python-based image management, tagging and editing web app levera... 10K
🦙 Integrating LLMs into structured NLP pipelines 10K
AI Execute Services - A middleware framework for AI-powered task execution and t... 9K
A spaCy wrapper for GliNER 9K
Sentiment Analysis, Text Classification, Text Augmentation, Text Adversarial de... 8K