2,448 dependents
Package Description Downloads/month
LlamaIndex is the leading document agent and OCR platform 10.4M
LlamaIndex is the leading document agent and OCR platform 7.1M
Safety checks Python dependencies for known security vulnerabilities and suggest... 5.9M
The Python CDK empowers hundreds of Airbyte connectors, including low-code and n... 3.5M
Simple, Pythonic, text processing--Sentiment analysis, part-of-speech tagging, n... 2.1M
🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join ... 1.5M
Open WebUI 1.3M
Evidently is ​​an open-source ML and LLM observability framework. Evaluate, test... 1.2M
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarizatio... 1.1M
:memo: python package to calculate readability statistics of a text object - par... 1M
newspaper3k is a news, full-text, and article metadata extraction in Python 3. A... 999K
Label Studio SDK 899K
Fast and Accurate ML in 3 Lines of Code 859K
LlamaIndex is the leading document agent and OCR platform 754K
Open Source framework for voice and multimodal conversational AI 677K
g2p: English Grapheme To Phoneme Conversion 620K
A grading component for keyword-based scoring for resumes 548K
This repository is for active development of the Azure SDK for Python. For consu... 381K
The Security Toolkit for LLM Interactions 307K
g2p ID: Indonesian Grapheme-to-Phoneme Converter 306K
Python implementation of the Rapid Automatic Keyword Extraction algorithm using ... 275K
Let language models run code 253K
coqui-ai tts
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and p... 202K
Text utilities library by Pinecone.io 186K
A library of components to help agent builders boost their agent performance (to... 175K
Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.6, DeepSeek-R1, ... 171K
Module for automatic summarization of text documents and HTML pages. 151K
A Collection of Competitive Text-Based Games for Language Model Evaluation and R... 148K
Create LLM agents with long-term memory and custom tools 143K
Harness LLMs with Multi-Agent Programming 108K
Efficient and multi-language generation from context free or sensitive grammars ... 97K
A RL env with procedurally generated symbolic reasoning data 96K
Training Sparse Autoencoders on Language Models 92K
A helper library for chemistry calculations,used by the edx-platform 87K
"A Python library for the Demisto SDK" 87K
Set of vectorizers that extract keyphrases with part-of-speech patterns from a c... 85K
A modular graph-based Retrieval-Augmented Generation (RAG) system 82K
QuerySource is a tool for querying different databases (or REST endpoints) using... 77K
An autonomous agent that conducts deep research on any data using any LLM provid... 74K
Framework for Task orchestration 73K
the LLM vulnerability scanner 73K
An open-source NLP research library, built on PyTorch. 72K
a big lib with many usefull tools and it are not only os and sys tools... 66K
Deep learning framework 62K
Pythonic interface to Ansys Fluent 62K
Evaluation and Tracking for LLM Experiments and AI Agents 58K
Cleaning tool for web scraped text 55K
[EMNLP'23, ACL'24] To speed up LLMs' inference and enhance LLM's perceive of key... 54K
Create LLM agents with long-term memory and custom tools 52K
49K