5,648 dependents
Package Description Downloads/month
SGLang is a high-performance serving framework for large language models and mul... 287.7M
State-of-the-Art Text Embeddings 25.7M
πŸ€— PEFT: State-of-the-art Parameter-Efficient Fine-Tuning. 10.8M
A high-throughput and memory-efficient inference and serving engine for LLMs 9.4M
Fast, Flexible and Portable Structured Generation 6.4M
A safetensors extension to efficiently store sparse quantized tensors on disk 6.3M
huggingface trl
Train transformer language models with reinforcement learning. 3.8M
MTEB: Massive Text Embedding Benchmark 2.7M
This package contains the AI models used by the Docling PDF conversion package 2.4M
Web UI for training and running open models like Gemma 4, Qwen3.6, DeepSeek, gpt... 1.9M
πŸš€ Accelerate inference and training of πŸ€— Transformers, Diffusers, TIMM and Sente... 1.7M
Run LLMs with MLX 1.5M
Web UI for training and running open models like Gemma 4, Qwen3.6, DeepSeek, gpt... 1.4M
A framework for building realtime voice AI agents πŸ€–πŸŽ™οΈπŸ“Ή 1.4M
Open WebUI 1.3M
A library for performing inference using trained models. 1.1M
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarizatio... 1.1M
A PyTorch-native inference engine with cache, parallelism, quantization for Diff... 1M
Chronos: Pretrained Models for Time Series Forecasting 928K
Fast and Accurate ML in 3 Lines of Code 914K
Fast and Accurate ML in 3 Lines of Code 859K
OCR, layout analysis, reading order, table recognition in 90+ languages 797K
Open Source framework for voice and multimodal conversational AI 677K
ComfyUI-Manager is an extension designed to enhance the usability of ComfyUI. It... 629K
Convert PDF to markdown + JSON quickly with high accuracy 566K
Generalist and Lightweight Model for Named Entity Recognition (Extract any entit... 556K
A Data Streaming Library for Efficient Neural Network Training 515K
Training API and CLI 511K
πŸ”Ž πŸ–ΌοΈ πŸ”₯PyTorch Toolbox for Image Quality Assessment, including PSNR, SSIM, LPIPS,... 487K
πŸ€— Optimum ONNX: Export your model to ONNX and run inference with ONNX Runtime 441K
Retrieval and Retrieval-augmented LLMs 425K
πŸš€ Efficient implementations for emerging model architectures 400K
GenAI Perf Analyzer CLI - CLI tool to simplify profiling LLMs and Generative AI ... 366K
MLX-VLM is a package for inference and fine-tuning of Vision Language Models (VL... 349K
cortical is the framework for building fabric architectures 325K
https://hf.co/hexgrad/Kokoro-82M 312K
The Security Toolkit for LLM Interactions 307K
πŸΈπŸ’¬ - a deep learning toolkit for Text-to-Speech, battle-tested in research and p... 292K
AIPerf is a package for performance testing of AI models 285K
Transformers-compatible library for applying various compression algorithms to L... 285K
spaCy pipelines for pre-trained BERT and other transformers 277K
A Neural Framework for MT Evaluation 272K
ColBERT: state-of-the-art neural search (SIGIR'20, TACL'21, NeurIPS'21, NAACL'22... 233K
228K
ReMe: Memory Management Kit for Agents - Remember Me, Refine Me. 226K
[ICLR 2026] RF-DETR is a real-time object detection and segmentation model archi... 222K
Efficient few-shot learning with Sentence Transformers 214K
A library integrating embedding and reranker models from OpenAI, SentenceTransfo... 210K
Qwen-TTS python package 209K
coqui-ai tts
πŸΈπŸ’¬ - a deep learning toolkit for Text-to-Speech, battle-tested in research and p... 202K