5,648 dependents
| Package | Description | Downloads/month |
|---|---|---|
| SGLang is a high-performance serving framework for large language models and mul... | 287.7M | |
| State-of-the-Art Text Embeddings | 25.7M | |
| π€ PEFT: State-of-the-art Parameter-Efficient Fine-Tuning. | 10.8M | |
| A high-throughput and memory-efficient inference and serving engine for LLMs | 9.4M | |
| Fast, Flexible and Portable Structured Generation | 6.4M | |
| A safetensors extension to efficiently store sparse quantized tensors on disk | 6.3M | |
| Train transformer language models with reinforcement learning. | 3.8M | |
| MTEB: Massive Text Embedding Benchmark | 2.7M | |
| This package contains the AI models used by the Docling PDF conversion package | 2.4M | |
| Web UI for training and running open models like Gemma 4, Qwen3.6, DeepSeek, gpt... | 1.9M | |
| π Accelerate inference and training of π€ Transformers, Diffusers, TIMM and Sente... | 1.7M | |
| Run LLMs with MLX | 1.5M | |
| Web UI for training and running open models like Gemma 4, Qwen3.6, DeepSeek, gpt... | 1.4M | |
| A framework for building realtime voice AI agents π€ποΈπΉ | 1.4M | |
| Open WebUI | 1.3M | |
| A library for performing inference using trained models. | 1.1M | |
| WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarizatio... | 1.1M | |
| A PyTorch-native inference engine with cache, parallelism, quantization for Diff... | 1M | |
| Chronos: Pretrained Models for Time Series Forecasting | 928K | |
| Fast and Accurate ML in 3 Lines of Code | 914K | |
| Fast and Accurate ML in 3 Lines of Code | 859K | |
| OCR, layout analysis, reading order, table recognition in 90+ languages | 797K | |
| Open Source framework for voice and multimodal conversational AI | 677K | |
| ComfyUI-Manager is an extension designed to enhance the usability of ComfyUI. It... | 629K | |
| Convert PDF to markdown + JSON quickly with high accuracy | 566K | |
| Generalist and Lightweight Model for Named Entity Recognition (Extract any entit... | 556K | |
| A Data Streaming Library for Efficient Neural Network Training | 515K | |
| Training API and CLI | 511K | |
| π πΌοΈ π₯PyTorch Toolbox for Image Quality Assessment, including PSNR, SSIM, LPIPS,... | 487K | |
| π€ Optimum ONNX: Export your model to ONNX and run inference with ONNX Runtime | 441K | |
| Retrieval and Retrieval-augmented LLMs | 425K | |
| π Efficient implementations for emerging model architectures | 400K | |
| GenAI Perf Analyzer CLI - CLI tool to simplify profiling LLMs and Generative AI ... | 366K | |
| MLX-VLM is a package for inference and fine-tuning of Vision Language Models (VL... | 349K | |
| cortical is the framework for building fabric architectures | 325K | |
| https://hf.co/hexgrad/Kokoro-82M | 312K | |
| The Security Toolkit for LLM Interactions | 307K | |
| πΈπ¬ - a deep learning toolkit for Text-to-Speech, battle-tested in research and p... | 292K | |
| AIPerf is a package for performance testing of AI models | 285K | |
| Transformers-compatible library for applying various compression algorithms to L... | 285K | |
| spaCy pipelines for pre-trained BERT and other transformers | 277K | |
| A Neural Framework for MT Evaluation | 272K | |
| ColBERT: state-of-the-art neural search (SIGIR'20, TACL'21, NeurIPS'21, NAACL'22... | 233K | |
| 228K | ||
| ReMe: Memory Management Kit for Agents - Remember Me, Refine Me. | 226K | |
| [ICLR 2026] RF-DETR is a real-time object detection and segmentation model archi... | 222K | |
| Efficient few-shot learning with Sentence Transformers | 214K | |
| A library integrating embedding and reranker models from OpenAI, SentenceTransfo... | 210K | |
| Qwen-TTS python package | 209K | |
| πΈπ¬ - a deep learning toolkit for Text-to-Speech, battle-tested in research and p... | 202K |