2,210 dependents
Package Description Downloads/month
SGLang is a high-performance serving framework for large language models and mul... 287.7M
SWE-bench: Can Language Models Resolve Real-world Github Issues? 38.3M
huggingface trl
Train transformer language models with reinforcement learning. 3.8M
MTEB: Massive Text Embedding Benchmark 2.7M
Our library for RL environments + evals 2.3M
The 100 line AI agent that solves GitHub issues or helps you in your command lin... 2M
Web UI for training and running open models like Gemma 4, Qwen3.6, DeepSeek, gpt... 1.9M
Web UI for training and running open models like Gemma 4, Qwen3.6, DeepSeek, gpt... 1.4M
A framework for few-shot evaluation of language models. 1.4M
Supercharge Your LLM Application Evaluations πŸš€ 1.3M
A DataSource for reading and writing HuggingFace Datasets in Spark 1M
A framework for evaluating and optimizing agents and models using sandboxed envi... 775K
πŸ”Ž πŸ–ΌοΈ πŸ”₯PyTorch Toolbox for Image Quality Assessment, including PSNR, SSIM, LPIPS,... 487K
Retrieval and Retrieval-augmented LLMs 425K
PyTorch native post-training library 405K
Scalable SWE datasets 356K
MLX-VLM is a package for inference and fine-tuning of Vision Language Models (VL... 349K
AIPerf is a package for performance testing of AI models 285K
Transformers-compatible library for applying various compression algorithms to L... 285K
The Argilla python server SDK 277K
ColBERT: state-of-the-art neural search (SIGIR'20, TACL'21, NeurIPS'21, NAACL'22... 233K
Efficient few-shot learning with Sentence Transformers 214K
πŸ€— LeRobot: Making AI for Robotics more accessible with end-to-end learning 204K
Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.6, DeepSeek-R1, ... 171K
This is an open-source version of the representation engineering framework for s... 167K
A Python library for defining, testing, and using reward functions 144K
Therapeutics Commons (TDC): Multimodal Foundation for Therapeutic Science 127K
VoxCPM2: Tokenizer-Free TTS for Multilingual Speech Generation, Creative Voice D... 122K
AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup du... 122K
verl/HybridFlow: A Flexible and Efficient RL Post-Training Framework 122K
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech wi... 104K
A framework offers an OS simulator within a Python Code Interface for AI Agents 96K
Collection of evals for Inspect AI 96K
Training Sparse Autoencoders on Language Models 92K
An implementation of transformers tailored for mechanistic interpretability. 89K
The Python Risk Identification Tool for LLMs (PyRIT) is a library used to assess... 77K
An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ ... 75K
the LLM vulnerability scanner 73K
Post-training with Tinker 73K
Transformers for Information Retrieval, Text Classification, NER, QA, Language M... 73K
Late Interaction Models Training & Retrieval 71K
Interact with the Databricks Generative AI APIs in python 71K
A SOTA quantization algorithm for high-accuracy low-bit LLM inference, seamlessl... 71K
A Lightweight LLM Post-Training Library 68K
Topic modeling neural toolkit 63K
Debug, evaluate, and monitor your LLM applications, RAG systems, and agentic wor... 60K
Dreadnode Strikes SDK 59K
A PyTorch native platform for training generative AI models 54K
Pyserini is a Python toolkit for reproducible information retrieval research wit... 50K
LGPL contributions to the axolotl framework 49K