158 dependents
| Package | Description | Downloads/month |
|---|---|---|
| Web UI for training and running open models like Gemma 4, Qwen3.6, DeepSeek, gpt... | 1.9M | |
| Web UI for training and running open models like Gemma 4, Qwen3.6, DeepSeek, gpt... | 1.4M | |
| Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.6, DeepSeek-R1, ... | 171K | |
| This is an open-source version of the representation engineering framework for s... | 167K | |
| 🤗 AutoTrain Advanced | 34K | |
| Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024) | 29K | |
| Go ahead and axolotl questions | 20K | |
| InstructLab Training Library - Efficient Fine-Tuning with Message-Format Data | 16K | |
| Soup turns the pain of LLM fine-tuning into a simple workflow. One config, one c... | 11K | |
| Deep Learning Experiment | 7K | |
| InstructLab Core package. Use this to chat with a model and execute the Instruc... | 7K | |
| Official implementation of HPSv3: Towards Wide-Spectrum Human Preference Score (... | 6K | |
| 6K | ||
| FL Lifecycle Operations Management Platform | 6K | |
| Engram-PEFT: Efficient Parameter-Efficient Fine-Tuning with Engram | 6K | |
| Text2Text Language Modeling Toolkit | 6K | |
| Smash your AI models - Pro Version | 6K | |
| Brando's Ultimate Utils for Science, Machine Learning, and AI | 4K | |
| Lean LLM fine-tuning toolkit with EC2 orchestration for SFT, DPO, and LoRA train... | 4K | |
| Advanced Machine Learning Training Platform - IN DEVELOPMENT | 3K | |
| Foundation AI - Reinforcement Learning Library | 3K | |
| factory SDK | 3K | |
| Automated Hyperparameter Optimization Platform for Efficient LLM Fine-Tuning | 3K | |
| Smash your AI models | 3K | |
| ADyFT(Auto Dynamic Fine Tuning) automates parameter-efficient fine-tuning of Lar... | 3K | |
| Easily fine-tune, evaluate and deploy gpt-oss, Qwen3, DeepSeek-R1, or any open s... | 2K | |
| Roboreason package | 2K | |
| A framework for optimizing DSPy programs with RL | 2K | |
| Headless LLM fine-tuning in 3 lines — smart defaults, VRAM-aware batch sizing, m... | 2K | |
| A no-code toolkit to finetune LLMs on your local GPU—just upload data, pick a ta... | 2K | |
| Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024) | 2K | |
| 一个将原始语料库变成微调需要的alpaca数据集格式的工具 | 2K | |
| TextRL - reinforcement learning for text generation, built on HuggingFace TRL. | 2K | |
| Agent-as-Annotators: Structured Distillation of Web Agent Capabilities | 1K | |
| LLM fine-tuning and alignment framework for the Kailash platform | 1K | |
| Config-driven, YAML-first open-source LLM training platform. Fine-tune language ... | 1K | |
| Config-driven LLM fine-tuning with safety evaluation, EU AI Act compliance, 6 al... | 1K | |
| This is a helper library to push data to HuggingFace. | 1K | |
| FMS HF Tuning | 1K | |
| Nexuss Transformer Framework (NTF) - Blank Slate LLM Training with RLHF & EthioB... | 1K | |
| 1K | ||
| Huggingface bolts for geniusrise | 1K | |
| Model-agnostic edge deployment analysis framework — PLE memory analysis, TurboQu... | 1K | |
| LeanDojo-v2 is an end-to-end framework for training, evaluating, and deploying A... | 1K | |
| Official Konic Agent Development Toolkit | 980 | |
| Official implementation for "ALI-Agent: Assessing LLMs'Alignment with Human Valu... | 923 | |
| python library based on transformers for transfer learning | 908 | |
| An external provider for Llama Stack allowing for the use of RamaLama for infere... | 860 | |
| Fine-tune LFM 2.5 1.2B for coding tasks on Kaggle multi-GPU with auto-publish to... | 813 | |
| Experiential Reinforcement Learning (ERL) — a thin wrapper on HuggingFace TRL's ... | 772 |