158 dependents
Package Description Downloads/month
Web UI for training and running open models like Gemma 4, Qwen3.6, DeepSeek, gpt... 1.9M
Web UI for training and running open models like Gemma 4, Qwen3.6, DeepSeek, gpt... 1.4M
Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.6, DeepSeek-R1, ... 171K
This is an open-source version of the representation engineering framework for s... 167K
🤗 AutoTrain Advanced 34K
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024) 29K
Go ahead and axolotl questions 20K
InstructLab Training Library - Efficient Fine-Tuning with Message-Format Data 16K
Soup turns the pain of LLM fine-tuning into a simple workflow. One config, one c... 11K
Deep Learning Experiment 7K
InstructLab Core package. Use this to chat with a model and execute the Instruc... 7K
Official implementation of HPSv3: Towards Wide-Spectrum Human Preference Score (... 6K
6K
FL Lifecycle Operations Management Platform 6K
Engram-PEFT: Efficient Parameter-Efficient Fine-Tuning with Engram 6K
Text2Text Language Modeling Toolkit 6K
Smash your AI models - Pro Version 6K
Brando's Ultimate Utils for Science, Machine Learning, and AI 4K
Lean LLM fine-tuning toolkit with EC2 orchestration for SFT, DPO, and LoRA train... 4K
Advanced Machine Learning Training Platform - IN DEVELOPMENT 3K
Foundation AI - Reinforcement Learning Library 3K
factory SDK 3K
Automated Hyperparameter Optimization Platform for Efficient LLM Fine-Tuning 3K
Smash your AI models 3K
ADyFT(Auto Dynamic Fine Tuning) automates parameter-efficient fine-tuning of Lar... 3K
Easily fine-tune, evaluate and deploy gpt-oss, Qwen3, DeepSeek-R1, or any open s... 2K
Roboreason package 2K
A framework for optimizing DSPy programs with RL 2K
Headless LLM fine-tuning in 3 lines — smart defaults, VRAM-aware batch sizing, m... 2K
A no-code toolkit to finetune LLMs on your local GPU—just upload data, pick a ta... 2K
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024) 2K
一个将原始语料库变成微调需要的alpaca数据集格式的工具 2K
TextRL - reinforcement learning for text generation, built on HuggingFace TRL. 2K
Agent-as-Annotators: Structured Distillation of Web Agent Capabilities 1K
LLM fine-tuning and alignment framework for the Kailash platform 1K
Config-driven, YAML-first open-source LLM training platform. Fine-tune language ... 1K
Config-driven LLM fine-tuning with safety evaluation, EU AI Act compliance, 6 al... 1K
This is a helper library to push data to HuggingFace. 1K
FMS HF Tuning 1K
Nexuss Transformer Framework (NTF) - Blank Slate LLM Training with RLHF & EthioB... 1K
1K
Huggingface bolts for geniusrise 1K
Model-agnostic edge deployment analysis framework — PLE memory analysis, TurboQu... 1K
LeanDojo-v2 is an end-to-end framework for training, evaluating, and deploying A... 1K
Official Konic Agent Development Toolkit 980
Official implementation for "ALI-Agent: Assessing LLMs'Alignment with Human Valu... 923
python library based on transformers for transfer learning 908
An external provider for Llama Stack allowing for the use of RamaLama for infere... 860
Fine-tune LFM 2.5 1.2B for coding tasks on Kaggle multi-GPU with auto-publish to... 813
Experiential Reinforcement Learning (ERL) — a thin wrapper on HuggingFace TRL's ... 772