Dependents of trl - PyPI Stats

158 dependents

Package	Description	Downloads/month
unsloth	Web UI for training and running open models like Gemma 4, Qwen3.6, DeepSeek, gpt...	1.9M
unsloth-zoo	Web UI for training and running open models like Gemma 4, Qwen3.6, DeepSeek, gpt...	1.4M
ms-swift	Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.6, DeepSeek-R1, ...	171K
wisent	This is an open-source version of the representation engineering framework for s...	167K
autotrain-advanced	🤗 AutoTrain Advanced	34K
llamafactory	Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)	29K
axolotl	Go ahead and axolotl questions	20K
instructlab-training	InstructLab Training Library - Efficient Fine-Tuning with Message-Format Data	16K
soup-cli	Soup turns the pain of LLM fine-tuning into a simple workflow. One config, one c...	11K
torchstudio	Deep Learning Experiment	7K
instructlab	InstructLab Core package. Use this to chat with a model and execute the Instruc...	7K
hpsv3	Official implementation of HPSv3: Towards Wide-Spectrum Human Preference Score (...	6K
trl-env		6K
fedops	FL Lifecycle Operations Management Platform	6K
engram-peft	Engram-PEFT: Efficient Parameter-Efficient Fine-Tuning with Engram	6K
text2text	Text2Text Language Modeling Toolkit	6K
pruna-pro	Smash your AI models - Pro Version	6K
ultimate-utils	Brando's Ultimate Utils for Science, Machine Learning, and AI	4K
tt-lmf	Lean LLM fine-tuning toolkit with EC2 orchestration for SFT, DPO, and LoRA train...	4K
aitraining	Advanced Machine Learning Training Platform - IN DEVELOPMENT	3K
fai-rl	Foundation AI - Reinforcement Learning Library	3K
factory-sdk	factory SDK	3K
ellora	Automated Hyperparameter Optimization Platform for Efficient LLM Fine-Tuning	3K
pruna	Smash your AI models	3K
auto-lora	ADyFT(Auto Dynamic Fine Tuning) automates parameter-efficient fine-tuning of Lar...	3K
oumi	Easily fine-tune, evaluate and deploy gpt-oss, Qwen3, DeepSeek-R1, or any open s...	2K
roboreason	Roboreason package	2K
arbor-ai	A framework for optimizing DSPy programs with RL	2K
backpropagate	Headless LLM fine-tuning in 3 lines — smart defaults, VRAM-aware batch sizing, m...	2K
modelforge-finetuning	A no-code toolkit to finetune LLMs on your local GPU—just upload data, pick a ta...	2K
llmtuner	Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)	2K
finetune-cli	A tool that converts a raw corpus into the Alpaca dataset format needed for fine-tuning	2K
textrl	TextRL - reinforcement learning for text generation, built on HuggingFace TRL.	2K
agent-as-annotators	Agent-as-Annotators: Structured Distillation of Web Agent Capabilities	1K
kailash-align	LLM fine-tuning and alignment framework for the Kailash platform	1K
llm-forge-new	Config-driven, YAML-first open-source LLM training platform. Fine-tune language ...	1K
forgelm	Config-driven LLM fine-tuning with safety evaluation, EU AI Act compliance, 6 al...	1K
huggify-data	This is a helper library to push data to HuggingFace.	1K
fms-hf-tuning	FMS HF Tuning	1K
nexuss-transformer	Nexuss Transformer Framework (NTF) - Blank Slate LLM Training with RLHF & EthioB...	1K
questionnaire-mistral		1K
geniusrise-vision	Huggingface bolts for geniusrise	1K
dhurandhar	Model-agnostic edge deployment analysis framework — PLE memory analysis, TurboQu...	1K
lean-dojo-v2	LeanDojo-v2 is an end-to-end framework for training, evaluating, and deploying A...	1K
konic	Official Konic Agent Development Toolkit	980
ali-agent	Official implementation for "ALI-Agent: Assessing LLMs' Alignment with Human Valu...	923
predacons	python library based on transformers for transfer learning	908
ramalama-stack	An external provider for Llama Stack allowing for the use of RamaLama for infere...	860
lfm-trainer	Fine-tune LFM 2.5 1.2B for coding tasks on Kaggle multi-GPU with auto-publish to...	813
erl-trainer	Experiential Reinforcement Learning (ERL) — a thin wrapper on HuggingFace TRL's ...	772