49 dependents
| Package | Description | Downloads/month |
|---|---|---|
| SGLang is a high-performance serving framework for large language models and mul... | 287.7M | |
| Web UI for training and running open models like Gemma 4, Qwen3.6, DeepSeek, gpt... | 1.4M | |
| On-device AI across mobile, embedded and edge for PyTorch | 196K | |
| Support PyTorch model conversion with LiteRT. | 43K | |
| LLM model quantization (compression) toolkit with HW acceleration support for Nv... | 38K | |
| A package for NeuCodec, based on xcodec2. | 24K | |
| Support PyTorch model conversion with LiteRT. | 21K | |
| Go ahead and axolotl questions | 20K | |
| TensorRT LLM provides users with an easy-to-use Python API to define Large Langu... | 16K | |
| Support PyTorch model conversion with LiteRT. | 13K | |
| Offline inference engine for art, real-time voice conversations, LLM powered cha... | 12K | |
| Smash your AI models - Pro Version | 6K | |
| SGLang is a high-performance serving framework for large language models and mul... | 4K | |
| 🚀 Pytorch Distributed native training library for LLMs/VLMs with OOTB Hugging Fa... | 3K | |
| Smash your AI models | 3K | |
| Crilla is a simple way to introduce optimized single-GPU training into your proj... | 3K | |
| Easily fine-tune, evaluate and deploy gpt-oss, Qwen3, DeepSeek-R1, or any open s... | 2K | |
| ChemBFN: Bayesian Flow Network Framework for Chemistry Tasks. Developed in Hiros... | 2K | |
| Cosmos-RL is a flexible and scalable Reinforcement Learning framework specialize... | 2K | |
| A modular SDK for training ML models (image, tabular, timeseries, etc.) | 2K | |
| VoXtream is a Full-Stream Zero-shot TTS model with Extremely Low Latency and Spe... | 1K | |
| GPU-accelerated training for CALM (Catastrophically Abridged Language Models) | 1K | |
| A library for XCodec2 model. | 1K | |
| An easy-to-use library and command-line tool for TTS | 1K | |
| CpGPT: a Foundation Model for DNA Methylation. | 843 | |
| NVIDIA's package for core modules common across TAO Toolkit DNNs. | 739 | |
| Neural Graphics Model Gym is a Python® toolkit for developing real-time Neural G... | 421 | |
| An Evolutionary-scale Model (ESM) for protein function prediction from amino aci... | 351 | |
| Tiny PyTorch GPU-benchmark CLI | 317 | |
| NVIDIA AITune is an inference toolkit designed for tuning and deploying Deep Lea... | 284 | |
| A protein sequence embedding model distilled from ESMC. | 250 | |
| SOLO | 227 | |
| Package for ASRChild | 216 | |
| A Fine-tuning assistant for large language models | 193 | |
| ACE-Step 1.5 | 190 | |
| SGLang fork for ppc64le with CUDA 12.4 and Torch Triton support | 186 | |
| Finetrainers is a work-in-progress library to support (accessible) training of d... | 175 | |
| PyTorch Quantization Framework For OCP MX Datatypes. | 118 | |
| FireRedTTS2 - speech generation utilities and model wrapper | 109 | |
| Tool for converting PyTorch models into raw C codes with minimal dependency and ... | 87 | |
| A native-PyTorch library for LLM fine-tuning | 85 | |
| Composable model compression for PyTorch — prune, quantize, and ship. | 79 | |
| A library for reproducible model training | 75 | |
| Add your description here | 68 | |
| CoreAI | 47 | |
| LLMFlowStack is a framework for training and using LLMs (LLaMA, GPT-OSS, Gemma, ... | 22 | |
| Handwritten + image OCR. | 11 | |
| Common used component in AI applications. (inference interface, processing utils... | 3 | |
| SGLang is a fast serving framework for large language models and vision language... | 2 |