59 dependents
| Description | Downloads/month |
|---|---|
| SGLang is a high-performance serving framework for large language models and mul... | 287.7M |
| PyTorch native post-training library | 405K |
| Post-training with Tinker | 73K |
| Easy-to-use and powerful LLM and SLM library with awesome model zoo. | 36K |
| TensorRT LLM provides users with an easy-to-use Python API to define Large Langu... | 16K |
| Python Speech Language Sample Analysis | 10K |
| Python Speech Language Sample Analysis | 8K |
| OpenAI and Neuronpedia's implementation of automated-interpretability, with some... | 6K |
| caching utilities | 6K |
| GPUStack | 5K |
| JetStream is a throughput and memory optimized engine for LLM inference on XLA d... | 4K |
| SGLang is a high-performance serving framework for large language models and mul... | 4K |
| Advanced market data API aggregator for analysis and real-time feeds. | 4K |
| Evals is a framework for evaluating LLMs and LLM systems, and an open-source reg... | 3K |
| | 3K |
| fastllm is a high-performance large-model inference library with no backend dependencies. It supports both tensor-parallel inference for dense models and mixed-mode inference for MoE models; any GPU with 10 GB or more of memory can run the full DeepSeek model. Dual-socket 900... | 3K |
| Cosmos-RL is a flexible and scalable Reinforcement Learning framework specialize... | 2K |
| VeOmni: Scaling Any Modality Model Training with Model-Centric Distributed Recip... | 2K |
| A CLI tool for tabular data | 1K |
| Open AI simple evals - packaged by NVIDIA | 1K |
| tiktoken is a fast BPE tokeniser for use with OpenAI's models | 1K |
| Idiomatic way to build chatgpt apps using async generators in python | 1K |
| An external provider for Llama Stack allowing for the use of RamaLama for infere... | 860 |
| Build trusted, faster, and more powerful applications with the Blazon Capabiliti... | 749 |
| | 713 |
| Evaluating Text-to-Visual Generation with Image-to-Text Generation. | 704 |
| small vlm for training and experiments | 660 |
| dr-agent-lib is an agent library for building deep research agents | 616 |
| Open GenAI Stack | 613 |
| Project VAIL Model Registry | 495 |
| A Quick Library with Llama 3.1 & 3.2 Tokenization | 490 |
| James' cookbook of evaluations and finetuning experiments | 474 |
| Foundation model for EEG reconstruction and interpolation | 324 |
| A framework for load testing llmbench APIs | 314 |
| Automated Interpretability | 291 |
| The goal of this AI agent is to generate personalised and rich UI components bas... | 274 |
| A native-PyTorch library for large scale LLM training | 251 |
| Codebase for generation-time and post-hoc text watermarking, as well as watermar... | 221 |
| SGLang fork for ppc64le with CUDA 12.4 and Torch Triton support | 186 |
| HPC-AI TECH's Fine-tuning SDK | 181 |
| Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities | 156 |
| scDiffusion-X: Diffusion Model for Single-Cell Multiome Data Generation and Anal... | 130 |
| aigc_evals | 121 |
| State-of-the-art Image & Video CLIP, Multimodal Large Language Models, and More! | 119 |
| VeOmni: Scaling any Modality Model Training to any Accelerators with PyTorch nat... | 118 |
| Llama Agentic System | 117 |
| A PyTorch native platform for training generative AI models | 113 |
| A utility to extract setup commands from a GitHub repository | 110 |
| structure context for code project | 110 |
| structure context for code project | 105 |