PyPI Stats
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About
Home

Llama Python Packages

Python packages with the GitHub topic llama. Sorted by relevance, with stars and monthly downloads.
sgl-project
sglang

SGLang is a high-performance serving framework for large language models and multimodal models.

298.9M 27K 6K
vllm-project
vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

9.2M 79K 16K
strands-agents
strands-agents

A model-driven approach to building AI agents in just a few lines of code.

5.6M 6K 816
pytorch
torchao

PyTorch native quantization and sparsity for training and inference

3.5M 3K 502
strands-agents
strands-agents-tools

A set of tools that gives agents powerful capabilities.

3.1M 1K 293
unslothai
unsloth

Web UI for training and running open models like Gemma 4, Qwen3.6, DeepSeek, gpt-oss locally.

1.9M 64K 6K
unslothai
unsloth-zoo

Web UI for training and running open models like Gemma 4, Qwen3.6, DeepSeek, gpt-oss locally.

1.4M 64K 6K
Aider-AI
aider-chat

aider is AI pair programming in your terminal

887K 44K 4K
linkedin
liger-kernel

Efficient Triton Kernels for LLM Training

793K 6K 526
explosion
curated-transformers

🤖 A PyTorch library of curated Transformer models and their composable components

529K 895 35
zilliztech
gptcache

Semantic cache for LLMs. Fully integrated with LangChain and llama_index.

480K 8K 580
sgl-project
sglang-kernel

SGLang is a high-performance serving framework for large language models and multimodal models.

269K 27K 6K
sgl-project
sgl-kernel

SGLang is a high-performance serving framework for large language models and multimodal models.

257K 27K 6K
modelscope
ms-swift

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.6, DeepSeek-R1, GLM-5.1, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Gemma4, Llava, Phi4, ...) (AAAI 2025).

174K 14K 1K
strands-agents
strands-agents-builder

An example agent demonstrating streaming, tool use, and interactivity from your terminal. This agent builder can help you to build your own agents and tools.

148K 407 86
vllm-project
vllm-tpu

A high-throughput and memory-efficient inference and serving engine for LLMs

144K 79K 16K
tensorzero
tensorzero

TensorZero is an open-source LLMOps platform that unifies an LLM gateway, observability, evaluation, optimization, and experimentation.

78K 11K 821
transformerlab
transformerlab

The open source research environment for AI researchers to seamlessly train, evaluate, and scale models from local hardware to GPU clusters.

63K 5K 510
linkedin
liger-kernel-nightly

Efficient Triton Kernels for LLM Training

63K 6K 526
xorbitsai
xinference

Swap GPT for any LLM by changing a single line of code. Xinference lets you run open-source, speech, and multimodal models on cloud, on-prem, or your laptop — all through one unified, production-ready inference API.

45K 9K 824
PaddlePaddle
paddlenlp

Easy-to-use and powerful LLM and SLM library with awesome model zoo.

36K 13K 3K
AstrBotDevs
astrbot

AI Agent Assistant that integrates lots of IM platforms, LLMs, plugins and AI feature, and can be your openclaw alternative. ✨

32K 31K 2K
hiyouga
llamafactory

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

29K 71K 9K
ther1d
shell-gpt

A command-line productivity tool powered by AI large language models like GPT-5, will help you accomplish your tasks faster and more efficiently.

27K 12K 959
    • Data from PyPI, GitHub, ClickHouse, and BigQuery