PyPI Stats

Search Packages

Find Python packages by name, description, GitHub topic, or filter by metrics
aphrodite-engine
aphrodite-engine

Large-scale LLM inference engine

7K 2K 194
Tencent
angelslim

Model compression toolkit engineered for enhanced usability, comprehensiveness, and efficiency.

7K 884 95
intel
intel-extension-for-transformers

Repository of Intel® Extension for Transformers

6K 2K 217
youngharold
tightwad

Mixed-vendor GPU inference cluster manager with speculative decoding

3K 20 2
dualform-labs
m5-infer

Extraordinary speed, extraordinary quality — an MLX-based inference engine for Apple Silicon.

2K 0 1
SafeAILab
eagle-llm

Official Implementation of EAGLE-1 (ICML'24), EAGLE-2 (EMNLP'24), and EAGLE-3 (NeurIPS'25).

234 2K 275
Tencent
angelslim-fork

A toolkit for compressing LLM models.

233 884 95
lpoee
opencac

OpenCAC: Claude Code, Antigravity, and Codex working together across cloud APIs and local models.

173 2 0
llmsresearch
specstream

Fast LLM inference with 2.8x speedup using speculative decoding

74 8 1
    • Data from PyPI, GitHub, ClickHouse, and BigQuery