PyPI Stats

Speculative Decoding Python Packages

Python packages with the GitHub topic speculative-decoding. Sorted by relevance, with stars and monthly downloads.
youssofal
mtplx

2.24x decode TPS increase on Qwen 3.6 27B at temperature 0.6. Native MTP speculative decoding on Apple Silicon with no external drafter.

11K 268 11
Tencent
angelslim

Model compression toolkit engineered for enhanced usability, comprehensiveness, and efficiency.

7K 1K 123
aphrodite-engine
aphrodite-engine

Large-scale LLM inference engine

7K 2K 197
intel
intel-extension-for-transformers

Repository of Intel® Extension for Transformers

6K 2K 217
youngharold
tightwad

Mixed-vendor GPU inference cluster manager with speculative decoding

3K 21 2
dualform-labs
m5-infer

Extraordinary speed, extraordinary quality — an MLX-based inference engine for Apple Silicon.

2K 0 1
SafeAILab
eagle-llm

Official Implementation of EAGLE-1 (ICML'24), EAGLE-2 (EMNLP'24), and EAGLE-3 (NeurIPS'25).

331 2K 276
lpoee
opencac

Multi-agent orchestration CLI for AI coding tools. Chain Claude Code, Antigravity, and Codex with validated handoffs, JSONL audit logging, hybrid cloud/local routing, and speculative decoding for local LLMs.

130 2 0
Tencent
angelslim-fork

A toolkit for compressing LLMs.

122 1K 123
llmsresearch
specstream

Fast LLM inference with 2.8x speedup using speculative decoding

82 8 1
    • Data from PyPI, GitHub, ClickHouse, and BigQuery