Zero-config local LLM optimization for Ollama, LM Studio, and Apple Silicon MLX. Cuts TTFT by 40%, local-agent wall time by 46%, and RAM usage threefold.
LLM inference benchmarking toolkit. Measure TTFT, inter-token latency, throughput, and P50–P99 latencies across concurrency levels.
The only voice-agent context manager with a TTFT feedback loop.
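As a minimal sketch of how the latency metrics named above (TTFT, inter-token latency, throughput, percentiles) can be derived from a streamed response — all function and key names here are illustrative assumptions, not this toolkit's actual API:

```python
import statistics


def latency_metrics(token_times: list[float], request_start: float) -> dict:
    """Derive core streaming metrics from token arrival timestamps (seconds).

    token_times: wall-clock arrival time of each generated token.
    request_start: wall-clock time the request was sent.
    """
    # Time to first token: gap between sending the request and token #1.
    ttft = token_times[0] - request_start
    # Inter-token latency: gaps between consecutive token arrivals.
    gaps = [b - a for a, b in zip(token_times, token_times[1:])]
    total = token_times[-1] - request_start
    return {
        "ttft_s": ttft,
        "itl_p50_s": statistics.median(gaps),
        # 99th percentile of inter-token gaps (needs >= 2 gaps).
        "itl_p99_s": statistics.quantiles(gaps, n=100)[98],
        "throughput_tok_s": len(token_times) / total,
    }


# Example: 5 tokens, first arriving after 0.2 s, then one every 0.05 s.
m = latency_metrics([0.2, 0.25, 0.3, 0.35, 0.4], request_start=0.0)
```

In a real harness these timestamps would be captured per request at several concurrency levels, and the per-request metrics aggregated into the P50–P99 summaries.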