Consumer Gpu Python Packages

turboquant-vllm

TurboQuant KV cache compression plugin for vLLM — asymmetric K/V, 8 models validated, consumer GPUs

8K 46 5

zorac

Interactive CLI chat client for vLLM inference servers with persistent sessions and automatic context management

224 1 0

Search Packages