Run 70B+ LLMs on a single 4GB GPU, no quantization required. Layer-streaming inference for consumer hardware.
Run OpenClaw at zero API cost: a RabbitLLM + AirLLM local-inference bridge.
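A minimal sketch of the layer-streaming idea behind these taglines: instead of holding the whole model in GPU memory, each layer's weights live on disk and are loaded, applied, and freed one at a time, so peak memory is bounded by a single layer. This toy version uses stdlib-only Python and plain lists as "weights"; the file layout (`layer_<i>.pkl`) and function names are illustrative, not AirLLM's actual API, which streams real transformer layers.

```python
import os
import pickle
import tempfile

def save_layers(layers, dirpath):
    """Persist each layer's weights to its own file so only one
    layer ever needs to be resident in memory at inference time."""
    paths = []
    for i, w in enumerate(layers):
        p = os.path.join(dirpath, f"layer_{i}.pkl")  # hypothetical layout
        with open(p, "wb") as f:
            pickle.dump(w, f)
        paths.append(p)
    return paths

def matvec(w, x):
    """Toy stand-in for a transformer layer: a matrix-vector product."""
    return [sum(wi * xi for wi, xi in zip(row, x)) for row in w]

def stream_forward(paths, x):
    """Layer-streaming forward pass: load one layer from disk,
    apply it, then drop it before loading the next."""
    for p in paths:
        with open(p, "rb") as f:
            w = pickle.load(f)
        x = matvec(w, x)
        del w  # free the layer; memory use stays O(one layer)
    return x

# Toy 2-layer "model": identity, then a doubling layer.
layers = [[[1.0, 0.0], [0.0, 1.0]], [[2.0, 0.0], [0.0, 2.0]]]
with tempfile.TemporaryDirectory() as d:
    paths = save_layers(layers, d)
    out = stream_forward(paths, [1.0, 3.0])
print(out)  # [2.0, 6.0]
```

The trade-off is the one the first tagline implies: each forward pass re-reads every layer from disk, so inference is slow, but a 70B model fits on a 4GB GPU because only one layer's weights are resident at any moment.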