speed-of-light
TokenSpeed is a speed-of-light LLM inference engine.
A PyTorch optimizer that implements a relativistic gradient clipping mechanism, inspired by the theory of special relativity.