MLX + Metal implementation of mHC: Manifold-Constrained Hyper-Connections by DeepSeek-AI.
Fused Triton encode/decode kernels for TurboQuant KV cache compression, powered by Randomized Hadamard Transform.