qwen3-next
2.24x decode TPS increase On Qwen 3.6 27B @ temp 0.6 | Native MTP Speculative Decoding On Apple Silicon With No External Drafter.
🚀 Pytorch Distributed native training library for LLMs/VLMs with OOTB Hugging Face support