12 dependents
Description                                                                          Downloads/month
Community maintained hardware plugin for vLLM on Ascend                              7K
DeepLink Inference Extension                                                         1K
SiliconDiff-NPU                                                                      580
High-performance FlashAttention implementation for Ascend NPU                        486
Ascend end-to-end large-model training adaptation framework based on torchtitan      481
Ascend quick migration adaptation package                                            387
A high-throughput and memory-efficient inference and serving engine for LLMs         180
A lightweight vLLM implementation built from scratch that runs on NPU                137
Triton for DSA                                                                       72
The openmind-accelerate is a product which allows you to use NVIDIA Megatron-LM ...  63
A high-throughput and memory-efficient inference and serving engine for LLMs         51
vLLM Ascend backend plugin                                                           36