PyPI Stats

Search Packages

Find Python packages by name, description, or GitHub topic, or filter by metrics.
  • vllm-project/vllm: A high-throughput and memory-efficient inference and serving engine for LLMs (9.4M downloads · 79K stars · 16K forks)
  • inducer/pyopencl: OpenCL integration for Python, plus shiny features (187K downloads · 1K stars · 249 forks)
  • dstackai/dstack: Vendor-agnostic orchestration for training, inference and agentic workloads across NVIDIA, AMD, TPU, and Tenstorrent on clouds, Kubernetes, and bare metal. (174K downloads · 2K stars · 224 forks)
  • vllm-project/vllm-tpu: A high-throughput and memory-efficient inference and serving engine for LLMs (143K downloads · 79K stars · 16K forks)
  • LMCache/lmcache: Supercharge Your LLM with the Fastest KV Cache Layer (120K downloads · 8K stars · 1K forks)
  • stackav-oss/conch-triton-kernels: A "standard library" of Triton kernels. (44K downloads · 24 stars · 3 forks)
  • amd/amd-gaia: Build AI agents for your PC (6K downloads · 1K stars · 92 forks)
  • mcgillij/amdfan: AMD fan control utility, forked from amdgpu-fan and updated. (1K downloads · 38 stars · 9 forks)
  • grillcheese-ai/grilly: GPU-accelerated neural network operations using Vulkan compute shaders (1K downloads · 26 stars · 1 fork)
  • uccl-project/uccl: UCCL is an efficient communication library for GPUs, covering collectives, P2P (e.g., KV cache transfer, RL weight transfer), and EP (e.g., GPU-driven) (1K downloads · 1K stars · 144 forks)
  • MedVisBonn/eyepie: A Python package to read, analyse, and visualize OCT and fundus data from various sources. (1K downloads · 88 stars · 17 forks)
  • last9/l9gpu: GPU telemetry with workload attribution. One OTLP agent per node ties hardware metrics (NVIDIA, AMD, Intel Gaudi) to the K8s pod or Slurm job burning the GPU, so you know who's paying for that idle H100. (1K downloads · 10 stars · 2 forks)
  • nabla-ml/nabla-ml: Nabla: High-Performance Scientific Computing (995 downloads · 335 stars · 13 forks)
  • MedVisBonn/eyepy: A Python package to read, analyse, and visualize OCT and fundus data from various sources. (919 downloads · 88 stars · 17 forks)
  • intexcor/gputop: A simple real-time GPU monitoring tool for NVIDIA, AMD, and Intel GPUs. (785 downloads · 9 stars · 0 forks)
  • ssube/onnx-web: Web UI for running ONNX models (557 downloads · 234 stars · 30 forks)
  • vllm-project/vllm-hust: A high-throughput and memory-efficient inference and serving engine for LLMs (437 downloads · 79K stars · 16K forks)
  • vllm-project/wxy-test: A high-throughput and memory-efficient inference and serving engine for LLMs (375 downloads · 2K stars · 1K forks)
  • vllm-project/vllm-xft: A high-throughput and memory-efficient inference and serving engine for LLMs (345 downloads · 79K stars · 16K forks)
  • vllm-project/ai-dynamo-vllm: A high-throughput and memory-efficient inference and serving engine for LLMs (344 downloads · 79K stars · 16K forks)
  • vllm-project/vllm-acc: A high-throughput and memory-efficient inference and serving engine for LLMs (342 downloads · 79K stars · 16K forks)
  • vllm-project/nextai-vllm: A high-throughput and memory-efficient inference and serving engine for LLMs (273 downloads · 79K stars · 16K forks)
  • Ed-Yang/gpuctl: GPU control and failure notification/recovery (238 downloads · 2 stars · 0 forks)
  • vllm-project/vllm-consul: A high-throughput and memory-efficient inference and serving engine for LLMs (219 downloads · 79K stars · 16K forks)
    • Data from PyPI, GitHub, ClickHouse, and BigQuery