PyPI Stats

Cuda Python Packages

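Python packages whose GitHub repository carries the topic cuda, sorted by relevance. Each entry gives the GitHub owner, the package name, its description, and a row of figures: monthly downloads, stars, and a third count that the page leaves unlabeled.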
sgl-project
sglang

SGLang is a high-performance serving framework for large language models and multimodal models.

298.9M 27K 6K
vllm-project
vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

9.2M 79K 16K
OpenNMT
ctranslate2

Fast inference engine for Transformer models

8.4M 4K 478
catboost
catboost

A fast, scalable, high performance Gradient Boosting on Decision Trees library, used for ranking, classification, regression and other machine learning tasks for Python, R, Java, C++. Supports computation on CPU and GPU.

6.4M 9K 1K
NVIDIA
nvidia-cutlass-dsl

CUDA Templates and Python DSLs for High-Performance Linear Algebra

4.1M 10K 2K
flashinfer-ai
flashinfer-python

FlashInfer: Kernel Library for LLM Serving

4.1M 6K 948
NVIDIA
nvidia-cutlass-dsl-libs-base

CUDA Templates and Python DSLs for High-Performance Linear Algebra

3.5M 10K 2K
pytorch
torchao

PyTorch native quantization and sparsity for training and inference

3.5M 3K 502
cupy
cupy-cuda12x

NumPy & SciPy for GPU

3.4M 11K 1K
flashinfer-ai
flashinfer-cubin

FlashInfer: Kernel Library for LLM Serving

2.7M 6K 948
replicate
cog

Containers for machine learning

2.5M 9K 687
isl-org
open3d

Open3D: A Modern Library for 3D Data Processing

1.7M 14K 3K
meta-pytorch
torchrec

PyTorch domain library for recommendation systems

1.5M 3K 642
NVIDIA
warp-lang

A Python framework for GPU-accelerated simulation, robotics, and machine learning.

975K 7K 494
PennyLaneAI
pennylane-lightning

The Lightning plugin ecosystem provides fast quantum state-vector and tensor network simulators written in C++ for use with PennyLane.

969K 136 52
XuehaiPan
nvitop

An interactive NVIDIA-GPU process viewer and beyond, the one-stop solution for GPU process management.

424K 7K 230
cupy
cupy-cuda11x

NumPy & SciPy for GPU

355K 11K 1K
cupy
cupy-cuda13x

NumPy & SciPy for GPU

340K 11K 1K
sgl-project
sglang-kernel

SGLang is a high-performance serving framework for large language models and multimodal models.

269K 27K 6K
rapidsai
libcudf-cu12

cuDF - GPU DataFrame Library

260K 10K 1K
sgl-project
sgl-kernel

SGLang is a high-performance serving framework for large language models and multimodal models.

257K 27K 6K
rapidsai
rmm-cu12

RAPIDS Memory Manager

229K 694 247
rapidsai
pylibraft-cu12

RAFT contains fundamental widely-used algorithms and primitives for machine learning and information retrieval. The algorithms are CUDA-accelerated and form building blocks for more easily writing high performance applications.

226K 1K 231
rapidsai
libraft-cu12

RAFT contains fundamental widely-used algorithms and primitives for machine learning and information retrieval. The algorithms are CUDA-accelerated and form building blocks for more easily writing high performance applications.

223K 1K 231
Data from PyPI, GitHub, ClickHouse, and BigQuery.