18 dependents
Package Description Downloads/month
Python bindings for NVSHMEM 794K
The CUDA target for Numba 468K
Utilities for Dask and CUDA interactions 264K
Python Bindings for the Unified Communication X library (UCX) 119K
UCX communication module for Dask Distributed 112K
Python bindings for NVSHMEM 101K
NVIDIA Math Python libraries 46K
Optimized primitives for collective multi-GPU communication 32K
CUDA Core Compute Libraries 17K
TensorRT LLM provides users with an easy-to-use Python API to define Large Langu... 16K
Python Bindings for the Unified Communication X library (UCX) 4K
UCX communication module for Dask Distributed 2K
A library for the analysis of multi-modal tensor tomography data 635
Completely Fused Distributed MoE 613
A GPU-based simulator for time-dependent quantum systems. 350
GPU-accelerated tractography package 274
Implementing the Mandelbrot set using Nvidia's CUDA API 83
Triton language and compiler extension for distributed deep learning systems 49