18 dependents
| Package | Description | Downloads/month |
|---|---|---|
| Python bindings for NVSHMEM | 794K | |
| The CUDA target for Numba | 468K | |
| Utilities for Dask and CUDA interactions | 264K | |
| Python Bindings for the Unified Communication X library (UCX) | 119K | |
| UCX communication module for Dask Distributed | 112K | |
| Python bindings for NVSHMEM | 101K | |
| NVIDIA Math Python libraries | 46K | |
| Optimized primitives for collective multi-GPU communication | 32K | |
| CUDA Core Compute Libraries | 17K | |
| TensorRT LLM provides users with an easy-to-use Python API to define Large Langu... | 16K | |
| Python Bindings for the Unified Communication X library (UCX) | 4K | |
| UCX communication module for Dask Distributed | 2K | |
| A library for the analysis of multi-modal tensor tomography data | 635 | |
| Completely Fused Distributed MoE | 613 | |
| A GPU-based simulator for time-dependent quantum systems. | 350 | |
| GPU-accelerated tractography package | 274 | |
| Implementing the Mandelbrot set using Nvidia's CUDA API | 83 | |
| Triton language and compiler extension for distributed deep learning systems | 49 |