18 dependents
Package Description Downloads/month
TensorRT LLM provides users with an easy-to-use Python API to define Large Langu... 16K
The official Embedl Hub Python client library. 5K
AIBooster - Performance intelligence and observability tools for AI workloads 2K
XSlim is an offline quantization tools based on PPQ 1K
Model converter for Luxonis' cameras. Convert your model from ONNX, TF, ... to a... 1K
Converts darknet into other modern models 1K
('NeMo Model => Riva Deployment Converter',) 1K
Triton Model Navigator: An inference toolkit for optimizing and deploying machin... 1K
🚀 Easier & Faster YOLO Deployment Toolkit for NVIDIA 🛠️ 971
Shrinks ONNX files by quantizing large float constants into eight bit equivalent... 668
[Decommissioned] Ammo: a unified algorithmic model optimization and deployment t... 484
Simple Split for ONNX. A simple tool that automatically splits ONNX models of sp... 459
Multi-backend deep learning upscalers for pixtreme 408
Colibry NPU compiler 396
Add your description here 260
CaBRNet - Case-Based Reasoning Networks made simple 190
DNN Compiler for Heterogeneous SoCs 188
A CLI utility for using TensorRT Cloud 175