17 dependents
| Package | Description | Downloads/month |
|---|---|---|
| | Sony Semiconductor Solutions Corporation, PyTorch to Uni Model | 59K |
| | Machine learning compiler based on MLIR for TPU v1.28.1-g43676b3-20260429 | 3K |
| | Repository of Neural Compressor ORT | 2K |
| | Base layers and quantization tools | 2K |
| | | 2K |
| | Lightning Fast: Faiss CPU + Onnx Quantized Multilingual Embedding Model | 2K |
| | YOLOv8 to ONNX Exporter with Pre and Post Processing | 1K |
| | A package for making predictions using a custom-trained ONNX floorplan/epc/propr... | 832 |
| | Load ONNX embedding models into Oracle AI Database with one command | 792 |
| | Framework for model compression, based on FEDOT. | 509 |
| | Optimum Library is an extension of the Hugging Face Transformers library, provid... | 452 |
| | SeaVAD: Voice Activity Detection module with silero and state machine. | 333 |
| | | 325 |
| | A python project aimed at extracting embeddings from textual data and performing... | 180 |
| | | 110 |
| | JSTProve — Verifiable ML by Inference Labs (Python CLI) | 68 |
| | SOTA low-bit LLM quantization (INT8/FP8/MXFP8/INT4/MXFP4/NVFP4) & sparsity; lead... | 6 |