17 dependents
| Package | Description | Downloads/month |
| --- | --- | --- |
| | Sony Semiconductor Solutions Corporation, PyTorch to Uni Model | 59K |
| | Machine learning compiler based on MLIR for TPU v1.28.1-g43676b3-20260429 | 3K |
| | Repository of Neural Compressor ORT | 2K |
| | Base layers and quantization tools | 2K |
| | | 2K |
| | Lightning Fast: Faiss CPU + Onnx Quantized Multilingual Embedding Model | 2K |
| | YOLOv8 to ONNX Exporter with Pre and Post Processing | 1K |
| | A package for making predictions using a custom-trained ONNX floorplan/epc/propr... | 832 |
| | Load ONNX embedding models into Oracle AI Database with one command | 792 |
| | Framework for model compression, based on FEDOT. | 509 |
| | Optimum Library is an extension of the Hugging Face Transformers library, provid... | 452 |
| | SeaVAD: Voice Activity Detection module with silero and state machine. | 333 |
| | | 325 |
| | A python project aimed at extracting embeddings from textual data and performing... | 180 |
| | | 110 |
| | JSTProve — Verifiable ML by Inference Labs (Python CLI) | 68 |
| | SOTA low-bit LLM quantization (INT8/FP8/MXFP8/INT4/MXFP4/NVFP4) & sparsity; lead... | 6 |