PyPI Stats
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About
Home

Model Compression Python Packages

Python packages with the GitHub topic model-compression. Sorted by relevance, with stars and monthly downloads.
tensorflow
tensorflow-model-optimization

A toolkit to optimize ML models for deployment for Keras and TensorFlow, including quantization and pruning.

108K 2K 347
horseee
deepcache

[CVPR 2024] DeepCache: Accelerating Diffusion Models for Free

61K 963 52
Microsoft
nni

An open source AutoML toolkit for automate machine learning lifecycle, including feature engineering, neural architecture search, model compression and hyper-parameter tuning.

36K 14K 2K
VainF
torch-pruning

[CVPR 2023] DepGraph: Towards Any Structural Pruning; LLMs, Vision Foundation Models, etc.

24K 3K 378
tensorflow
tf-model-optimization-nightly

A suite of tools that users, both novice and advanced can use to optimize machine learning models for deployment and execution.

10K 2K 347
kxytechnologies
kxy

A Powerful Serverless Pre-Learning and Post-Learning Analysis Toolkit

4K 51 12
666DZY666
micronet

A model compression and deploy lib.

2K 2K 477
FasterAI-Labs
fasterai

FasterAI: Prune and Distill your models with FastAI and PyTorch

1K 261 19
Picovoice
picollm

On-device LLM Inference Powered by X-Bit Quantization

1K 311 25
Microsoft
nni-daily

Neural Network Intelligence package

920 14K 2K
GeoffreyWang1117
uni-layer

30+ layer contribution metrics from 7 theoretical categories for PyTorch model compression. Bridges for Torch-Pruning and PEFT/LoRA.

877 0 0
r-papso
torch-optim

PyTorch models optimization by neural network pruning

862 3 1
aquvitae
aquvitae

The easiest Knowledge Distillation library for Light Weight DeepLearning

681 88 10
lpalbou
model-quantizer

A tool for quantizing large language models

639 2 0
gershonc
octopus-ml

A collection of handy ML and data visualization and validation tools. Go ahead and train, evaluate and validate your ML models and data with minimal effort.

635 23 5
tianyic
only-train-once

Only Train Once (OTO): Automatic One-Shot General DNN Training and Compression Framework

599 311 48
Picovoice
picollmdemo

picoLLM Inference Engine demos

514 311 25
danhicks96
prismkv

3-D Stacked-Plane KV Cache Quantizer — defensive prior art publication

494 1 1
SforAiDL
kd-lib

A Pytorch Knowledge Distillation library for benchmarking and extending works in the domains of Knowledge Distillation, Pruning, and Quantization.

446 649 61
musco-ai
musco-pytorch

MUSCO: Multi-Stage COmpression of Neural Networks

376 72 17
mlzxy
qsparse

Train neural networks with joint quantization and pruning on both weights and activations using any pytorch modules

330 42 2
microsoft
archai

Platform for Neural Architecture Search

284 486 93
Argonaut790
fused-turboquant

Fused Triton encode/decode kernels for TurboQuant KV cache compression, powered by Randomized Hadamard Transform.

226 8 0
m-pektas
bfas

Brute Force Architecture Search

195 4 0
    • Data from PyPI, GitHub, ClickHouse, and BigQuery