Model Compression Python Packages

tensorflow-model-optimization

A toolkit to optimize ML models for deployment for Keras and TensorFlow, including quantization and pruning.

108K 2K 347

deepcache

[CVPR 2024] DeepCache: Accelerating Diffusion Models for Free

61K 963 52

nni

An open source AutoML toolkit for automate machine learning lifecycle, including feature engineering, neural architecture search, model compression and hyper-parameter tuning.

36K 14K 2K

torch-pruning

[CVPR 2023] DepGraph: Towards Any Structural Pruning; LLMs, Vision Foundation Models, etc.

24K 3K 378

tf-model-optimization-nightly

A suite of tools that users, both novice and advanced can use to optimize machine learning models for deployment and execution.

10K 2K 347

kxy

A Powerful Serverless Pre-Learning and Post-Learning Analysis Toolkit

4K 51 12

micronet

A model compression and deploy lib.

2K 2K 477

fasterai

FasterAI: Prune and Distill your models with FastAI and PyTorch

1K 261 19

picollm

On-device LLM Inference Powered by X-Bit Quantization

1K 311 25

nni-daily

Neural Network Intelligence package

920 14K 2K

uni-layer

30+ layer contribution metrics from 7 theoretical categories for PyTorch model compression. Bridges for Torch-Pruning and PEFT/LoRA.

877 0 0

torch-optim

PyTorch models optimization by neural network pruning

862 3 1

aquvitae

The easiest Knowledge Distillation library for Light Weight DeepLearning

681 88 10

model-quantizer

A tool for quantizing large language models

639 2 0

octopus-ml

A collection of handy ML and data visualization and validation tools. Go ahead and train, evaluate and validate your ML models and data with minimal effort.

635 23 5