PyPI Stats

Inference Engine Python Packages

Python packages with the GitHub topic inference-engine. Sorted by relevance, with stars and monthly downloads.
pylint-dev
astroid

A common base representation of python source code for pylint and other projects

50.6M 575 323
quic
qai-hub-models

Qualcomm® AI Hub Models is our collection of state-of-the-art machine learning models optimized for performance (latency, memory etc.) and ready to deploy on Qualcomm® devices.

41K 1K 175
siliconflow
onediff

An out-of-the-box acceleration library for diffusion models

30K 2K 129
nilp0inter
experta

Expert Systems for Python

12K 188 46
Auctalis
nocturnusai

Verified knowledge for AI agents. Compress context, extract and store facts, define rules, and ask questions — get deterministic answers with proof, not LLM guesses. Connect agents via MCP, Python SDK, TypeSc

8K 2 0
aphrodite-engine
aphrodite-engine

Large-scale LLM inference engine

8K 2K 194
qualcomm
qai-hub-models-cli

Qualcomm® AI Hub Models is our collection of state-of-the-art machine learning models optimized for performance (latency, memory etc.) and ready to deploy on Qualcomm® devices.

7K 1K 175
siliconflow
onediffx

OneDiff: An out-of-the-box acceleration library for diffusion models.

7K 2K 129
brontoguana
krasis

Krasis is no longer distributed via PyPI. Install from GitHub: https://github.com/brontoguana/krasis

5K 447 22
friendliai
friendli-client

[⛔️ DEPRECATED] Friendli: the fastest serving engine for generative AI

5K 50 7
nobodywho-ooo
nobodywho

NobodyWho is an inference engine that lets you run LLMs locally and efficiently on any device.

4K 861 56
NeuroBrix
neurobrix

Universal Deep Learning Inference Engine — execute any AI model without model-specific code

3K 8 1
FedML-AI
fedml

FEDML - The unified and scalable ML library for large-scale distributed training, model serving, and federated learning. FEDML Launch, a cross-cloud scheduler, further enables running any AI jobs on any GPU cloud or on-premise cluster. Built on this library, TensorOpera AI (https://TensorOpera.ai) is your generative AI platform at scale.

3K 4K 766
dualform-labs
m5-infer

Extraordinary speed, extraordinary quality — an MLX-based inference engine for Apple Silicon.

2K 0 1
kyegomez
exxa

Exa - Pytorch

2K 26 4
Zyora-Dev
zllm-zse

The inference engine the open-source world built for itself.

1K 151 2
chengzeyi
para-attn

https://wavespeed.ai/ Context parallel attention that accelerates DiT model inference with dynamic caching

1K 426 45
ovg-project
kvcached

Virtualized Elastic KV Cache for Dynamic GPU Sharing and Beyond

1K 902 107
banderlog
opencv-python-inference-engine

Wrapper package for OpenCV with Inference Engine python bindings.

734 34 6
jithin8mathew
yolomosaic

A Python library for visualizing YOLO detections and segmented instances on large orthomosaic images, with the ability to generate shapefiles for GIS integration

721 0 0
Image-Py
planer

Powerful Light Artificial NEuRon inference framework for CNN

718 62 14
iBz-04
quaynor

Embed local models in your app

641 3 0
openvinotoolkit
ov-training-kit

Wrappers for scikit-learn, PyTorch and Tensorflow models with OpenVINO optimization

564 174 182
Aisuko
kimchima

The collections of tools for ML model development.

535 0 2
Data from PyPI, GitHub, ClickHouse, and BigQuery