PyPI Stats
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About
Home

Search Packages

Find Python packages by name, description, GitHub topic, or filter by metrics
Blaizzy
mlx-vlm

MLX-VLM is a package for inference and fine-tuning of Vision Language Models (VLMs) on your Mac using MLX.

349K 5K 506
microsoft
foundry-local-sdk

Foundry Local Manager Python SDK: Control-plane SDK for Foundry Local.

319K 2K 307
tobocop2
lilbee

Terminal-first local search and AI chat over your documents, code, and crawled websites. Semantic + hybrid search, vision OCR, auto-built wiki, browsable GGUF model catalog. Works as CLI, TUI, MCP server, REST API, or Python library. Offline by default, no sidecar services.

20K 16 3
tanavc1
llm-autotune

Zero-config local LLM optimization for Ollama, LM Studio, and Apple Silicon MLX. Reduces TTFT by 40%, wall time for local agents by 46%, and RAM usage by 3x.

7K 24 1
lyonzin
knowledge-rag

Drop docs, search instantly from Claude Code — 12 MCP tools, 20 format parsers, hybrid search + reranking. Zero servers, zero API keys, 100% local.

6K 67 14
sinsniwal
hey-cli-python

Your terminal buddy that turns plain English into shell scripts — and runs them. Powered by Ollama. 100% local, 100% private.

5K 4 0
nrl-ai
edgevox

Offline voice agent framework for robots.

3K 4 0
microsoft
foundry-local-sdk-winml

Foundry Local Manager Python SDK: Control-plane SDK for Foundry Local.

3K 2K 307
MRWillisT
pullnexus

Pull-on-demand skill registry for local LLMs. Search, install, and contribute skills, tools, datasets, and playbooks for Ollama, LM Studio, and any local AI setup.

2K 1 0
tanaos
artifex

Small Language Model Inference, Fine-Tuning and Observability.

2K 93 13
raphasouthall
neurostack

Build, maintain, and search your knowledge vault. CLI + MCP server with stale note detection, semantic search, and neuroscience-grounded memory.

2K 41 3
jonigl
ollama-mcp-bridge

Extend the Ollama API with dynamic AI tool integration from multiple MCP (Model Context Protocol) servers. Fully compatible, transparent, and developer-friendly, ideal for building powerful local LLM applications, AI agents, and custom chatbots

2K 83 25
Keshavsharma-code
deepsleep-ai

Zero-cost background coding agent with layered memory, idle-time dreaming, MCP server, and Ollama support.

2K 35 14
aminoy77
hellochusquis

Terminal AI agent with multi-provider fallback, file management, code execution, and persistent memory. Supports OpenRouter, Ollama, OpenAI, Anthropic, Gemini, Groq and 15+ more providers.

2K 1 0
mozilla-ai
encoderfile

Python bindings for encoderfile.

2K 93 15
msb-msb
mycoswarm

Distributed AI for everyone. Turn forgotten hardware into a thinking network.

2K 3 2
Gustavjiversen01
lexaloud

A local, private Linux text-to-speech tool. Select text in any app, press a hotkey, hear it read by Kokoro-82M on your GPU.

1K 0 0
HaseebKhalid1507
velocirag

Lightning-fast RAG for AI agents. ONNX-powered, 4-layer fusion, MCP server. No PyTorch.

1K 6 1
Akshat190
gitbriefly

Your daily developer standup — powered by your Git history

1K 0 0
zraisan
globalmm

Add vision to any local LLM, no training.

1K 2 0
fcjr
ltts

Quick CLI for local text-to-speech using Qwen3-TTS or Kokoro TTS.

791 9 0
mahimairaja
locallens

Search your files by talking to them - 100% offline

742 9 1
GRID-INTELLIGENCE
grid-intelligence

GRID - Geometric Resonance Intelligence Driver: A comprehensive framework for exploring complex systems through geometric resonance patterns, cognitive decision support, local-first AI, and event-driven agentic systems

635 0 0
iBz-04
quaynor

Embed local models in your app

593 3 0
    • Data from PyPI, GitHub, ClickHouse, and BigQuery