PyPI Stats

Search Packages

Find Python packages by name, description, GitHub topic, or filter by metrics
JohnSnowLabs
spark-nlp

State of the Art Natural Language Processing

1.1M 4K 743
khoj-ai
khoj

Your AI second brain. Self-hostable. Get answers from the web or your docs. Build custom agents, schedule automations, do deep research. Turn any online or local LLM into your personal, autonomous AI (gpt, claude, gemini, llama, qwen, mistral). Get started - free.

61K 34K 2K
khoj-ai
khoj-assistant

Your AI second brain. Self-hostable. Get answers from the web or your docs. Build custom agents, schedule automations, do deep research. Turn any online or local LLM into your personal, autonomous AI (gpt, claude, gemini, llama, qwen, mistral). Get started - free.

48K 34K 2K
xorbitsai
xinference

Swap GPT for any LLM by changing a single line of code. Xinference lets you run open-source, speech, and multimodal models on cloud, on-prem, or your laptop — all through one unified, production-ready inference API.

43K 9K 824
gptme
gptme

Your agent in your terminal, equipped with local tools: writes code, uses the terminal, browses the web. Make your own persistent autonomous agent on top!

25K 4K 383
Maximilian-Winter
llama-cpp-agent

The llama-cpp-agent framework is a tool designed for easy interaction with Large Language Models (LLMs). It allows users to chat with LLMs, execute structured function calls, and get structured output. It also works with models not fine-tuned for JSON output and function calls.

15K 630 70
OEvortex
webscout

Webscout is the all-in-one search and AI toolkit you need. Discover insights with Yep.com, DuckDuckGo, and Phind; access cutting-edge AI models; transcribe YouTube videos; generate temporary emails and phone numbers; perform text-to-speech conversions; and much more!

13K 344 63
abdeladim-s
pyllamacpp

Python bindings for llama.cpp

13K 68 24
containers
ramalama

RamaLama is an open-source developer tool that simplifies the local serving of AI models from any source and facilitates their use for inference in production, all through the familiar language of containers.

11K 3K 337
jjang-ai
jang

JANG — GGUF for MLX. YOU MUST USE JANG_Q RUNTIME. Adaptive Mixed-Precision Quantization + Runtime for Apple Silicon

9K 142 20
eliranwong
toolmate

ToolMate AI, developed by Eliran Wong, is a cutting-edge AI companion that seamlessly integrates agents, tools, and plugins to excel in conversations, generative work, and task execution. Supports custom workflow and plugins to automate multi-step actions.

6K 178 23
llmware-ai
llmware

Unified framework for building enterprise RAG pipelines with small, specialized models

5K 15K 3K
ddh0
easy-llama

Python package wrapping llama.cpp for on-device LLM inference

4K 105 7
luongnv89
claude-codex-local

Hit your limit? Need privacy? Just swap the model, everything else stays

4K 22 1
corefrg
lexicont

Policy-driven agent for real-time text moderation

3K 1 1
TAO71-AI
i4-0-client-py

Fully modular AI server and client

3K 2 0
Freed-Wu
translate-shell

Translate text with Google, Bing, youdaozhiyun, haici, stardict, OpenAI, a local large language model, etc. at the same time from the CLI, GUI (GNU/Linux, Android, macOS and Windows), REPL, Python, shell and Vim.

3K 50 4
eliranwong
toolmate-lite

ToolMate AI, developed by Eliran Wong, is a cutting-edge AI companion that seamlessly integrates agents, tools, and plugins to excel in conversations, generative work, and task execution. Supports custom workflow and plugins to automate multi-step actions.

3K 178 23
eliranwong
toolmate-android

ToolMate AI, developed by Eliran Wong, is a cutting-edge AI companion that seamlessly integrates agents, tools, and plugins to excel in conversations, generative work, and task execution. Supports custom workflow and plugins to automate multi-step actions.

3K 178 23
vinhnx
vtai

VT.ai - multimodal AI chat app with dynamic conversation routing

2K 112 17
kyegomez
exxa

Exa - PyTorch

2K 26 4
julep-ai
steadytext

Deterministic text generation and embeddings with zero configuration

1K 43 2
BrunoArsioli
llama-optimus

Lightweight Python tool using Optuna for tuning llama.cpp flags: towards optimal tok/s for your machine

1K 29 5
zraisan
globalmm

Add vision to any local LLM, no training.

1K 2 0
    • Data from PyPI, GitHub, ClickHouse, and BigQuery