PyPI Stats

Llamacpp Python Packages

Python packages with the GitHub topic llamacpp. Sorted by relevance, with stars and monthly downloads.
JohnSnowLabs
spark-nlp

State of the Art Natural Language Processing

1.1M 4K 743
khoj-ai
khoj

Your AI second brain. Self-hostable. Get answers from the web or your docs. Build custom agents, schedule automations, do deep research. Turn any online or local LLM into your personal, autonomous AI (gpt, claude, gemini, llama, qwen, mistral). Get started - free.

60K 34K 2K
khoj-ai
khoj-assistant

Your AI second brain. Self-hostable. Get answers from the web or your docs. Build custom agents, schedule automations, do deep research. Turn any online or local LLM into your personal, autonomous AI (gpt, claude, gemini, llama, qwen, mistral). Get started - free.

47K 34K 2K
xorbitsai
xinference

Swap GPT for any LLM by changing a single line of code. Xinference lets you run open-source, speech, and multimodal models on cloud, on-prem, or your laptop — all through one unified, production-ready inference API.

44K 9K 824
gptme
gptme

Your agent in your terminal, equipped with local tools: writes code, uses the terminal, browses the web. Make your own persistent autonomous agent on top!

26K 4K 383
Maximilian-Winter
llama-cpp-agent

The llama-cpp-agent framework is a tool designed for easy interaction with Large Language Models (LLMs). It lets users chat with LLMs, execute structured function calls, and get structured output. It also works with models not fine-tuned for JSON output and function calls.

15K 630 70
abdeladim-s
pyllamacpp

Python bindings for llama.cpp

14K 68 24
OEvortex
webscout

Webscout is the all-in-one search and AI toolkit you need. Discover insights with Yep.com, DuckDuckGo, and Phind; access cutting-edge AI models; transcribe YouTube videos; generate temporary emails and phone numbers; perform text-to-speech conversions; and much more!

14K 344 63
containers
ramalama

RamaLama is an open-source developer tool that simplifies the local serving of AI models from any source and facilitates their use for inference in production, all through the familiar language of containers.

11K 3K 337
jjang-ai
jang

JANG — GGUF for MLX. YOU MUST USE JANG_Q RUNTIME. Adaptive Mixed-Precision Quantization + Runtime for Apple Silicon

10K 142 20
eliranwong
toolmate

ToolMate AI, developed by Eliran Wong, is a cutting-edge AI companion that seamlessly integrates agents, tools, and plugins to excel in conversations, generative work, and task execution. Supports custom workflow and plugins to automate multi-step actions.

7K 178 23
llmware-ai
llmware

Unified framework for building enterprise RAG pipelines with small, specialized models

6K 15K 3K
luongnv89
claude-codex-local

Hit your limit? Need privacy? Just swap the model, everything else stays

4K 22 1
ddh0
easy-llama

Python package wrapping llama.cpp for on-device LLM inference

4K 105 7
corefrg
lexicont

Policy-driven agent for real-time text moderation

3K 1 1
eliranwong
toolmate-lite

ToolMate AI, developed by Eliran Wong, is a cutting-edge AI companion that seamlessly integrates agents, tools, and plugins to excel in conversations, generative work, and task execution. Supports custom workflow and plugins to automate multi-step actions.

3K 178 23
TAO71-AI
i4-0-client-py

Fully modular AI server and client

3K 2 0
Freed-Wu
translate-shell

Translate text with Google, Bing, youdaozhiyun, haici, stardict, OpenAI, a large language model on the local machine, etc. at the same time from the CLI, GUI (GNU/Linux, Android, macOS, and Windows), REPL, Python, shell, and Vim.

3K 50 4
eliranwong
toolmate-android

ToolMate AI, developed by Eliran Wong, is a cutting-edge AI companion that seamlessly integrates agents, tools, and plugins to excel in conversations, generative work, and task execution. Supports custom workflow and plugins to automate multi-step actions.

3K 178 23
vinhnx
vtai

VT.ai - multimodal AI chat app with dynamic conversation routing

2K 112 17
kyegomez
exxa

Exa - PyTorch

2K 26 4
julep-ai
steadytext

Deterministic text generation and embeddings with zero configuration

1K 43 2
BrunoArsioli
llama-optimus

Lightweight Python tool using Optuna for tuning llama.cpp flags: towards optimal tok/s for your machine

1K 29 5
luo-anthony
developergpt

DeveloperGPT is an LLM-powered command-line tool that converts natural language to terminal commands and supports in-terminal chat.

1K 45 5
Data from PyPI, GitHub, ClickHouse, and BigQuery