PyPI Stats

Rag Pipeline Python Packages

Python packages with the GitHub topic rag-pipeline. Sorted by relevance, with stars and monthly downloads.
davidpirogov
toon-llm

Token-Oriented Object Notation (TOON) is an LLM-optimized data serialization format implemented in Python.

28K 9 3
JonathanBerhe
gvdb

High-performance distributed vector database with gRPC API and high-dimensional vector support

12K 0 0
Project-Navi
navi-sanitize

Deterministic input sanitization for untrusted text — invisible characters, homoglyphs, and encoding tricks, handled before your code sees them. Zero dependencies, no ML. Python 3.12+.

9K 2 0
project-david-ai
projectdavid-platform

A single pip-installed package that orchestrates a production-ready instance of the AI stack in any environment

9K 1 0
superagentxai
superagentx

Move from idea to production in hours with policy-driven autonomous AI agents. Unified Control Plane: Centralised tools, MCPs, models, data, and policies with consistent observability and governance.

5K 191 43
ddickmann
latence-solver

CPU-reference-first Tabu Search Quadratic Knapsack solver with optional accelerator hooks

5K 11 1
SynapseKit
synapsekit

Minimal, async-first Python framework for production LLM apps: 2 hard deps, no magic, no SaaS.

4K 18 17
nanonets
nanoindex

Agentic RAG harness for long documents, with tree- and graph-based reasoning. Cited answers down to the pixel.

4K 49 5
SwiftWing21
helix-context

Agent knowledge index — IDF-weighted retrieval without embedding models. SQLite-only, sub-650ms at 17K+ entries on consumer hardware.

3K 5 0
vunone
ennoia

Declarative Document Indexing (DDI) Schemas for RAG — LLM-powered pre-indexing and hybrid retrieval.

3K 30 3
superagentxai
superagentx-handlers

Move from idea to production in hours with policy-driven autonomous AI agents. Unified Control Plane: Centralised tools, MCPs, models, data, and policies with consistent observability and governance.

3K 191 43
hallengray
rag-forge-core

Production-grade RAG pipelines with evaluation baked in

3K 7 0
hallengray
rag-forge-observability

Production-grade RAG pipelines with evaluation baked in

3K 7 0
hallengray
rag-forge-evaluator

Production-grade RAG pipelines with evaluation baked in

3K 7 0
NetApp
netapp-aide-mcp

MCP server for NetApp AI Data Engine

2K 0 0
ddickmann
voyager-index

Shard-first late-interaction retrieval for ColBERT and ColPali style workloads with CPU/GPU modes, Triton MaxSim, BM25 hybrid search, durable CRUD/WAL, multimodal preprocessing, and base64-ready reference APIs.

2K 11 1
laxmimerit
ragwire

RAGWire — Production-grade RAG toolkit for document ingestion and retrieval with hybrid search support

2K 14 3
vrraj
vrraj-bm25s-retriever

Lexical routing layer for LLM tool selection. Filter MCP-discovered and registry tools before prompt assembly using fast BM25S retrieval.

966 0 0
sanonone
kektordb-client

An official Python client for KektorDB. AI memory system combining vector search with temporal knowledge graph. Built-in cognitive engine for agents. Supports memory decay, contradiction detection, and MCP integration.

733 70 5
rodmena-limited
ragit

Correct complete RAG -- built for Highway Workflow Engine

639 4 0
kelkalot
simpleaudit

Lightweight AI Safety Auditing Framework

610 9 4
vrraj
vrraj-llm-adapter

Provider-agnostic, registry-driven LLM adapter for text generation and embeddings with normalized outputs - includes an interactive test UI.

549 1 0
AI-Buddy-Catalyst-Labs
insta-rag

A Python library that simplifies RAG through abstraction

528 27 2
Emmanuel-Bamidele
supavector

Persistent AI memory engine

509 3 2
Data from PyPI, GitHub, ClickHouse, and BigQuery