PyPI Stats

Retrieval Python Packages

Python packages tagged with the GitHub topic "retrieval". Sorted by relevance, with stars and monthly downloads.
qdrant
fastembed

Fast, Accurate, Lightweight Python library to make State of the Art Embedding

11.8M 3K 196
embeddings-benchmark
mteb

MTEB: Massive Text Embedding Benchmark

2.7M 3K 608
xhluca
bm25s

Fast BM25 search in Python, powered by Numpy and Numba

1.4M 2K 99
beir-cellar
beir

A Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets.

42K 2K 243
VectifyAI
pageindex

📑 PageIndex: Document Index for Vectorless, Reasoning-based RAG

38K 26K 2K
qdrant
fastembed-gpu

Fast, Accurate, Lightweight Python library to make State of the Art Embedding

19K 3K 196
ContextualAI
gritlm

Generative Representational Instruction Tuning

16K 691 50
ARM-DOE
act-atmos

Atmospheric data Community Toolkit - A python based toolkit for exploring and analyzing time series atmospheric datasets

11K 183 40
ben-ranford
cellin

build long-lived multimodal memory, dream over it, and retrieve context with transparent weighting

11K 0 0
meinardmueller
libfmp

libfmp - Python package for teaching and learning Fundamentals of Music Processing (FMP)

10K 222 20
MinishLab
semble

Fast and Accurate Code Search for Agents. Uses ~98% fewer tokens than grep+read

10K 514 46
usemoss
inferedge-moss

Official Repo of Moss

9K 330 33
mixedbread-ai
mxbai-rerank

Crispy reranking models by Mixedbread

7K 51 7
intel
intel-extension-for-transformers

Repository of Intel® Extension for Transformers

6K 2K 217
lemon07r
vera-ai

Local code search combining BM25, vector similarity, and cross-encoder reranking. Parses 60+ languages with tree-sitter, runs entirely offline, and returns structured results with file paths, line ranges, and symbol metadata. Built in Rust.

6K 70 8
MadMando
prism-rag

PRISM — Epistemic Graph RAG with Spreading Activation. Novel retrieval that understands how knowledge relates, not just what it says.

6K 1 0
VectifyAI
openkb

OpenKB: Open LLM Knowledge Base

5K 1K 113
answerdotai
byaldi

Use late-interaction multi-modal models such as ColPali in just a few lines of code.

5K 847 93
memodb-io
memobase

User Profile-Based Long-Term Memory for AI Chatbot Applications.

5K 3K 209
mohankrishnaalavala
context-router-cli

Memory-aware context engine for AI coding agents — up to 91% fewer tokens, 17/18 rank-1 across 6 OSS projects. MCP-native, multi-repo, with persistent observations & decisions.

5K 6 2
xhluca
bm25

Fast BM25 search in Python, powered by Numpy and Numba

4K 2K 99
robotrocketscience
aelfrice

Bayesian memory that learns from feedback for LLM agents

4K 3 0
bicardinal
brinicle

A resource-efficient C++ vector index engine built for low-RAM production workloads

4K 11 0
illuin-tech
vidore-benchmark

Vision Document Retrieval (ViDoRe): Benchmark. Evaluation code for the ColPali paper.

4K 271 35
Data from PyPI, GitHub, ClickHouse, and BigQuery