PyPI Stats
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About
Home

Semantic Search Python Packages

Python packages with the GitHub topic semantic-search. Sorted by relevance, with stars and monthly downloads.
lancedb
lancedb

Developer-friendly OSS embedded retrieval library for multimodal AI. Search More; Manage Less.

7.1M 10K 868
embeddings-benchmark
mteb

MTEB: Massive Text Embedding Benchmark

2.7M 3K 608
deepset-ai
haystack-ai

Open-source AI orchestration framework for building context-engineered, production-ready LLM applications. Design modular pipelines and agent workflows with explicit control over retrieval, routing, memory, and generation. Built for scalable agents, RAG, multimodal applications, semantic search, and conversational systems.

776K 25K 3K
unum-cloud
usearch

Fast Open-Source Search & Clustering engine × for Vectors & Arbitrary Objects × in C++, C, Python, JavaScript, Rust, Java, Objective-C, Swift, C#, GoLang, and Wolfram 🔍

587K 4K 311
zilliztech
gptcache

Semantic cache for LLMs. Fully integrated with LangChain and llama_index.

479K 8K 580
PrithivirajDamodaran
flashrank

Lite & Super-fast re-ranking for your search & retrieval pipelines. Supports SoTA Listwise and Pairwise reranking based on LLMs and cross-encoders and more. Created by Prithivi Da, open for PRs & Collaborations.

414K 965 69
docarray
docarray

Represent, send, store and search multimodal data

144K 3K 241
deepset-ai
farm-haystack

Open-source AI orchestration framework for building context-engineered, production-ready LLM applications. Design modular pipelines and agent workflows with explicit control over retrieval, routing, memory, and generation. Built for scalable agents, RAG, multimodal applications, semantic search, and conversational systems.

67K 25K 3K
khoj-ai
khoj

Your AI second brain. Self-hostable. Get answers from the web or your docs. Build custom agents, schedule automations, do deep research. Turn any online or local LLM into your personal, autonomous AI (gpt, claude, gemini, llama, qwen, mistral). Get started - free.

60K 34K 2K
khoj-ai
khoj-assistant

Your AI second brain. Self-hostable. Get answers from the web or your docs. Build custom agents, schedule automations, do deep research. Turn any online or local LLM into your personal, autonomous AI (gpt, claude, gemini, llama, qwen, mistral). Get started - free.

47K 34K 2K
neuml
txtai

💡 All-in-one AI framework for semantic search, LLM orchestration and language model workflows

47K 12K 808
sysid
bkmr

Knowledge Management for Humans and Agents

24K 250 10
wallter
trw-memory

Persistent memory engine with hybrid retrieval, tiered storage, and semantic dedup for AI agents

24K 1 1
alibaba
zvec

A lightweight, lightning-fast, in-process vector database

23K 10K 546
tobocop2
lilbee

Terminal-first local search and AI chat over your documents, code, and crawled websites. Semantic + hybrid search, vision OCR, auto-built wiki, browsable GGUF model catalog. Works as CLI, TUI, MCP server, REST API, or Python library. Offline by default, no sidecar services.

21K 16 3
weaviate
weaviate-cli

CLI tool for Weaviate

20K 32 19
trustgraph-ai
trustgraph-base

The context development platform. Store, enrich, and retrieve structured knowledge with graph-native infrastructure, semantic retrieval, and portable context cores.

20K 2K 235
alexklibisz
elastiknn-client

Elasticsearch plugin for nearest neighbor search. Store vectors and run similarity search using exact and approximate algorithms.

20K 393 50
trustgraph-ai
trustgraph-embeddings-hf

The context development platform. Store, enrich, and retrieve structured knowledge with graph-native infrastructure, semantic retrieval, and portable context cores.

18K 2K 235
trustgraph-ai
trustgraph

The context development platform. Store, enrich, and retrieve structured knowledge with graph-native infrastructure, semantic retrieval, and portable context cores.

17K 2K 235
qualixar
superlocalmemory

World's first local-only AI memory to break 74% retrieval and 60% zero-LLM on LoCoMo. No cloud, no APIs, no data leaves your machine. Additionally, mode C (LLM/Cloud) - 87.7% LoCoMo. Research-backed. arXiv: 2603.14588

17K 133 14
trustgraph-ai
trustgraph-vertexai

The context development platform. Store, enrich, and retrieve structured knowledge with graph-native infrastructure, semantic retrieval, and portable context cores.

17K 2K 235
trustgraph-ai
trustgraph-flow

The context development platform. Store, enrich, and retrieve structured knowledge with graph-native infrastructure, semantic retrieval, and portable context cores.

17K 2K 235
trustgraph-ai
trustgraph-bedrock

The context development platform. Store, enrich, and retrieve structured knowledge with graph-native infrastructure, semantic retrieval, and portable context cores.

17K 2K 235
    • Data from PyPI, GitHub, ClickHouse, and BigQuery