PyPI Stats
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About
Home

Semantic Search Engine Python Packages

Python packages with the GitHub topic semantic-search-engine. Sorted by relevance, with stars and monthly downloads.
nuclia
nucliadb-utils

NucliaDB, The AI Search database for RAG

196K 720 58
nuclia
nucliadb-models

NucliaDB, The AI Search database for RAG

144K 720 58
nuclia
nucliadb-protos

NucliaDB, The AI Search database for RAG

143K 720 58
nuclia
nucliadb-telemetry

NucliaDB, The AI Search database for RAG

129K 720 58
nuclia
nucliadb-dataset

NucliaDB, The AI Search database for RAG

125K 720 58
nuclia
nucliadb

NucliaDB, The AI Search database for RAG

109K 720 58
nuclia
nucliadb-sdk

NucliaDB, The AI Search database for RAG

61K 720 58
nuclia
nidx-protos

NucliaDB, The AI Search database for RAG

36K 720 58
nuclia
nidx-binding

Bindings for nidx (part of nucliadb)

11K 720 58
lemon07r
vera-ai

Local code search combining BM25, vector similarity, and cross-encoder reranking. Parses 60+ languages with tree-sitter, runs entirely offline, and returns structured results with file paths, line ranges, and symbol metadata. Built in Rust.

6K 70 8
ad-freiburg
qlever

Graph database implementing the RDF and SPARQL standards. Very fast and scales to more than a trillion triples on a single commodity machine

5K 827 113
ddickmann
latence-solver

CPU-reference-first Tabu Search Quadratic Knapsack solver with optional accelerator hooks

5K 11 1
ddickmann
voyager-index

Shard-first late-interaction retrieval for ColBERT and ColPali style workloads with CPU/GPU modes, Triton MaxSim, BM25 hybrid search, durable CRUD/WAL, multimodal preprocessing, and base64-ready reference APIs.

2K 11 1
0xDebabrata
citrusdb

(distributed) vector database

687 105 13
ddickmann
colsearch

High-performance late-interaction retrieval engine for on-prem AI. ColBERT/ColPali multi-vector search with Rust fused MaxSim, Triton GPU kernels, ROQ quantization, LEMUR routing, WAL-backed CRUD, and a FastAPI server — single machine, CPU or GPU.

398 12 1
M9nx
codexa

Codexa is a local semantic code intelligence CLI designed to help AI assistants and developers understand large codebases faster. It indexes repositories, parses code structure, generates embeddings, and enables powerful semantic search across functions, classes, and modules.

352 30 2
s-emanuilov
litepali

Lightweight ColPali-based retrieval for cloud

330 122 11
bhavsarpratik
semantic-search

Make semantic search easier

120 15 2
KenHung
ezra-search

Ezra 聖經語意搜尋 - Semantic Search Engine for Chinese Bible

61 0 1
    • Data from PyPI, GitHub, ClickHouse, and BigQuery