PyPI Stats
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About
Home

Search Packages

Find Python packages by name, description, GitHub topic, or filter by metrics
smarthi
pymuvera

Python library for MUVERA multi-vector retrieval via Fixed Dimensional Encodings. ColBERT / ColQwen2 / ColQwen3.5 compatible.

2K 1 0
Shangri-la-0428
thronglets

P2P shared memory substrate for AI agents — stigmergic knowledge network via libp2p

464 4 1
oduwsdl
otmt

This system evaluates a collection of mementos (archived web pages) to determine which are off topic. The collection can be part of an Archive-It collection, a single TimeMap, or stored in a WARC file.

275 9 3
serega
gaoya

Locality Sensitive Hashing

263 80 9
Marcnuth
deduplication

Remove duplicate documents via popular algorithms such as SimHash, SpotSig, Shingling, etc.

238 18 5
saeeddhqan
entropy-hash

EntropyHash: near duplicate detection algorithm

167 0 0
hybridtheory
floc-simhash

A fast python implementation of the SimHash algorithm

85 27 7
kiwirafe
xiangsi

中文文本相似度计算器

83 170 23
kiwirafe
xiangshi

中文文本相似度计算器

13 170 23
    • Data from PyPI, GitHub, ClickHouse, and BigQuery