PyPI Stats

Search Packages

Find Python packages by name, description, or GitHub topic, or filter by metrics
ekzhu
datasketch

MinHash, LSH, LSH Forest, Weighted MinHash, HyperLogLog, HyperLogLog++, LSH Ensemble and HNSW

6.7M 3K 317
beowolx
rensa

High-performance MinHash implementation in Rust with Python bindings for efficient similarity estimation and deduplication of large datasets

65K 241 21
sourmash-bio
sourmash

Quickly search, compare, and analyze genomic and metagenomic data sets.

12K 543 91
src-d
libmhcuda

Accelerated Weighted MinHash-ing on GPU

698 122 26
justinbt1
akin

Python library for detecting near duplicate texts in a corpus at scale.

354 9 0
serega
gaoya

Locality Sensitive Hashing

263 80 9
lgautier
mashing-pumpkins

Minhash and maxhash library in Python, combining flexibility, expressivity, and performance.

120 22 3
kiwirafe
xiangsi

Chinese text similarity calculator

83 170 23
dnbaker
sketch-ds

C++ Implementations of sketch data structures with SIMD Parallelism, including Python bindings

54 158 14
kiwirafe
xiangshi

Chinese text similarity calculator

13 170 23
Data from PyPI, GitHub, ClickHouse, and BigQuery