PyPI Stats
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About
Home

Search Packages

Find Python packages by name, description, GitHub topic, or filter by metrics
J535D165
recordlinkage

A powerful and modular toolkit for record linkage and duplicate detection in Python

4.6M 1K 153
luozhouyang
strsimpy

A library implementing different string similarity and distance measures using Python.

346K 1K 124
dingkeyan93
dists-pytorch

IQA: Deep Image Structure and Texture Similarity Metric

78K 478 46
luozhouyang
strsim

A library implementing different string similarity and distance measures using Python.

21K 1K 124
iscc
iscc-core

ISCC - Codec & Algorithms

6K 23 5
related-sciences
nxontology

NetworkX-based Python library for representing ontologies

6K 95 7
shibing624
text2vec

text2vec, text to vector. 文本向量表征工具,把文本转化为向量矩阵,实现了Word2Vec、RankBM25、Sentence-BERT、CoSENT等文本表征、文本相似度计算模型,开箱即用。

4K 5K 427
mqcomplab
bblean

BitBIRCH-Lean, a memory-efficient implementation of BitBIRCH designed for high-throughput clustering of huge molecular libraries

4K 116 13
iscc
iscc-sdk

SDK for creating ISCCs (International Standard Content Codes)

3K 19 7
matiskay
html-similarity

Compare html similarity using structural and style metrics

3K 218 24
wi2trier
cbrkit

Customizable Case-Based Reasoning (CBR) toolkit for Python with a built-in API and CLI.

3K 22 4
elisemercury
difpy

difPy - Python package for finding duplicate and similar images

2K 546 71
cnpem
mhcxgraph

A Python package for detecting potential T cell receptor cross-reactivity based on peptide–MHC structures

1K 5 0
shibing624
similarities

Similarities: a toolkit for similarity calculation and semantic search. 相似度计算、匹配搜索工具包,支持亿级数据文搜文、文搜图、图搜图,python3开发,开箱即用。

1K 902 88
nelsonwenner
shapesimilarity

:chart_with_upwards_trend: The package allows you to check the similarity between two shapes/curves, using Frechet distance together with Procrustes analysis.

867 48 9
iscc
iscc-usearch

Scalable approximate nearest neighbor search for variable-length binary bit-vectors using NPHD metric.

839 3 0
lignum-vitae
goombay

Python implementation of several local, global, and multiple sequence alignment algorithms intended to calculate distance, show alignment, and display the underlying matrices.

669 21 3
Wazzabeee
copy-spotter

Make plagiarism detection easier. This script will find similar sentences between given files and highlight them in a side by side comparison.

625 58 16
google
unisim

UniSim is a package for efficient similarity computation, fuzzy matching, and clustering of data.

618 147 9
usc-isi-i2
rltk

Record Linkage ToolKit (Find and link entities)

606 111 22
ryusudol
pytorch-cka

Centered Kernel Alignment (CKA) with Efficient Computation and Layer-wise Visualization for PyTorch

527 5 1
DengBoCong
sentence2vec

an elegant sentence2vec

513 182 32
raamana
mrivis

medical image visualization library and development toolkit

504 23 3
momonga-ml
gower-exp

Production-ready Gower distance with modern Python tooling

444 12 0
    • Data from PyPI, GitHub, ClickHouse, and BigQuery