PyPI Stats
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About
Home

Search Packages

Find Python packages by name, description, GitHub topic, or filter by metrics
LibreTranslate
removedup

Remove duplicates from parallel corpora

6K 7 1
veltzer
pyunique

Pyunique helps you get rid of duplicate files

478 0 0
deplicate
deplicate

Advanced Duplicate File Finder for Python. Nothing is impossible to solve.

436 79 17
KeyWeeUsr
thebear

Bear - the decluttering deduplicator

274 4 1
zeronyk
imageduplicatefinder

Simple duplication finder for Images, matches on names and then compares image hashes.

194 0 1
yugn
yadupe

Recursively scan one or more given directories for duplicate files.

172 0 1
hansalemaos
arrayhascher

Fast hash in 2D Arrays (Numpy/Pandas/lists/tuples)

153 1 0
hansalemaos
screwduplicates

provides a simple and efficient way to remove duplicates from an iterable (even with unhashable elements, optional order preservation)

120 0 0
hansalemaos
dropduplicatesplanb

Drops duplicates in DataFrames with tedious dtypes

116 0 0
hansalemaos
a-pandas-ex-duplicates-to-df

Creates a DataFrame/Series from duplicates

116 0 0
vuolter
deplicate-cli

Command Line Interface for deplicate.

112 3 1
hansalemaos
drop-duplicates-nested-list

Drops duplicates from nested list

80 0 0
hansalemaos
duplicateindexer

Find duplicates in multiple lists and return their indices and values.

73 0 0
jmsv
listset

remove duplicates from lists

68 0 0
hansalemaos
stridesduplicatefinder

Calculate overlapping values between two arrays and return the results as a DataFrame

59 0 0
NicolasBi
dupe-eraser

A command-line tool which automate the deletion of duplicate files based on their hash or perceptual-hash.

51 13 0
dealfonso
searchdups

Search for duplicate files

34 0 0
    • Data from PyPI, GitHub, ClickHouse, and BigQuery