PyPI Stats
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About
Home

Search Packages

Find Python packages by name, description, GitHub topic, or filter by metrics
voxel51
fiftyone

Refine high-quality datasets and visual AI models

179K 11K 752
voxel51
fiftyone-db

Refine high-quality datasets and visual AI models

169K 11K 752
cleanlab
cleanlab

Cleanlab's open-source library is the standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.

58K 11K 890
cleanlab
cleanvision

Automatically find issues in image datasets and practice data-centric computer vision.

9K 1K 80
cleanlab
cleanlab-studio

Client interface to Cleanlab Studio

4K 31 10
voxel51
fiftyone-db-ubuntu2204

Refine high-quality datasets and visual AI models

3K 11K 752
Digital-Dermatology
selfclean

[NeurIPS 2024] 🧼🔎 A holistic self-supervised data cleaning strategy to detect irrelevant samples, near duplicates and label errors.

2K 37 2
voxel51
fiftyone-desktop

FiftyOne Desktop

2K 11K 752
aai-institute
pydvl

The Python Data Valuation Library

672 145 10
voxel51
fiftyone-db-ubuntu2004

Refine high-quality datasets and visual AI models

539 11K 752
opendataval
opendataval

OpenDataVal: a Unified Benchmark for Data Valuation in Python (NeurIPS 2023)

509 100 11
mdbloice
labeller

Quickly set up an image labelling web application for manually tagging images for machine learning tasks.

478 9 2
cleanlab
cleanlab-cli

Client interface to Cleanlab Studio

459 31 10
Hyper3Labs
hyperview

HyperView curates datasets and provides model introspection in hyperbolic and Euclidean geometries.

338 17 2
cleanlab
example-package-elisno

The standard package for data-centric AI, machine learning with label errors, and automatically finding and fixing dataset issues in Python.

300 11K 890
ear-team
bambird

Unsupervised classification to improve the quality of a bird song recording dataset. https://doi.org/10.1016/j.ecoinf.2022.101952

205 31 7
Docta-ai
docta-ai

Docta.ai

198 3K 256
code-kern-ai
kern-refinery

The data scientist's open-source choice to scale, assess and maintain natural language data. Treat training data like a software artifact.

195 1K 73
voxel51
fiftyone-db-debian9

FiftyOne DB

185 11K 752
code-kern-ai
refinery-python-sdk

Official Python SDK for Kern AI refinery.

159 20 3
voxel51
fiftyone-db-ubuntu1604

Project FiftyOne database

139 11K 752
JieyuZ2
ws-benchmark

[NeurIPS 2021] WRENCH: Weak supeRvision bENCHmark

135 227 34
voxel51
fiftyone-db-rhel7

Refine high-quality datasets and visual AI models

135 11K 752
code-kern-ai
kern-python-client

Official Python SDK for Kern AI refinery.

1 20 3
    • Data from PyPI, GitHub, ClickHouse, and BigQuery