PyPI Stats

Search Packages

Find Python packages by name, description, GitHub topic, or filter by metrics
HumanSignal
label-studio

Label Studio is a multi-type data labeling and annotation tool with standardized output format

114K 27K 4K
segments-ai
segments-ai

Segments.ai Python SDK

86K 27 10
cleanlab
cleanlab

Cleanlab's open-source library is the standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.

58K 11K 890
shoumikchow
bbox-visualizer

Make drawing and labeling bounding boxes a piece of cake

43K 413 35
alteryx
composeml

A machine learning tool for automated prediction engineering. It allows you to easily structure prediction problems and generate labels for supervised learning.

19K 510 50
doccano
auto-labeling-pipeline

doccano auto labeling pipeline helps doccano to annotate a document automatically.

12K 45 18
doccano
doccano

Open source annotation tool for machine learning practitioners.

10K 11K 2K
Toloka
toloka-kit

Toloka-Kit is a Python library for working with Toloka API.

5K 212 35
cleanlab
cleanlab-studio

Client interface to Cleanlab Studio

4K 31 10
davidjurgens
potato-annotation

potato: the portable annotation tool

3K 380 70
doccano
doccano-client

A simple client for doccano API.

3K 87 68
langformers
langformers

🚀 Unified NLP Pipelines for Language Models

2K 19 1
MichaelAkridge-NOAA
coral-annotation-tool

CAT: Coral Annotation Tool for Structure from Motion (SfM) Orthomosaic coral reef annotation.

2K 4 1
strickvl
panlabel

Universal annotation converter

1K 15 0
phurwicz
hover

🚤 Label data at scale. Fun and precision included.

999 330 19
cleanlab
cleanlab-cli

Client interface to Cleanlab Studio

459 31 10
heartexlabs
pyheartex

Heartex Python SDK - Connect your own models to Heartex Data Labeling

405 27 7
smrfeld
dash-annotate-cv

A Python library for computer vision annotation tasks using Dash

381 11 2
liuxiaotong
knowlyr-datalabel

Serverless annotation framework with LLM pre-labeling, inter-annotator agreement analysis & offline HTML interface. CLI + MCP ready.

301 0 0
cleanlab
example-package-elisno

The standard package for data-centric AI, machine learning with label errors, and automatically finding and fixing dataset issues in Python.

300 11K 890
villagecomputing
superpipe-py

Build unstructured-to-structured data transformation pipelines

291 108 2
ksavkin
swiftlabel

Keyboard-first image classification tool for ML practitioners

235 6 0
villagecomputing
labelkit

Superpipe - optimized LLM pipelines for structured data

195 108 2
code-kern-ai
kern-refinery

The data scientist's open-source choice to scale, assess and maintain natural language data. Treat training data like a software artifact.

195 1K 73
    • Data from PyPI, GitHub, ClickHouse, and BigQuery