PyPI Stats

Search Packages

Find Python packages by name, description, GitHub topic, or filter by metrics
snorkel-team
snorkel

A system for quickly generating training data with weak supervision

79K 6K 854
cleanlab
cleanlab

Cleanlab's open-source library is the standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.

58K 11K 890
recognai
rubrix

Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets

12K 5K 483
argilla-io
argilla-server

Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets

2K 5K 483
NorskRegnesentral
skweak

skweak: A software toolkit for weak supervision applied to NLP tasks

1K 926 77
doccano
spacy-partial-tagger

Sequence Tagger for Partially Annotated Dataset in spaCy

753 24 2
AlvaroCavalcante
auto-annotate

Labeling is boring. Use this tool to speed up your next object detection project!

393 162 33
StatBiomed
finest

FineST: Fine-grained Spatial Transcriptomic

391 19 2
decile-team
decile-spear

SPEAR is a library for data programming with semi-supervision that provides facilities to programmatically label and build training data

336 110 23
Shihab-Shahriar
scikit-clean

A collection of algorithms for detecting and handling label noise

304 16 3
cleanlab
example-package-elisno

The standard package for data-centric AI, machine learning with label errors, and automatically finding and fixing dataset issues in Python.

286 11K 890
wurenzhi
hyperlm

A hyper label model to aggregate multiple weak labels in a single forward pass

254 9 4
argilla-io
argilla-v1

Open-source tool for exploring, labeling, and monitoring data for NLP projects.

237 5K 483
liamtoran
flippers

Flippers is a weak supervision library for creating high-quality labels using your domain knowledge and weak supervision sources.

234 4 1
knodle
knodle

A PyTorch-based open-source framework that provides methods for improving weakly annotated data and allows researchers to efficiently develop and compare their own methods.

205 108 15
JieyuZ2
ws-benchmark

[NeurIPS 2021] WRENCH: Weak supeRvision bENCHmark

155 227 34
HazyResearch
snorkel-ie

A system for quickly generating training data with weak supervision

32 6K 855
    • Data from PyPI, GitHub, ClickHouse, and BigQuery