PyPI Stats

Search Packages

Find Python packages by name, description, or GitHub topic, or filter by metrics
databrickslabs
databricks-labs-dqx

Databricks framework to validate Data Quality of pySpark DataFrames and Tables

5.1M 405 111
datafold
collate-data-diff

Compare tables within or across databases

934K 3K 305
Arize-ai
arize

A Python client to interact with Arize API

753K 58 20
datafold
data-diff

Compare tables within or across databases

56K 3K 305
re-data
re-data

re_data - fix data issues before your users & CEO would discover them 😊

6K 2K 125
Bilpapster
streamdaq

🦆 Stream-first data quality monitoring in Python! Learn more: https://arxiv.org/abs/2506.06147

1K 19 2
dqops
dqops

DQOps Data Quality Operations Center

1K 192 36
datachecks
dcs-core

Open Source Data Quality Monitoring.

1K 170 23
weiser-ai
weiser-ai

Enterprise-grade data quality framework with YAML configuration, LLM-friendly design, and advanced statistical validation

667 2 0
waterdipai
datachecks

Open Source Data Quality Monitoring.

514 170 23
Arize-ai
arize-slim

A Python client to interact with Arize API

197 58 20
datafold
cz-data-diff

Command-line tool and Python library to efficiently diff rows across two different databases.

184 3K 305
realdatadriven
etlx-wrapper

Python wrapper for ETLX CLI to run ETL workflows from Python

174 40 3
dqoai
dqoai

Data Quality and Observability platform for the whole data lifecycle, from profiling new data sources to full automation with Data Observability. Configure data quality checks from the UI or in YAML files, let DQOps run the data quality checks daily to detect data quality issues.

2 190 36
Data from PyPI, GitHub, ClickHouse, and BigQuery