PyPI Stats

Data Observability Python Packages

Python packages tagged with the GitHub topic data-observability, sorted by relevance and shown with stars and monthly downloads.
datahub-project
acryl-datahub

The Context Platform for your Data and AI Stack

4.6M 12K 3K
elementary-data
elementary-data

The dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-hosted or cloud service with premium features.

1.2M 2K 214
datahub-project
acryl-datahub-airflow-plugin

The Context Platform for your Data and AI Stack

963K 12K 3K
open-metadata
openmetadata-ingestion

OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team collaboration.

391K 14K 2K
datahub-project
acryl-datahub-dagster-plugin

The Context Platform for your Data and AI Stack

159K 12K 3K
linkedin
acryl-executor

The Context Platform for your Data and AI Stack

148K 12K 3K
datahub-project
acryl-datahub-gx-plugin

The Context Platform for your Data and AI Stack

60K 12K 3K
open-metadata
openmetadata-managed-apis

OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team collaboration.

39K 14K 2K
datahub-project
prefect-datahub

The Context Platform for your Data and AI Stack

34K 12K 3K
InfuseAI
piperider-nightly

Code review for data in dbt

22K 494 23
datahub-project
datahub-agent-context

The Context Platform for your Data and AI Stack

21K 12K 3K
sodadata
soda-spark

Soda Spark is a PySpark library that helps you with testing your data in Spark Dataframes

7K 64 7
re-data
re-data

re_data - fix data issues before your users & CEO would discover them 😊

6K 2K 125
InfuseAI
piperider

Code review for data in dbt

5K 494 23
DataKitchen
dataops-testgen

DataOps Data Quality TestGen is part of DataKitchen's Open Source Data Observability. DataOps TestGen delivers simple, fast data quality test generation and execution by data profiling, new dataset hygiene review, AI generation of data quality validation tests, ongoing testing of data refreshes, & continuous anomaly monitoring

3K 73 6
open-metadata
openmetadata-airflow-managed-apis

OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team collaboration.

3K 14K 2K
data-drift
driftdb

Historical metric store

2K 331 12
rbmuller
scherlok

A detective for your data. Zero-config data quality monitoring — works with dbt, Postgres, BigQuery, Snowflake. No YAML.

2K 1 1
opendatadiscovery
odd-collector-sdk

ODD Collector

2K 4 0
datahub-project
acryl-datahub-airflow-plugin-patched

The Context Platform for your Data and AI Stack

1K 12K 3K
data-drift
datagit

Metrics Observability & Troubleshooting

1K 331 12
ottogroup
koality

Library for data checks and data quality monitoring based on duckdb.

1K 4 1
datachecks
dcs-core

Open Source Data Quality Monitoring.

1K 170 23
dqops
dqops

DQOps Data Quality Operations Center

1K 192 36
Data from PyPI, GitHub, ClickHouse, and BigQuery