PyPI Stats
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About
Home

Data Governance Python Packages

Python packages with the GitHub topic data-governance. Sorted by relevance, with stars and monthly downloads.
datahub-project
acryl-datahub

The Context Platform for your Data and AI Stack

4.6M 12K 3K
reata
sqllineage

SQL Lineage Analysis Tool powered by Python

1.7M 2K 276
elementary-data
elementary-data

The dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-hosted or cloud service with premium features.

1.2M 2K 214
datahub-project
acryl-datahub-airflow-plugin

The Context Platform for your Data and AI Stack

963K 12K 3K
open-metadata
openmetadata-ingestion

OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team collaboration.

391K 14K 2K
datahub-project
acryl-datahub-dagster-plugin

The Context Platform for your Data and AI Stack

159K 12K 3K
linkedin
acryl-executor

The Context Platform for your Data and AI Stack

148K 12K 3K
datahub-project
acryl-datahub-gx-plugin

The Context Platform for your Data and AI Stack

60K 12K 3K
open-metadata
openmetadata-managed-apis

OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team collaboration.

39K 14K 2K
datahub-project
prefect-datahub

The Context Platform for your Data and AI Stack

34K 12K 3K
datahub-project
datahub-agent-context

The Context Platform for your Data and AI Stack

21K 12K 3K
OpenDQV
opendqv

Open-source, contract-driven data quality validation. Shift-left enforcement at the point of write — before data enters your pipeline.

15K 10 2
flyersworder
agentic-data-contracts

YAML-first, domain-driven data governance for AI agents — teach agents your business domains, metrics, and rules before they write SQL

9K 6 0
open-metadata
openmetadata-airflow-managed-apis

OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team collaboration.

3K 14K 2K
data-drift
driftdb

Historical metric store

2K 331 12
Titan-Systems
titan-core

Titan Core: Snowflake infrastructure as code

2K 480 39
MetricProvenance
odgs

Open Data Governance Standard — Sovereign Validation Engine

2K 0 0
datahub-project
acryl-datahub-airflow-plugin-patched

The Context Platform for your Data and AI Stack

1K 12K 3K
data-drift
datagit

Metrics Observability & Troubleshooting

1K 331 12
datachecks
dcs-core

Open Source Data Quality Monitoring.

1K 170 23
open-metadata
openmetadata-ingestion-core

OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team collaboration.

1K 14K 2K
datahub-project
acryl-datahub-airflow-plugin-hcc-patched

The Context Platform for your Data and AI Stack

1K 12K 3K
daxa-ai
pebblo

Pebblo enables developers to safely load data and promote their Gen AI app to deployment

931 149 44
mesmacosta
datacatalog-util

A Python package to centralize some Google Cloud Data Catalog scripts, this repo contains commands like bulk CSV operations that help leverage Data Catalog features.

796 20 7
    • Data from PyPI, GitHub, ClickHouse, and BigQuery