PyPI Stats
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About
Home

Data Catalog Python Packages

Python packages with the GitHub topic data-catalog. Sorted by relevance, with stars and monthly downloads.
datahub-project
acryl-datahub

The Context Platform for your Data and AI Stack

4.6M 12K 3K
datahub-project
acryl-datahub-airflow-plugin

The Context Platform for your Data and AI Stack

963K 12K 3K
open-metadata
openmetadata-ingestion

OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team collaboration.

391K 14K 2K
datahub-project
acryl-datahub-dagster-plugin

The Context Platform for your Data and AI Stack

159K 12K 3K
linkedin
acryl-executor

The Context Platform for your Data and AI Stack

148K 12K 3K
amundsen-io
amundsen-common

Amundsen is a metadata driven application for improving the productivity of data analysts, data scientists and engineers when interacting with data.

140K 5K 966
datahub-project
acryl-datahub-gx-plugin

The Context Platform for your Data and AI Stack

60K 12K 3K
open-metadata
openmetadata-managed-apis

OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team collaboration.

39K 14K 2K
datahub-project
prefect-datahub

The Context Platform for your Data and AI Stack

34K 12K 3K
intake
intake-esm

An intake plugin for parsing an Earth System Model (ESM) catalog and loading assets into xarray datasets.

27K 160 53
datahub-project
datahub-agent-context

The Context Platform for your Data and AI Stack

21K 12K 3K
recap-cloud
recap-core

Work with your web service, database, and streaming schemas in a single format.

10K 351 26
docglow
docglow

Modern documentation site generator for dbt Core — lineage explorer, health scoring, full-text search. Live demo: https://demo.docglow.com

6K 88 2
open-metadata
openmetadata-airflow-managed-apis

OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team collaboration.

3K 14K 2K
amundsen-io
amundsen-search

Amundsen is a metadata driven application for improving the productivity of data analysts, data scientists and engineers when interacting with data.

3K 5K 966
apache
apache-gravitino

World's most powerful open data catalog for building a high-performance, geo-distributed and federated metadata lake.

2K 3K 818
Intugle
intugle

The GenAI-powered toolkit for automated data intelligence.

2K 148 43
tokern
piicatcher

Scan databases and data warehouses for PII data. Tag tables and columns in data catalogs like Amundsen and Datahub

2K 340 97
datahub-project
acryl-datahub-airflow-plugin-patched

The Context Platform for your Data and AI Stack

1K 12K 3K
gauthierpiarrette
dbt-features

Feature catalog for dbt projects, built for ML teams.

1K 0 0
open-metadata
openmetadata-ingestion-core

OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team collaboration.

1K 14K 2K
carte-data
carte-cli

A Python library to generate static data catalog sites. Carte scrapes metadata from your data assets and generates a fully searchable front end that's just HTML.

1K 29 0
datahub-project
acryl-datahub-airflow-plugin-hcc-patched

The Context Platform for your Data and AI Stack

1K 12K 3K
related-sciences
articat

articat: data artifact catalog

798 17 2
    • Data from PyPI, GitHub, ClickHouse, and BigQuery