PyPI Stats
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About
Home

Search Packages

Find Python packages by name, description, GitHub topic, or filter by metrics
meltano
meltano

Meltano: the declarative code-first data integration engine that powers your wildest data and ML-powered product ideas. Say goodbye to writing, maintaining, and scaling your own API integrations.

1.2M 2K 235
elementary-data
elementary-data

The dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-hosted or cloud service with premium features.

1.2M 2K 214
SQLMesh
sqlmesh

Scalable and efficient data transformation framework - backwards compatible with dbt.

508K 3K 380
polyaxon
traceml

Engine for ML/Data tracking, visualization, explainability, drift detection, and dashboards for Polyaxon.

142K 530 47
polyaxon
datatile

Engine for ML/Data tracking, visualization, explainability, drift detection, and dashboards for Polyaxon.

113K 530 47
mouradmourafiq
pandas-summary

Engine for ML/Data tracking, visualization, explainability, drift detection, and dashboards for Polyaxon.

113K 530 47
InfuseAI
recce

The data-validation toolkit for enhanced dbt (data build tool) PR review

48K 454 26
InfuseAI
recce-nightly

The data-validation toolkit for enhanced dbt (data build tool) PR review

42K 454 26
vmware
quickstart-vdk

One framework to develop, deploy and operate data workflows with Python and SQL.

36K 481 66
DataRecce
recce-cloud-nightly

The data-validation toolkit for enhanced dbt (data build tool) PR review

14K 454 26
tenzir
tenzir

Tenzir is the data pipeline engine for security teams.

13K 737 103
DataRecce
recce-cloud

The data-validation toolkit for enhanced dbt (data build tool) PR review

6K 454 26
vmware
vdk-core

One framework to develop, deploy and operate data workflows with Python and SQL.

4K 481 66
tulibraries
tulflow

Package of Temple University Library Indexing & ETL functions used by Airflow.

3K 3 1
DataKitchen
dataops-testgen

DataOps Data Quality TestGen is part of DataKitchen's Open Source Data Observability. DataOps TestGen delivers simple, fast data quality test generation and execution by data profiling,  new dataset hygiene review, AI generation of data quality validation tests, ongoing testing of data refreshes, & continuous anomaly monitoring

3K 73 6
awslabs
aws-ddk-core

An open source development framework to help you build data workflows and modern data architecture on AWS.

3K 271 24
vmware
vdk-jupyterlab-extension

One framework to develop, deploy and operate data workflows with Python and SQL.

2K 481 66
fabdendev
dagster-mcp

MCP server that wraps the Dagster GraphQL API — monitor and operate runs, assets, schedules, sensors, and backfills from any MCP client

2K 4 2
Titan-Systems
titan-core

Titan Core: Snowflake infrastructure as code

2K 480 39
polyaxon
polyaxon-cli

Polyaxon Core Client & CLI to streamline MLOps

2K 19 17
sernst
cauldron-notebook

Interactive computing for complex data processing, modeling and analysis in Python 3

2K 79 128
stateful-y
kedro-dagster

Kedro plugin to support running pipelines on Dagster

2K 23 1
eyecan-ai
pipelime-python

A swiss army knife for data processing!

2K 19 0
beneath-hq
beneath

Beneath is a serverless real-time data platform ⚡️

1K 84 10
    • Data from PyPI, GitHub, ClickHouse, and BigQuery