PyPI Stats
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About
Home

Search Packages

Find Python packages by name, description, GitHub topic, or filter by metrics
great-expectations
great-expectations

Always know what to expect from your data.

31.4M 11K 2K
databrickslabs
databricks-labs-dqx

Databricks framework to validate Data Quality of pySpark DataFrames and Tables

5.1M 405 111
ydataai
ydata-profiling

1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.

1.9M 14K 2K
ydataai
pandas-profiling

1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.

614K 14K 2K
great-expectations
great-expectations-experimental

Always know what to expect from your data.

573K 11K 2K
great-expectations
acryl-great-expectations

Always know what to expect from your data.

410K 11K 2K
open-metadata
openmetadata-ingestion

OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team collaboration.

394K 14K 2K
polyaxon
traceml

Engine for ML/Data tracking, visualization, explainability, drift detection, and dashboards for Polyaxon.

142K 530 47
fbdesignpro
sweetviz

Visualize and compare datasets, target values and associations, with one line of code.

118K 3K 288
polyaxon
datatile

Engine for ML/Data tracking, visualization, explainability, drift detection, and dashboards for Polyaxon.

113K 530 47
mouradmourafiq
pandas-summary

Engine for ML/Data tracking, visualization, explainability, drift detection, and dashboards for Polyaxon.

113K 530 47
cleanlab
cleanlab

Cleanlab's open-source library is the standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.

58K 11K 890
open-metadata
openmetadata-managed-apis

OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team collaboration.

39K 14K 2K
InfuseAI
piperider-nightly

Code review for data in dbt

24K 494 23
cleanlab
cleanvision

Automatically find issues in image datasets and practice data-centric computer vision.

9K 1K 80
polyaxon
haupt

Lineage metadata API, artifacts streams, sandbox, API, and spaces for Polyaxon

8K 451 207
InfuseAI
piperider

Code review for data in dbt

6K 494 23
ironmussa
optimuspyspark

:truck: Agile Data Preparation Workflows made easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark

6K 2K 232
cleanlab
cleanlab-studio

Client interface to Cleanlab Studio

4K 31 10
Data-Centric-AI-Community
fg-data-profiling

1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.

4K 14K 2K
ing-bank
popmon

Monitor the stability of a Pandas or Spark dataframe ⚙︎

4K 511 36
desbordante
desbordante

Desbordante is a high-performance data profiler that is capable of discovering many different patterns in data using various algorithms. It also allows to run data cleaning scenarios using these algorithms. Desbordante has a console version and an easy-to-use web application.

3K 477 99
open-metadata
openmetadata-airflow-managed-apis

OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team collaboration.

3K 14K 2K
ShriniwasAhirrao
parseiq

AI-powered data quality and metadata analysis agent

1K 2 0
    • Data from PyPI, GitHub, ClickHouse, and BigQuery