PyPI Stats
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About
Home

Exploratory Data Analysis Python Packages

Python packages with the GitHub topic exploratory-data-analysis. Sorted by relevance, with stars and monthly downloads.
great-expectations
great-expectations

Always know what to expect from your data.

31.4M 11K 2K
ydataai
ydata-profiling

1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.

1.9M 14K 2K
ydataai
pandas-profiling

1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.

603K 14K 2K
great-expectations
great-expectations-experimental

Always know what to expect from your data.

571K 11K 2K
great-expectations
acryl-great-expectations

Always know what to expect from your data.

416K 11K 2K
fbdesignpro
sweetviz

Visualize and compare datasets, target values and associations, with one line of code.

123K 3K 288
tommyod
kdepy

Kernel Density Estimation in Python

64K 644 102
cleanlab
cleanlab

Cleanlab's open-source library is the standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.

58K 11K 890
InfuseAI
piperider-nightly

Code review for data in dbt

22K 494 23
JasonKessler
scattertext

Beautiful visualizations of how language differs among document types.

20K 2K 286
zhihanyue
qgridnext

Advancing QGrid, an interactive grid for exploring DataFrames in JupyterLab/Notebook

16K 38 2
darenr
report-creator

Tool to assemble HTML reports using python components with charts and diagrams.

13K 11 1
dvgodoy
handyspark

HandySpark - bringing pandas-like capabilities to Spark dataframes

12K 200 27
sfu-db
dataprep

Open-source low code data preparation library in python. Collect, clean and visualization your data in python with a few lines of code.

11K 2K 221
cleanlab
cleanvision

Automatically find issues in image datasets and practice data-centric computer vision.

10K 1K 80
InfuseAI
piperider

Code review for data in dbt

5K 494 23
Data-Centric-AI-Community
fg-data-profiling

1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.

5K 14K 2K
desbordante
desbordante

Desbordante is a high-performance data profiler that is capable of discovering many different patterns in data using various algorithms. It also allows to run data cleaning scenarios using these algorithms. Desbordante has a console version and an easy-to-use web application.

3K 477 99
SmooSenseAI
smoosense

Interactively browse multimodal tabular data

3K 109 13
PetrKorab
arabica

Python package for text mining of time-series data

3K 75 16
ikernavarro4
datanarrator

Convierte cualquier DataFrame de pandas en un análisis en lenguaje natural

2K 0 0
lux-org
lux-api

Automatically visualize your pandas dataframe via a single print! 📊 💡

2K 5K 382
marcosalvalaggio
edamame

Exploratory data analysis tools

2K 4 0
Tim-Abwao
eda-report

Automate exploratory data analysis and reporting.

2K 10 0
    • Data from PyPI, GitHub, ClickHouse, and BigQuery