PyPI Stats

Search Packages

Find Python packages by name, description, GitHub topic, or filter by metrics
great-expectations
great-expectations

Always know what to expect from your data.

31.4M 11K 2K
ydataai
ydata-profiling

1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.

1.9M 14K 2K
ydataai
pandas-profiling

1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.

614K 14K 2K
great-expectations
great-expectations-experimental

Always know what to expect from your data.

573K 11K 2K
great-expectations
acryl-great-expectations

Always know what to expect from your data.

410K 11K 2K
fbdesignpro
sweetviz

Visualize and compare datasets, target values and associations, with one line of code.

118K 3K 288
tommyod
kdepy

Kernel Density Estimation in Python

63K 644 102
cleanlab
cleanlab

Cleanlab's open-source library is the standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.

58K 11K 890
InfuseAI
piperider-nightly

Code review for data in dbt

24K 494 23
JasonKessler
scattertext

Beautiful visualizations of how language differs among document types.

20K 2K 286
zhihanyue
qgridnext

Advancing QGrid, an interactive grid for exploring DataFrames in JupyterLab/Notebook

15K 38 2
dvgodoy
handyspark

HandySpark - bringing pandas-like capabilities to Spark dataframes

12K 200 27
darenr
report-creator

Tool to assemble HTML reports using python components with charts and diagrams.

12K 11 1
sfu-db
dataprep

Open-source low-code data preparation library in Python. Collect, clean, and visualize your data in Python with a few lines of code.

11K 2K 221
cleanlab
cleanvision

Automatically find issues in image datasets and practice data-centric computer vision.

9K 1K 80
InfuseAI
piperider

Code review for data in dbt

6K 494 23
Data-Centric-AI-Community
fg-data-profiling

1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.

4K 14K 2K
SmooSenseAI
smoosense

Interactively browse multimodal tabular data

4K 109 13
desbordante
desbordante

Desbordante is a high-performance data profiler capable of discovering many different patterns in data using various algorithms. It can also run data cleaning scenarios using these algorithms. Desbordante has a console version and an easy-to-use web application.

3K 477 99
PetrKorab
arabica

Python package for text mining of time-series data

3K 75 16
ikernavarro4
datanarrator

Turns any pandas DataFrame into a natural-language analysis

3K 0 0
marcosalvalaggio
edamame

Exploratory data analysis tools

2K 4 0
lux-org
lux-api

Automatically visualize your pandas dataframe via a single print! 📊 💡

2K 5K 382
Tim-Abwao
eda-report

Automate exploratory data analysis and reporting.

2K 10 0
Data from PyPI, GitHub, ClickHouse, and BigQuery