PyPI Stats
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About
Home

Data Analysis Python Packages

Python packages with the GitHub topic data-analysis. Sorted by relevance, with stars and monthly downloads.
pandas-dev
pandas

Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more

684.3M 49K 20K
scikit-learn
scikit-learn

scikit-learn: machine learning in Python

207.1M 66K 27K
aws
redshift-connector

Redshift Python Connector. It supports Python Database API Specification v2.0.

49.6M 218 87
statsmodels
statsmodels

Statsmodels: statistical modeling and econometrics in Python

36.7M 11K 3K
streamlit
streamlit

Streamlit — A faster way to build and share data apps.

27.7M 44K 4K
gradio-app
gradio

Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!

16.3M 43K 3K
scikit-learn-contrib
imbalanced-learn

A Python Package to Tackle the Curse of Imbalanced Datasets in Machine Learning

13.2M 7K 1K
gradio-app
gradio-client

Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!

9.8M 43K 3K
has2k1
plotnine

A Grammar of Graphics for Python

3.6M 5K 246
databrickslabs
dbl-tempo

API for manipulating time series on top of Apache Spark: lagged time values, rolling statistics (mean, avg, sum, count, etc), AS OF joins, downsampling, and interpolation

3M 341 59
akfamily
akshare

AKShare is an elegant and simple financial data interface library for Python, built for human beings! 开源财经数据接口库

2.7M 19K 3K
ydataai
ydata-profiling

1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.

1.9M 14K 2K
dylan-profiler
visions

Type System for Data Analysis in Python

1.6M 217 20
elementary-data
elementary-data

The dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-hosted or cloud service with premium features.

1.2M 2K 214
scikit-hep
awkward

Manipulate JSON-like data with NumPy-like idioms.

1.1M 958 121
pydata
pandas-datareader

Extract data from a wide range of Internet sources into a pandas DataFrame.

843K 3K 692
arvkevi
kneed

Knee point detection in Python :chart_with_upwards_trend:

831K 808 75
scikit-hep
awkward-cpp

Manipulate JSON-like data with NumPy-like idioms.

806K 958 121
flyteorg
flyteidl

Dynamic, resilient AI orchestration. Coordinate data, models, and compute as you build AI workflows.

636K 7K 812
ydataai
pandas-profiling

1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.

611K 14K 2K
predict-idlab
plotly-resampler

Visualize large time series data with plotly.py

541K 1K 74
dfm
corner

Make some beautiful corner plots

477K 570 234
alan-turing-institute
clevercsv

CleverCSV is a Python package for handling messy CSV files. It provides a drop-in replacement for the builtin CSV module with improved dialect detection, and comes with a handy command line application for working with CSV files.

401K 1K 80
reflex-dev
reflex-hosting-cli

🕸️ Web apps in pure Python 🐍

377K 28K 2K
    • Data from PyPI, GitHub, ClickHouse, and BigQuery