PyPI Stats
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About
Home

Pandas Python Packages

Python packages with the GitHub topic pandas. Sorted by relevance, with stars and monthly downloads.
pandas-dev
pandas

Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more

684.3M 49K 20K
tqdm
tqdm

:zap: A Fast, Extensible Progress Bar for Python and CLI

455.7M 31K 1K
huggingface
datasets

🤗 The largest hub of ready-to-use datasets for AI models with fast, easy-to-use and efficient data manipulation tools

118.9M 21K 3K
aws
awswrangler

pandas on AWS - Easy integration with Athena, Glue, Redshift, Timestream, Neptune, OpenSearch, QuickSight, Chime, CloudWatchLogs, DynamoDB, EMR, SecretManager, PostgreSQL, MySQL, SQLServer and S3 (Parquet, CSV, JSON and EXCEL).

86.3M 4K 727
narwhals-dev
narwhals

Lightweight and extensible compatibility layer between dataframe libraries!

83.1M 2K 190
jmcnamara
xlsxwriter

A Python module for creating Excel XLSX files.

78.9M 4K 663
mwaskom
seaborn

Statistical data visualization in Python

55.8M 14K 2K
dask
dask

Parallel computing with task scheduling

26.5M 14K 2K
delta-io
deltalake

A native Rust library for Delta Lake, with bindings into Python

22.8M 3K 615
pydata
xarray

N-D labeled arrays and datasets in Python

20.8M 4K 1K
ranaroussi
yfinance

Download market data from Yahoo! Finance's API

18.6M 23K 3K
geopandas
geopandas

Python tools for geographic data

17.8M 5K 1K
pandera-dev
pandera

A light-weight, flexible, and expressive statistical data testing library

8.9M 4K 395
jmcarpenter2
swifter

A package which efficiently applies any function to a pandas dataframe or series in the fastest available manner

8.2M 3K 104
robin900
gspread-dataframe

Read/write Google spreadsheets using pandas DataFrames

3.6M 260 24
dimastbk
python-calamine

Python binding for Rust's library for reading excel and odf file - calamine.

3.6M 459 14
fugue-project
fugue

A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rewrites.

3.3M 2K 100
ToucanToco
fastexcel

A fast excel reader for Rust and Python

3.1M 230 20
thombashi
pytablewriter

pytablewriter is a Python library to write a table in various formats: AsciiDoc / CSV / Elasticsearch / HTML / JavaScript / JSON / LaTeX / LDJSON / LTSV / Markdown / MediaWiki / NumPy / Excel / Pandas / Python / reStructuredText / SQLite / TOML / TSV.

3.1M 630 47
databrickslabs
dbl-tempo

API for manipulating time series on top of Apache Spark: lagged time values, rolling statistics (mean, avg, sum, count, etc), AS OF joins, downsampling, and interpolation

3M 341 59
capitalone
datacompy

Pandas, Polars, Spark, and Snowpark DataFrame comparison for humans and more!

2.6M 639 160
amirziai
flatten-json

Flatten JSON in Python

2.5M 553 97
chezou
tabula-py

Simple wrapper of tabula-java: extract table from PDF into pandas DataFrame

2.3M 2K 304
modin-project
modin

Modin: Scale your Pandas workflows by changing a single line of code

2.1M 10K 673
    • Data from PyPI, GitHub, ClickHouse, and BigQuery