PyPI Stats

Search Packages

Find Python packages by name, description, GitHub topic, or filter by metrics
narwhals-dev
narwhals

Lightweight and extensible compatibility layer between dataframe libraries!

82.4M 2K 190
dask
dask

Parallel computing with task scheduling

26.5M 14K 2K
pydata
xarray

N-D labeled arrays and datasets in Python

20.6M 4K 1K
dask
distributed

A distributed task scheduler for Dask

8.6M 2K 757
jmcarpenter2
swifter

A package which efficiently applies any function to a pandas dataframe or series in the fastest available manner

8.2M 3K 104
fugue-project
fugue

A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rewrites.

3.4M 2K 100
capitalone
datacompy

Pandas, Polars, Spark, and Snowpark DataFrame comparison for humans and more!

2.6M 639 160
data-apis
array-api-compat

Compatibility layer for common array libraries to support the Array API

2.4M 121 43
fugue-project
fugue-sql-antlr

A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rewrites.

1.2M 2K 100
Nixtla
mlforecast

Scalable machine 🤖 learning for time series forecasting.

978K 1K 125
ray-project
xgboost-ray

Distributed XGBoost on Ray

730K 154 35
jcmgray
autoray

Abstract your array operations.

699K 170 11
stumpy-dev
stumpy

STUMPY is a powerful and scalable Python library for modern time series analysis

464K 4K 348
p2p-ld
numpydantic

Type annotations for specifying, validating, and serializing arrays with arbitrary backends in Pydantic (and beyond)

383K 140 5
xarray-contrib
flox

Fast & furious GroupBy operations for dask.array

360K 135 22
rapidsai
libcudf-cu12

cuDF - GPU DataFrame Library

266K 10K 1K
dask
dask-cloudprovider

Cloud provider cluster managers for Dask. Supports AWS, Google Cloud, Azure and more...

254K 147 116
rapidsai
pylibcudf-cu12

cuDF - GPU DataFrame Library

216K 10K 1K
rapidsai
cudf-cu12

cuDF - GPU DataFrame Library

203K 10K 1K
polyaxon
traceml

Engine for ML/Data tracking, visualization, explainability, drift detection, and dashboards for Polyaxon.

142K 530 47
rapidsai
dask-cudf-cu12

cuDF - GPU DataFrame Library

120K 10K 1K
pytroll
pyresample

Geospatial image resampling in Python

114K 379 98
polyaxon
datatile

Engine for ML/Data tracking, visualization, explainability, drift detection, and dashboards for Polyaxon.

113K 530 47
mouradmourafiq
pandas-summary

Engine for ML/Data tracking, visualization, explainability, drift detection, and dashboards for Polyaxon.

113K 530 47
    • Data from PyPI, GitHub, ClickHouse, and BigQuery