PyPI Stats
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About
Home

Dataframe Python Packages

Python packages with the GitHub topic dataframe. Sorted by relevance, with stars and monthly downloads.
snowflakedb
snowflake-snowpark-python

Snowflake Snowpark Python API

52.9M 333 148
pola-rs
polars

Extremely fast Query Engine for DataFrames, written in Rust

52.8M 38K 3K
pola-rs
polars-runtime-32

Extremely fast Query Engine for DataFrames, written in Rust

39.2M 38K 3K
graphframes
graphframes

GraphFrames is a package for Apache Spark which provides DataFrame-based Graphs

2.9M 1K 266
modin-project
modin

Modin: Scale your Pandas workflows by changing a single line of code

2.1M 10K 673
lk-geimfari
mimesis

Mimesis is a fast Python library for generating fake data in multiple languages.

1.9M 5K 359
databricks
koalas

Koalas: pandas API on Apache Spark

1.4M 3K 368
sfu-db
connectorx

Fastest library to load data from DB to DataFrames in Rust and Python

1.3M 3K 211
graphframes
graphframes-py

GraphFrames is a package for Apache Spark which provides DataFrame-based Graphs

1.3M 1K 266
pola-rs
polars-lts-cpu

Extremely fast Query Engine for DataFrames, written in Rust

739K 38K 3K
pyjanitor-devs
pyjanitor

Clean APIs for data cleaning. Python implementation of R package Janitor

654K 1K 184
intake
awkward-pandas

For when your data won't fit in your dataframe

350K 50 6
debugger24
pyspark-test

Testing library for pyspark, inspired from pandas testing module but for pyspark, to help users write unit tests.

339K 21 5
Kanaries
pygwalker

PyGWalker: Turn your dataframe into an interactive UI for visual analysis

331K 16K 865
man-group
arcticdb

ArcticDB is a high performance, serverless DataFrame database built for the Python Data Science ecosystem.

304K 2K 178
crflynn
pbspark

protobuf pyspark conversion

290K 23 6
rapidsai
libcudf-cu12

cuDF - GPU DataFrame Library

254K 10K 1K
rapidsai
pylibcudf-cu12

cuDF - GPU DataFrame Library

219K 10K 1K
skrub-data
skrub

Machine learning with dataframes

204K 2K 214
rapidsai
cudf-cu12

cuDF - GPU DataFrame Library

202K 10K 1K
freqtrade
technical

Various indicators developed or collected for the Freqtrade

199K 998 243
apache
sf-hamilton

Apache Hamilton helps data scientists and engineers define testable, modular, self-documenting dataflows, that encode lineage/tracing and metadata. Runs and scales everywhere python does.

172K 2K 187
pola-rs
polars-runtime-compat

Extremely fast Query Engine for DataFrames, written in Rust

155K 38K 3K
alteryx
woodwork

Woodwork is a Python library that provides robust methods for managing and communicating data typing information.

150K 154 24
    • Data from PyPI, GitHub, ClickHouse, and BigQuery