PyPI Stats
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About
Home

Pandas Dataframe Python Packages

Python packages with the GitHub topic pandas-dataframe. Sorted by relevance, with stars and monthly downloads.
delta-io
deltalake

A native Rust library for Delta Lake, with bindings into Python

22.8M 3K 615
pandera-dev
pandera

A light-weight, flexible, and expressive statistical data testing library

8.8M 4K 395
influxdata
influxdb-client

InfluxDB 2.0 python client

8.5M 788 186
jmcarpenter2
swifter

A package which efficiently applies any function to a pandas dataframe or series in the fastest available manner

8.1M 3K 104
robin900
gspread-dataframe

Read/write Google spreadsheets using pandas DataFrames

3.6M 260 24
thombashi
pytablewriter

pytablewriter is a Python library to write a table in various formats: AsciiDoc / CSV / Elasticsearch / HTML / JavaScript / JSON / LaTeX / LDJSON / LTSV / Markdown / MediaWiki / NumPy / Excel / Pandas / Python / reStructuredText / SQLite / TOML / TSV.

3.1M 630 47
Roche
pyreadstat

Python package to read and write sas, spss and stata files into/from pandas and polars data frames. It is a wrapper for the C library readstat.

2.5M 421 71
ydataai
ydata-profiling

1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.

1.9M 14K 2K
jupyter-incubator
hdijupyterutils

Jupyter magics and kernels for working with remote Spark clusters

1.7M 1K 448
jupyter-incubator
autovizwidget

Jupyter magics and kernels for working with remote Spark clusters

1.7M 1K 448
evidentlyai
evidently

Evidently is ​​an open-source ML and LLM observability framework. Evaluate, test, and monitor any AI-powered system or data pipeline. From tabular data to Gen AI. 100+ metrics.

1.2M 7K 836
posit-dev
great-tables

Make awesome display tables using Python

608K 3K 126
ydataai
pandas-profiling

1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.

603K 14K 2K
ofajardo
pyreadr

Python package to read and write R RData and Rds files into/from pandas dataframes. No R or other external dependencies required.

186K 336 25
fbdesignpro
sweetviz

Visualize and compare datasets, target values and associations, with one line of code.

123K 3K 288
maybelinot
df2gspread

Manage Google Spreadsheets in Pandas DataFrame with Python

122K 131 34
mongodb-labs
pymongoarrow

MongoDB integrations for Apache Arrow. Export MongoDB documents to numpy array, parquet files, and pandas dataframes in one line of code.

99K 113 18
BCG-X-Official
sklearndf

DataFrame support for scikit-learn.

95K 63 9
thombashi
pytablereader

A Python library to load structured table data from files/strings/URL with various data format: CSV / Excel / Google-Sheets / HTML / JSON / LDJSON / LTSV / Markdown / SQLite / TSV.

71K 109 13
delta-io
hops-deltalake

A native Rust library for Delta Lake, with bindings into Python

60K 3K 615
deepchecks
deepchecks

Deepchecks: Tests for Continuous Validation of ML Models & Data. Deepchecks is a holistic open-source solution for all of your AI & ML validation needs, enabling to thoroughly test your data and models from research to production.

58K 4K 294
RCoff
smartsheet-dataframe

Converts Smartsheet sheets and reports to a Pandas DataFrame

49K 14 3
thombashi
simplesqlite

SimpleSQLite is a Python library to simplify SQLite database operations: table creation, data insertion and get data as other data formats. Simple ORM functionality for SQLite.

46K 135 15
rasbt
biopandas

Working with molecular structures in pandas DataFrames

37K 751 117
    • Data from PyPI, GitHub, ClickHouse, and BigQuery