PyPI Stats
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About
Home

Search Packages

Find Python packages by name, description, GitHub topic, or filter by metrics
aws
awswrangler

pandas on AWS - Easy integration with Athena, Glue, Redshift, Timestream, Neptune, OpenSearch, QuickSight, Chime, CloudWatchLogs, DynamoDB, EMR, SecretManager, PostgreSQL, MySQL, SQLServer and S3 (Parquet, CSV, JSON and EXCEL).

86.2M 4K 727
scikit-hep
awkward

Manipulate JSON-like data with NumPy-like idioms.

1.1M 957 121
scikit-hep
awkward-cpp

Manipulate JSON-like data with NumPy-like idioms.

847K 957 121
scikit-hep
awkward0

Manipulate arrays of complex data structures as easily as Numpy.

331K 214 39
cldellow
parquet-metadata

Dump metadata about a Parquet file.

210K 11 2
mongodb-labs
pymongoarrow

MongoDB integrations for Apache Arrow. Export MongoDB documents to numpy array, parquet files, and pandas dataframes in one line of code.

97K 113 18
influxdata
flightsql-dbapi

DB API 2 interface for Flight SQL with SQLAlchemy extras.

47K 43 5
scikit-hep
awkward1

Manipulate JSON-like data with NumPy-like idioms.

38K 957 121
developmentseed
lonboard

Fast, interactive geospatial data visualization in Jupyter.

38K 940 52
nanoporetech
lib-pod5

Pod5: a high performance file format for nanopore reads.

28K 174 37
AndreaBozzo
dataprof

Library and CLI for profiling tabular data

26K 14 1
tradewelltech
protarrow

Convert from protobuf to arrow and back

25K 40 6
PSU3D0
formualizer

Embeddable spreadsheet engine — parse, evaluate & mutate Excel workbooks from Rust, Python, or the browser. Arrow-powered, 320+ functions.

16K 121 14
nanoporetech
pod5

Pod5: a high performance file format for nanopore reads.

15K 174 37
columnar-tech
dbc

dbc is the command-line tool for installing and managing ADBC drivers

14K 103 9
abdenlab
oxbow

Oxbow makes genomic data ready for high-performance analytics.

9K 153 15
mluttikh
xml2arrow

Convert XML data to Apache Arrow tables

8K 1 0
Query-farm
vgi-rpc

Transport-agnostic RPC framework built on Apache Arrow IPC serialization. Define RPC interfaces as Python Protocol classes with automatic schema derivation, typed client proxies, and streaming support.

7K 10 0
bug-ops
pyhdb-rs

SAP HANA meets modern Python. Rust-powered driver with zero-copy Arrow, native Polars/pandas support, async pooling. Includes MCP server for AI assistants.

6K 1 0
hypertopos
hypertopos

Understand the structure of your data — without training machine learning models

6K 2 0
arrowjet
arrowjet

The fastest way to move data in and out of database.

5K 1 1
scikit-hep
awkward-numba

Manipulate arrays of complex data structures as easily as Numpy.

3K 214 39
rpy2
rpy2-arrow

Share Apache Arrow datasets between Python and R.

3K 19 3
tradewelltech
beavers

Python stream processing for analytics

3K 41 2
    • Data from PyPI, GitHub, ClickHouse, and BigQuery