PyPI Stats
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About
Home

Search Packages

Find Python packages by name, description, GitHub topic, or filter by metrics
uber
petastorm

Petastorm library enables single machine or distributed training and evaluation of deep learning models from datasets in Apache Parquet format. It supports ML frameworks such as Tensorflow, Pytorch, and PySpark and can be used from pure Python code.

287K 2K 284
mongodb-labs
pymongoarrow

MongoDB integrations for Apache Arrow. Export MongoDB documents to numpy array, parquet files, and pandas dataframes in one line of code.

97K 113 18
zachspar
parquet-py

A simple command-line interface & Python API for parquet

14K 1 0
Tendo33
parq-cli

A powerful command-line tool for Parquet files

2K 2 0
QTSurfer
lastra-convert

CLI converter for the Lastra columnar time series file format. Parquet / CSV / Arrow ↔️ Lastra round-trips.

1K 0 0
uber
hops-petastorm

Petastorm library enables single machine or distributed training and evaluation of deep learning models from datasets in Apache Parquet format. It supports ML frameworks such as Tensorflow, Pytorch, and PySpark and can be used from pure Python code.

204 2K 284
IgnacioMB
csvcli

A light-weight command-line tool to browse and query CSV, Excel and Apache Parquet files, regardless of their size.

174 3 0
sami5001
parquet-converter

Python utility to convert TXT and CSV files to Parquet

157 1 0
    • Data from PyPI, GitHub, ClickHouse, and BigQuery