PyPI Stats

Search Packages

Find Python packages by name, description, GitHub topic, or filter by metrics
databrickslabs
dbldatagen

Generate relevant synthetic data quickly for your projects. The Databricks Labs synthetic data generator (aka `dbldatagen`) can be used to generate large simulated or synthetic data sets for testing, POCs, and other uses in Databricks environments, including in Delta Live Tables pipelines.

274K 460 93
datalpia
laketower

Oversee your lakehouse

3K 12 0
mrjsj
blueno

A Python ETL library for creating declarative data pipelines.

1K 1 0
ismailhammounou
db2ixf

db2ixf is a Python package with a CLI that simplifies the parsing and processing of IBM Integration eXchange Format (IXF) files.

1K 16 1
mrjsj
msfabricutils

Spark-free Python utilities for Microsoft Fabric focused on Data Engineering using Polars and delta-rs

946 42 6
xbrianh
xdlake

A loose implementation of the Delta Lake protocol, written in Python on top of pyarrow, focused on extensibility, customizability, and distributed data.

339 4 0
openaleph
ftm-lakehouse

Data standard and archive storage for structured FollowTheMoney data, leaked data, private and public document collections.

284 5 1
investigativedata
leakrfc

Data standard and archive storage for structured FollowTheMoney data, leaked data, private and public document collections.

255 4 1
dataresearchcenter
flydelta

A Flight SQL proxy for Delta Lake. Query Delta tables via Apache Arrow Flight with efficient streaming and predicate pushdown.

168 1 0
Data from PyPI, GitHub, ClickHouse, and BigQuery