PyPI Stats
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About
Home

Search Packages

Find Python packages by name, description, GitHub topic, or filter by metrics
databrickslabs
databricks-labs-dqx

Databricks framework to validate Data Quality of pySpark DataFrames and Tables

5.1M 405 111
databrickslabs
dbldatagen

Generate relevant synthetic data quickly for your projects. The Databricks Labs synthetic data generator (aka `dbldatagen`) may be used to generate large simulated / synthetic data sets for test, POCs, and other uses in Databricks environments including in Delta Live Tables pipelines

274K 460 93
HashLoad
freeza-offset

Spark stream consumption commit in kafka consumer group

1K 16 1
prophecy-io
prophecy-spark-ai

High-performance AI/ML library for Spark to build and deploy your LLM applications in production.

243 51 15
    • Data from PyPI, GitHub, ClickHouse, and BigQuery