100 dependents
Package Description Downloads/month
Apache Airflow - A platform to programmatically author, schedule, and monitor wo... 20.1M
Python library to interact with Amazon SageMaker Unified Studio 11.4M
PyAirbyte 388K
Singer target for Snowflake, built with the Meltano SDK for Singer Targets. 362K
An orchestration platform for the development, production, and observation of da... 199K
The Privacy Engineering & Compliance Framework 90K
ingestr is a CLI tool to copy data between any databases with a single command s... 69K
A warehouse-native semantic modeling layer that creates unified customer profile... 53K
Evaluation and Tracking for LLM Experiments and AI Agents 38K
YData allows to use the *Data-Centric* tools from the YData ecosystem to acceler... 29K
A library to handle JSON with snowflake-sqlalchemy. 17K
An orchestration platform for the development, production, and observation of da... 8K
Utilities for Python from Mayuran Visakan 8K
Ontology-based MCP server that analyzes database schemas (PostgreSQL, Snowflake,... 4K
droughty is an analytics engineering toolkit, helping keep your workflow dry. 4K
Internal package. Use this at your own risk, support not guaranteed 4K
Python connection utilities for the Snowflake Data warehouse 4K
Visivo CLI for BI and visualizations as code 3K
DataOps Data Quality TestGen is part of DataKitchen's Open Source Data Observabi... 3K
a monorepo featuring modular microkernel frameworks and single purpose extension... 3K
Singer tap for Snowflake, built with the Meltano SDK for Singer Taps. 3K
Define, govern, and model event data for warehouse-first product analytics. 3K
Databao agent is an open-source agent that enables you to chat with your data an... 3K
Python package with helper functions for the Data team. 2K
Komodo Development Kit — Python SDK and CLI for the Komodo Health platform (Snow... 2K
Data Catalog for Databases and Data Warehouses 2K
Collection of geospatial and transformational functions used by Expert Intellige... 2K
MetricFlow allows you to define, build, and maintain metrics in code. 2K
Test with compare 2K
Essential Python toolkit for Deepnote environments 2K
Database Intelligence Layer - Multi-database connectivity with SQLShield integra... 2K
All-in-1 MCP server for developers 1K
Superduper: End-to-end framework for building custom AI applications and agents. 1K
AladdinSDK 1K
A DBT Python runner for Postgres 1K
zsvoboda dbd
dbd is a database prototyping tool that enables data analysts and engineers to q... 1K
Agent Data Distillation Platform 1K
QALITA Platform Core lib for common function used in pack 1K
Fork of Permifrost for Gemma 956
A tool to compare data from different sources. 884
A handy package to load Google Sheets to your database right from the CLI and wi... 769
Reasoning Interface for Text-to-Analytics (RITA) - Natural language SQL and NoSQ... 729
ProcessTracker is a framework for managing data integration processes. 725
The Nuvolos Python library for database connectivity 667
Enterprise-grade data quality framework with YAML configuration, LLM-friendly de... 667
Python Data for the Galaxy Project 665
A sweet CLI tool to help dbt users enforce documentation and testing on their db... 647
Snowflake permissions management tool with Apache Iceberg table support 646
hckr - Awesome CLI for developers 632
Esta biblioteca de código esta pensada para compartir funcionalidades entre todo... 607