215 dependents
Package Description Downloads/month
Apache Airflow - A platform to programmatically author, schedule, and monitor wo... 14.6M
This library has moved to https://github.com/googleapis/google-cloud-python/tree... 982K
Extensible Python SDK for developing Flyte tasks and workflows. Simple to get st... 519K
A profiling and performance analysis tool for machine learning 442K
Google Storage plugin for dvc 436K
228K
ingestr is a CLI tool to copy data between any databases with a single command s... 69K
MCP and API interfaces that let the agents do the admin work 60K
Snowpark Connect for Spark 59K
The open source research environment for AI researchers to seamlessly train, eva... 54K
Ascend community code. 49K
Data Memory: the operational data context layer for AI agents - typed, versioned... 46K
nannyml: post-deployment data science in python 42K
Dask + BigQuery integration 38K
Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax 34K
A profiling and performance analysis tool for machine learning 33K
llama-index readers gcs integration 33K
YData allows you to use the *Data-Centric* tools from the YData ecosystem to acceler... 29K
Go ahead and axolotl questions 20K
Jupyter Notebooks in S3 - Jupyter Contents Manager implementation 19K
Superlinked server enables fast and scalable vector search and storage 19K
Automation, Data Mash, Message Learning, AI Ops, Quantum Ops 19K
synapse sdk 15K
Security scanner for AI/ML model files. Detects malicious code, backdoors, and v... 14K
Python bindings for the lance-graph Cypher engine 11K
Open-source deep-learning framework for exploring, building and deploying AI wea... 10K
Python projects for Carol 10K
Accelerate, Optimize performance with streamlined training and serving options w... 9K
molfeat - the hub for all your molecular featurizers 9K
Dapla python utilities library 9K
PINDER: The Protein INteraction Dataset and Evaluation Resource 7K
Analyse MalariaGEN data from Python 6K
Tools and clients for working with the Dapla Metadata system 6K
The leading data integration platform for ETL / ELT data pipelines from APIs, da... 6K
Runtime support library for Chalk AI 6K
Project combining flowfile core (backend) and flowfile_worker (compute offloader... 5K
Utilities for scaling geospatial analyses 5K
Nucleobench optimizers and tasks. 5K
Repository for Digital Earth Africa Sandbox, including: Jupyter notebooks, scrip... 5K
Shared functions made by the resource group for Python 5K
Pushdown compute from Snowflake to DuckDB running on your infrastructure 5K
lib310 Python Package 4K
Data storage utilities and processing pipelines used by CDP instances. 4K
EimerDB 4K
python for asset management 4K
Biobanking data processing, annotation, and association workflows 4K
Setup & configure LaminDB. 4K
Package for shared functions in section 422 3K
Amora Data Build Tool 3K
dataset preparation for data-driven weather models 3K