5,562 dependents
Package description Downloads/month
🤗 The largest hub of ready-to-use datasets for AI models with fast, easy-to-use ... 116.3M
pandas on AWS - Easy integration with Athena, Glue, Redshift, Timestream, Neptun... 86.2M
Google Cloud Client Libraries for Python 43.2M
The open source AI engineering platform for agents, LLMs, and ML models. MLflow ... 36M
Google Cloud Client Libraries for Python 33.5M
See PECO-1396 for more details about this repository. 28.9M
Streamlit — A faster way to build and share data apps. 27.3M
🦜🔗 LangChain interfaces to Google's suite of AI products (e.g. Gemini & Vertex A... 23.6M
Apache Airflow - A platform to programmatically author, schedule, and monitor wo... 22.1M
Apache Airflow - A platform to programmatically author, schedule, and monitor wo... 20.1M
Apache Airflow - A platform to programmatically author, schedule, and monitor wo... 14.6M
Databricks Connect Client 13.8M
Python library to interact with Amazon SageMaker Unified Studio 11.4M
Apache Beam SDK for Python 7.8M
Developer-friendly OSS embedded retrieval library for multimodal AI. Search More... 7M
An open-source, code-first Python toolkit for building, evaluating, and deployin... 7M
ODPS Python SDK and data analysis framework 4.6M
chDB is an in-process OLAP SQL Engine 🚀 powered by ClickHouse 4.4M
A modern, enterprise-ready business intelligence web application 3.8M
A collection of Python utility functions 3.4M
Apache DataFusion Python Bindings 3.4M
Type annotations for pyarrow 3.2M
Python wrapper for Lance columnar format 3M
Semantic link for Microsoft Fabric 2.3M
AI Observability & Evaluation 2.2M
An open source SDK for logging, storing, querying, and visualizing multimodal an... 2.1M
Tecton Python SDK 1.8M
TFDS is a collection of datasets ready to use with TensorFlow, Jax, ... 1.6M
SageMaker MLOps package for workflow orchestration and model building 1.5M
Apache Spark - A unified analytics engine for large-scale data processing 1.4M
An open protocol for secure data sharing 1.4M
Python SDK for Chalk 1.3M
Open WebUI 1.3M
Python library to access and analyze SEC Edgar filings, XBRL financial statement... 1.1M
Fast and Accurate ML in 3 Lines of Code 1.1M
A DataSource for reading and writing HuggingFace Datasets in Spark 1M
This library has moved to https://github.com/googleapis/google-cloud-python/tree... 982K
API Client for the Materials Project 866K
High-performance data engine for AI and multimodal workloads. Process images, au... 814K
The Open Source Feature Store for AI/ML 774K
A Python client to interact with Arize API 753K
The machine learning client library that is used for interacting with Snowflake ... 708K
Kepler.gl is a powerful open source geospatial analysis tool for large-scale dat... 674K
Provides a common interface to many IR ranking datasets. 567K
Transcript Analysis for AI Agents 560K
The official Python client library for Databento 548K
A tool to read XML files as pandas dataframes. 502K
Python helpers for G4X. 430K
The package is to coordinate dependencies within AzureML packages. This pack... 404K
Basic tools and wrappers for enabling not-too-alien syntax when running columnar... 399K