5,562 dependents
| Description | Downloads/month |
|---|---|
| 🤗 The largest hub of ready-to-use datasets for AI models with fast, easy-to-use ... | 116.3M |
| pandas on AWS - Easy integration with Athena, Glue, Redshift, Timestream, Neptun... | 86.2M |
| Google Cloud Client Libraries for Python | 43.2M |
| The open source AI engineering platform for agents, LLMs, and ML models. MLflow ... | 36M |
| Google Cloud Client Libraries for Python | 33.5M |
| See PECO-1396 for more details about this repository. | 28.9M |
| Streamlit — A faster way to build and share data apps. | 27.3M |
| 🦜🔗 LangChain interfaces to Google's suite of AI products (e.g. Gemini & Vertex A... | 23.6M |
| Apache Airflow - A platform to programmatically author, schedule, and monitor wo... | 22.1M |
| Apache Airflow - A platform to programmatically author, schedule, and monitor wo... | 20.1M |
| Apache Airflow - A platform to programmatically author, schedule, and monitor wo... | 14.6M |
| Databricks Connect Client | 13.8M |
| Python library to interact with Amazon SageMaker Unified Studio | 11.4M |
| Apache Beam SDK for Python | 7.8M |
| Developer-friendly OSS embedded retrieval library for multimodal AI. Search More... | 7M |
| An open-source, code-first Python toolkit for building, evaluating, and deployin... | 7M |
| ODPS Python SDK and data analysis framework | 4.6M |
| chDB is an in-process OLAP SQL Engine 🚀 powered by ClickHouse | 4.4M |
| A modern, enterprise-ready business intelligence web application | 3.8M |
| A collection of python utility functions | 3.4M |
| Apache DataFusion Python Bindings | 3.4M |
| Type annotations for pyarrow | 3.2M |
| python wrapper for Lance columnar format | 3M |
| Semantic link for Microsoft Fabric | 2.3M |
| AI Observability & Evaluation | 2.2M |
| An open source SDK for logging, storing, querying, and visualizing multimodal an... | 2.1M |
| Tecton Python SDK | 1.8M |
| TFDS is a collection of datasets ready to use with TensorFlow, Jax, ... | 1.6M |
| SageMaker MLOps package for workflow orchestration and model building | 1.5M |
| Apache Spark - A unified analytics engine for large-scale data processing | 1.4M |
| An open protocol for secure data sharing | 1.4M |
| Python SDK for Chalk | 1.3M |
| Open WebUI | 1.3M |
| Python library to access and analyze SEC Edgar filings, XBRL financial statement... | 1.1M |
| Fast and Accurate ML in 3 Lines of Code | 1.1M |
| A DataSource for reading and writing HuggingFace Datasets in Spark | 1M |
| This library has moved to https://github.com/googleapis/google-cloud-python/tree... | 982K |
| API Client for the Materials Project | 866K |
| High-performance data engine for AI and multimodal workloads. Process images, au... | 814K |
| The Open Source Feature Store for AI/ML | 774K |
| A Python client to interact with Arize API | 753K |
| The machine learning client library that is used for interacting with Snowflake ... | 708K |
| Kepler.gl is a powerful open source geospatial analysis tool for large-scale dat... | 674K |
| Provides a common interface to many IR ranking datasets. | 567K |
| Transcript Analysis for AI Agents | 560K |
| The official Python client library for Databento | 548K |
| A tool to read XML files as pandas dataframes. | 502K |
| Python helpers for G4X. | 430K |
| The package is to coordinate dependencies within AzureML packages. This pack... | 404K |
| Basic tools and wrappers for enabling not-too-alien syntax when running columnar... | 399K |