49,153 dependents
Package Description Downloads/month
🤗 The largest hub of ready-to-use datasets for AI models with fast, easy-to-use ... 116.3M
Databricks SQL Connector for Python 104.9M
pandas on AWS - Easy integration with Athena, Glue, Redshift, Timestream, Neptun... 86.2M
Statistical data visualization in Python 55.6M
Google Cloud Client Libraries for Python 43.2M
Statsmodels: statistical modeling and econometrics in Python 36.3M
The open source AI engineering platform for agents, LLMs, and ML models. MLflow ... 36M
Google Cloud Client Libraries for Python 33.5M
Always know what to expect from your data. 31.4M
Streamlit — A faster way to build and share data apps. 27.3M
Apache Airflow - A platform to programmatically author, schedule, and monitor wo... 22.1M
N-D labeled arrays and datasets in Python 20.6M
Apache Airflow - A platform to programmatically author, schedule, and monitor wo... 20.1M
Download market data from Yahoo! Finance's API 18.4M
Python tools for geographic data 17.6M
FastF1 is a python package for accessing and analyzing Formula 1 results, schedu... 16.3M
Build and share delightful machine learning apps, all in Python. 🌟 Star to suppo... 16.2M
Python API for Deequ 15.7M
python implementation of the parquet columnar file format. 15.4M
Apache Airflow - A platform to programmatically author, schedule, and monitor wo... 14.6M
A game theoretic approach to explain the output of any machine learning model. 14.5M
Databricks Connect Client 13.8M
Python library to interact with Amazon SageMaker Unified Studio 11.4M
CmdStanPy is a lightweight interface to Stan for Python users which provides the... 9.6M
A library for training and deploying machine learning models on Amazon SageMaker 9.3M
Tool for producing high quality forecasts for time series data that has multiple... 8.6M
Mosaic AI Agent Framework SDK 6.8M
Google's Operations Research tools 6.8M
Python SDK for Milvus Vector Database 6.6M
A fast, scalable, high performance Gradient Boosting on Decision Trees library, ... 6.4M
ialbert bio: Making bioinformatics fun again 4.7M
chDB is an in-process OLAP SQL Engine 🚀 powered by ClickHouse 4.4M
Docling core data types and transformations 3.9M
A high-performance implementation of Wilkinson formulas for Python. 3.9M
A modern, enterprise-ready business intelligence web application 3.8M
Helper functions to plot, evaluate, preprocess and engineer features for forecas... 3.7M
Read/write Google spreadsheets using pandas DataFrames 3.6M
A Grammar of Graphics for Python 3.6M
A scales package for python 3.6M
The Python CDK empowers hundreds of Airbyte connectors, including low-code and n... 3.5M
A collection of python utility functions 3.4M
A statistical library designed to fill the void in Python's time series analysis... 3.4M
A unified interface for distributed computing. Fugue executes SQL, Python, Panda... 3.4M
llama-index readers file integration 3.3M
IBM watsonx.ai sample models, notebooks and apps. 3.1M
The Yandex Query official HTTP client 3.1M
Panel: The powerful data exploration & web app framework for Python 2.9M
A package for encoding categorical variables for machine learning 2.9M
Survival analysis in Python 2.8M
Lightning ⚡️ fast forecasting with statistical and econometric models. 2.8M