1,446 dependents
Package Description Downloads/month
S3 Filesystem 638.7M
The official Python client for the Hugging Face Hub. 240.6M
🤗 The largest hub of ready-to-use datasets for AI models with fast, easy-to-use ... 116.3M
Pythonic file-system interface for Google Cloud Storage 94.9M
Tensors and Dynamic neural networks in Python with strong GPU acceleration 83.9M
pathlib api extended to use fsspec backends 44.9M
PyIceberg 35.8M
Parallel computing with task scheduling 26.5M
PyAthena is a Python DB API 2.0 (PEP 249) client for Amazon Athena. 17.8M
python implementation of the parquet columnar file format. 15.4M
Write 70% less code by using the SDK to build custom extractors and loaders that... 13.3M
Access Azure Blobs and Data Lake Storage (ADLS) Gen2 with fsspec and dask 13.3M
Prefect is a workflow orchestration framework for building resilient data pipeli... 12.4M
Pretrain, finetune ANY AI model of ANY size on 1 or 10,000+ GPUs with zero code ... 11.5M
Build and share delightful machine learning apps, all in Python. 🌟 Star to suppo... 9.7M
dlt-hub dlt
data load tool (dlt) is an open source Python library that makes data loading ea... 7.5M
LlamaIndex is the leading document agent and OCR platform 7.1M
Apache Airflow - A platform to programmatically author, schedule, and monitor wo... 4.8M
A fsspec based filesystem-like interface to drives exposed through the Microsoft... 3.7M
yaml include other yaml 3.6M
A collection of python utility functions 3.4M
Inspect: A framework for large language model evaluations 3.4M
treeverse dvc
🦉 Data Versioning and ML Experiments 2.9M
SSH Filesystem -- Async SSH/SFTP backend for fsspec 2.3M
DVC's data management subsystem 2.2M
SCM wrapper and fsspec filesystem for Git for use in DVC. 2.1M
dvc objects - contains filesystem and object-db level abstractions to use in dvc... 2M
HTTP plugin for dvc 1.8M
Graph Neural Network Library for PyTorch 1.8M
An open protocol for secure data sharing 1.4M
Visualise your Kedro data and machine-learning pipelines and track your experime... 1.4M
A collection of self-contained fsspec-based filesystems 1.3M
Evidently is ​​an open-source ML and LLM observability framework. Evaluate, test... 1.2M
AWS S3 plugin for dvc 1.2M
Manipulate JSON-like data with NumPy-like idioms. 1.1M
ROOT I/O in pure Python and NumPy. 1M
This library has moved to https://github.com/googleapis/google-cloud-python/tree... 982K
Scalable machine 🤖 learning for time series forecasting. 978K
aider is AI pair programming in your terminal 864K
Fast and Accurate ML in 3 Lines of Code 859K
Kedro is a toolbox for production-ready data science. It uses software engineeri... 822K
High-performance data engine for AI and multimodal workloads. Process images, au... 814K
Data catalog, search and load 799K
A scalable generative AI framework built for researchers and developers working ... 798K
LlamaIndex is the leading document agent and OCR platform 754K
The machine learning client library that is used for interacting with Snowflake ... 708K
A lightweight library for PyTorch training tools and utilities 671K
Extensible Python SDK for developing Flyte tasks and workflows. Simple to get st... 519K
A profiling and performance analysis tool for machine learning 442K
Neo4j GraphRAG for Python 419K