551 dependents
Package Description Downloads/month
Inspect: A framework for large language model evaluations 3.4M
SageMaker MLOps package for workflow orchestration and model building 1.5M
AWS S3 plugin for dvc 1.2M
The machine learning client library that is used for interacting with Snowflake ... 708K
Extensible Python SDK for developing Flyte tasks and workflows. Simple to get st... 519K
The Python-ARM Radar Toolkit. A data model driven interactive toolkit for workin... 311K
llama-index readers s3 integration 164K
Python Library for NASA Earthdata APIs 101K
A hands-on framework for data scientists to learn and implement end-to-end custo... 95K
Megatron's multi-modal data loader 77K
3lc
3LC Python Package - A tool for model-guided, interactive data debugging and enh... 75K
ingestr is a CLI tool to copy data between any databases with a single command s... 69K
MCP and API interfaces that let the agents do the admin work 60K
Snowpark Connect for Spark 59K
Xarray data access for EODAG 57K
CZ CELLxGENE Discover Census 55K
The open source research environment for AI researchers to seamlessly train, eva... 54K
This python package will be stored in AWS CodeArtifact 51K
Data Memory: the operational data context layer for AI agents - typed, versioned... 46K
Data and tools for generating and inspecting OLMo pre-training data. 44K
nannyml: post-deployment data science in python 42K
Open-source deep-learning framework for building, training, and fine-tuning deep... 40K
Base OAREPO package freezeing versions of libraries 39K
EEG-DaSh: an open data, tool, and compute resource — a Python library and catalo... 37K
REsource eXtraction Tool (rex) 37K
Python package for evaluating neuron segmentations in terms of the number of spl... 33K
A friendly package for Kepler & TESS time series analysis in Python. 33K
NeMo Retriever Library is a scalable, performance-oriented document content and ... 31K
DBND is an agile pipeline framework that helps data engineering teams track and ... 30K
YData allows to use the *Data-Centric* tools from the YData ecosystem to acceler... 29K
FAIR Sequence Modeling Toolkit 2 23K
Go ahead and axolotl questions 20K
Python-based Space Physics Environment Data Analysis Software 20K
images for arkitekt 20K
A collection of tools for GeoParquet, built on DuckDB, GDAL & Obstore 20K
Jupyter Notebooks in S3 - Jupyter Contents Manager implementation 19K
Superlinked server enables fast and scalable vector search and storage 19K
synapse sdk 15K
Security scanner for AI/ML model files. Detects malicious code, backdoors, and v... 14K
Simple Workflow Framework based on Hamilton 14K
A Sensor Geometry Application Re-usable by-Design 13K
FINTER API 13K
Satip provides the functionality necessary for 13K
Utility library to support Datup AI MLOps processes 13K
Climate Simulation Operations 12K
Open-source Jupyter Notebooks and Python tools for geospatial analysis with Digi... 11K
A BioIO reader plugin for reading Zarr files in the OME format. 11K
Great Expectations Plugin for Flytekit 10K
Open-source deep-learning framework for exploring, building and deploying AI wea... 10K
Package containing common code and reusable components for pipelines and dags 10K