139 dependents
Package Description Downloads/month
Write 70% less code by using the SDK to build custom extractors and loaders that... 13.3M
An orchestration platform for the development, production, and observation of da... 6.9M
Apache Airflow - A platform to programmatically author, schedule, and monitor wo... 6.7M
Inspect: A framework for large language model evaluations 3.4M
Transcript Analysis for AI Agents 560K
Python tools for querying and manipulating BIDS datasets. 127K
Python package for reading DICOM WSI file sets. 110K
An open and interoperable data framework for spatial omics data 105K
Create temporary directories on the various filesystems for testing 97K
Community supported integrations for the Dagster platform. 94K
NeMo Retriever Library is a scalable, performance-oriented document content and ... 76K
Data representations, APIs, and tools for high quality AV and robotics applicati... 68K
Setup & configure LaminDB. 62K
Python package for earth-observing satellite data processing 52K
Cloud-Optimize your Scientific Data as Virtual Zarr stores, using xarray syntax. 49K
🗂️ DirectoryTree widget for textual, compatible with all filesystems 45K
A user-friendly library that brings familiar DataFrame-style operations to AnnDa... 39K
Efficient Pandas representation for nested associated datasets. 34K
NeMo Retriever Library is a scalable, performance-oriented document content and ... 31K
Developer tools for M3 (MedicalMultitaskModeling) 25K
Hierarchical Adaptive Tiling Scheme 20K
synapse sdk 15K
Python API and CLI tools for working with WEBKNOSSOS datasets, annotations and s... 13K
A modern RAG ingestion pipeline from Nvidia 13K
Minimal package for loading and initializing OlmoEarth models 12K
A tool for developing remote sensing datasets and models. 11K
LSDB - python tool for scalable analysis of large catalogs 11K
pytask is a workflow management system that facilitates reproducible data analys... 11K
argoverse av2
Argoverse 2: Next generation datasets for self-driving perception and forecastin... 11K
HydroMT: Automated and reproducible model building and analysis 9K
Don't write docs. Code them. 8K
Distilabel is an AI Feedback (AIF) framework for building datasets with and for ... 7K
🗜️ (de)serialize json objects with lazy/partial loading containers using msgpack... 7K
Setup & configure LaminDB. 7K
Compatibility helpers of core functionality to help with running in different Py... 6K
Tools and clients for working with the Dapla Metadata system 6K
A prototype to test how fast metadata and specific ecephys unit data can be acce... 6K
Ecephys and behavior workflows for the Mindscope Neuropixels team. 6K
Cloud-native, scalable, and user-friendly multi dimensional energy data! 6K
Python library for reading tiles from wsi tiff-files. 5K
Jupyter notebooks in the terminal 5K
Helpers for universal-pathlib / fsspec 5K
An API for working with raw data from Neon recordings 5K
A high-level wrapper of PyAV providing an easy to use interface to video data. 4K
A backend for pydantic-AI agents and MCP servers. 4K
Base pydantic tools 4K
Import and export data into/from InfluxDB. 3K
Document reader with OCR & image detection support. 3K
Jinja2 utilities, loaders & fsspec integration. 3K
A pipeline orchestration library executing tasks within one python session. It t... 3K