PyPI Stats
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About
Home

Pipelines Python Packages

Python packages with the GitHub topic pipelines. Sorted by relevance, with stars and monthly downloads.
meltano
meltano

Meltano: the declarative code-first data integration engine that powers your wildest data and ML-powered product ideas. Say goodbye to writing, maintaining, and scaling your own API integrations.

1.1M 2K 235
pytorch
torchx

TorchX is a universal job launcher for PyTorch applications. TorchX is designed to have fast iteration time for training/research and support for E2E production ML pipelines when you're ready.

267K 422 152
pepkit
pipestat

Pipeline results reporting package

135K 4 2
zenml-io
zenml

ZenML 🙏: One AI Platform from Pipelines to Agents. https://zenml.io.

100K 5K 611
meta-pytorch
torchx-nightly

TorchX is a universal job launcher for PyTorch applications. TorchX is designed to have fast iteration time for training/research and support for E2E production ML pipelines when you're ready.

89K 422 152
mage-ai
mage-ai

🧙 Build, run, and manage data pipelines for integrating and transforming data.

86K 9K 964
polyaxon
polyaxon

MLOps Tools For Managing & Orchestrating The Machine Learning LifeCycle

67K 4K 324
elyra-ai
kfp-notebook

Elyra extends JupyterLab with an AI centric approach.

33K 2K 366
mmourafiq
vents

Open source connections catalog, integrations, tools, alerting, and notifications library.

28K 31 4
pypyr
pypyr

pypyr task-runner cli & api for automation pipelines. Automate anything by combining commands, different scripts in different languages & applications into one pipeline process.

27K 643 29
Avaiga
taipy

Turns Data and AI algorithms into production-ready web applications in no time.

19K 19K 2K
mabel-dev
mabel

😊 mabel is a platform for authoring data processing systems.

17K 7 3
anam-org
metaxy

Pluggable sample-level metadata versioning for incremental multimodal pipelines.

17K 96 6
zenml-io
zenml-nightly

ZenML 🙏: One AI Platform from Pipelines to Agents. https://zenml.io.

15K 5K 611
tenzir
tenzir

Tenzir is the data pipeline engine for security teams.

13K 737 103
lsst
lsst-pex-config

Configuration interface and history-tracking for LSST Data Management.

13K 0 11
4dn-dcic
tibanna

Tibanna helps you run your genomic pipelines on Amazon cloud (AWS). It is used by the 4DN DCIC (4D Nucleome Data Coordination and Integration Center) to process data. Tibanna supports CWL/WDL (w/ docker), Snakemake (w/ conda) and custom Docker/shell command.

13K 72 28
lsst
lsst-pipe-base

LSST Data Management: base classes for data processing tasks

12K 12 11
joocer
data-expectations

Are your data meeting your expectations?

12K 1 1
ASEM000
pytreeclass

Visualize, create, and operate on pytrees in the most intuitive way possible.

12K 46 2
lsst
lsst-ctrl-mpexec

Execution framework for PipelineTask

12K 2 3
ploomber
ploomber

The fastest ⚡️ way to build data pipelines. Develop iteratively, deploy anywhere. ☁️

12K 4K 241
Avaiga
taipy-gui

Turns Data and AI algorithms into production-ready web applications in no time.

11K 19K 2K
lsst
lsst-ctrl-bps

A PipelineTask execution framework for multi-node processing for the LSST Batch Production Service (BPS).

11K 5 6
    • Data from PyPI, GitHub, ClickHouse, and BigQuery