PyPI Stats
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About
Home

Distributed Systems Python Packages

Python packages with the GitHub topic distributed-systems. Sorted by relevance, with stars and monthly downloads.
dmlc
xgboost

Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Dask, Flink and DataFlow

42.9M 28K 9K
fugue-project
fugue

A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rewrites.

3.3M 2K 100
fugue-project
fugue-sql-antlr

A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rewrites.

1.2M 2K 100
ag2ai
faststream

FastStream is a powerful and easy-to-use asynchronous Python framework for building asynchronous services interacting with event streams such as Apache Kafka, RabbitMQ, NATS, MQTT and Redis.

958K 5K 343
Eventual-Inc
daft

High-performance data engine for AI and multimodal workloads. Process images, audio, video, and structured data at any scale

825K 5K 457
faust-streaming
faust-streaming

Python Stream Processing. A Faust fork

666K 2K 203
dmlc
xgboost-cpu

Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Dask, Flink and DataFlow

441K 28K 9K
akhundMurad
typeid-python

Python implementation of TypeIDs: type-safe, K-sortable, and globally unique identifiers inspired by Stripe IDs

355K 150 17
irmen
pyro5

Pyro 5 - Python remote objects

275K 382 46
pytorch
torchft-nightly

Fault tolerance for PyTorch (HSDP, LocalSGD, DiLoCo, Streaming DiLoCo)

217K 500 62
rucio
rucio-clients

Rucio - Scientific Data Management

98K 298 383
pyeventsourcing
eventsourcing

A library for event sourcing in Python.

75K 2K 143
kalepa
safe-init

Safe Init is a Python library that enhances AWS Lambda functions with advanced error handling, logging, monitoring, and resilience features, providing comprehensive observability and reliability for serverless applications.

70K 6 0
v6d-io
vineyard

vineyard (v6d): an in-memory immutable data manager. (Project under CNCF, TAG-Storage)

55K 951 132
py-sherlock
sherlock

Easy distributed locks for Python with a choice of backends.

39K 378 34
pegasus-isi
pegasus-wms-api

Pegasus Workflow Management System - Automate, recover, and debug scientific computations.

37K 222 90
pegasus-isi
pegasus-wms-common

Pegasus Workflow Management System - Automate, recover, and debug scientific computations.

37K 222 90
google
google-vizier

Python-based research interface for blackbox and hyperparameter optimization, based on the internal Google Vizier Service.

25K 2K 110
tastyware
streaq

Fast, async, fully-typed distributed task queue via Redis streams

25K 142 11
v6d-io
vineyard-bdist

vineyard (v6d): an in-memory immutable data manager. (Project under CNCF, TAG-Storage)

23K 951 132
Attumm
redis-dict

Python dictionary with Redis as backend, built for large datasets. Simplifies Redis operations for large-scale and distributed systems. Supports various data types, namespacing, pipelining, and expiration.

22K 76 13
robinhood
faust

Python Stream Processing

22K 7K 538
v6d-io
vineyard-ml

vineyard (v6d): an in-memory immutable data manager. (Project under CNCF, TAG-Storage)

18K 951 132
fugue-project
fugue-sql-antlr-cpp

Fugue SQL Antlr C++ Parser

18K 2K 100
    • Data from PyPI, GitHub, ClickHouse, and BigQuery