PyPI Stats
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About
Home

Streaming Data Python Packages

Python packages with the GitHub topic streaming-data. Sorted by relevance, with stars and monthly downloads.
piskvorky
smart-open

Utils for streaming large files (S3, HDFS, gzip, bz2...)

71.2M 3K 385
guillermo-navas-palencia
optbinning

Optimal binning: monotonic binning with constraints. Support batch & stream optimal binning. Scorecard modelling and counterfactual explanations.

220K 522 116
online-ml
river

🌊 Online machine learning in Python

219K 6K 624
quixio
quixstreams

Python Streaming DataFrames for Kafka

75K 2K 105
Sinotrade
shioaji

Shioaji all new cross platform api for trading ( 跨平台證券交易API )

70K 426 31
python-streamz
streamz

Real-time stream processing for python

50K 1K 150
bytewax
bytewax

Python Stream Processing

29K 2K 109
MaterializeInc
dbt-materialize

The live data layer for apps and AI agents. Create up-to-the-second views into your business, just using SQL

5K 6K 500
aramisfacchinetti
streaming-json-parser

A streaming JSON parser that processes JSON data incrementally, handling partial states. Useful for incrementally parsing partial responses from streaming outputs of Large Language Models (LLMs).

5K 13 2
scikit-multiflow
scikit-multiflow

A machine learning package for streaming data in Python. The other ancestor of River.

4K 794 189
readysettech
rdst

Readyset is a MySQL and Postgres wire-compatible caching layer that sits in front of existing databases to speed up queries and horizontally scale read throughput. Under the hood, ReadySet caches the results of cached select statements and incrementally updates these results over time as the underlying data changes.

4K 5K 159
creme-ml
creme

🌊 Online machine learning in Python

4K 6K 624
maki-nage
rxsci

ReactiveX for data science

2K 14 2
selimfirat
pysad

Streaming Anomaly Detection Framework in Python (Outlier Detection for Streaming Data)

2K 286 27
sdpython
pandas-streaming

Streaming API for pandas applied to big datasets

2K 31 9
quantfinlib
screamer

Screamingly fast streaming indicators with C++ performance and Python simplicity.

1K 4 1
thammo4
uvatradier

wahoowa

1K 29 19
streamdal
streamdal-protos

Code-Native Data Privacy

943 615 16
Menziess
slipstream-async

Slipstream provides a data-flow model to simplify development of stateful streaming applications.

710 39 2
readysettech
rdst-staging

ReadySet Diagnostics & SQL Tuning - CLI tool for database diagnostics and query optimization

390 5K 159
lampajr
ptdc

Python Twitter Data Collector

389 4 0
neurodata
sdtf

Exploring streaming options for decision trees and random forests. Based on scikit-learn fork.

360 9 3
geniusrise
geniusrise-listeners

A collection of Spouts that listen to events

343 2 5
Jgprog117
typedkafka

A well-documented, fully type-hinted Kafka client for Python

329 5 0
    • Data from PyPI, GitHub, ClickHouse, and BigQuery