134 dependents
Package Description Downloads/month
Apache Spark - A unified analytics engine for large-scale data processing 52.2M
Databricks Connect Client 13.8M
Apache Flink Python API 156K
User's custom models boilerplate 94K
DataRobot Prediction Library 79K
datarobot-mlops library to read and report MLOps statistics 76K
Snowpark Connect for Spark 59K
Python PMML scoring library 42K
35K
Backend used by MXCuBE 33K
Merlion: A Machine Learning Framework for Time Series Intelligence 17K
Exposes OpenJDK's Java parser and scanner to Python 11K
eCalc™ is a software tool for calculation of energy demand and greenhouse gas em... 10K
A Python helper package providing streamlined Spark functions for efficient data... 10K
CLI tool for the Zipline AI platform 8K
NEGotiations Managed by Agent Simulations 7K
PMML evaluator library for Python 6K
An engine for running component based ML pipelines 5K
Scalable identity resolution, entity resolution, data mastering and deduplicatio... 4K
A Python package for Altastata data processing and machine learning integration 4K
Notebook gallery and issue tracking for Atoti 4K
vsm
Vector Space Semantic Modeling Framework for the Indiana Philosophy Ontology Pro... 4K
Booz Allen's lean manufacturing approach for holistically designing, developing ... 3K
Python library for converting Apache Spark ML pipelines to PMML 3K
Implementation of DIN SPEC 70121, ISO 15118-2 and -20 specs for SECC 3K
Python wrapper for KoalaNLP (Korean NLP with Java/Scala) 3K
Bodo's Vectorized SQL execution engine for clusters 3K
ProActive scheduler client module 2K
A Python library for reading XBRL reports 2K
Apache DolphinScheduler Python API, aka PyDolphinscheduler. 2K
CLI tool for the Zipline AI platform 2K
ScienceWorld: An interactive text environment to study AIagents on accomplishing... 2K
A neuroscience library for Python, intended to complement the existing nibabel l... 2K
A knowledge-graph-based digital twin of the world. 2K
This is python wrapper of STEAM SIGMA code 2K
A PySpark-based NLP pipeline made mainly for spelling correction and classifying... 2K
A random forest 2K
This Python API provides a high-level interface to interact with the NDTkit desk... 1K
Bodo Connector for Iceberg 1K
Alink is the Machine Learning algorithm platform based on Flink, developed by th... 1K
A probabilistic approach from an Improbabilistic company 1K
Bring colors to Euclid tiles! 1K
ScienceWorld is a text-based virtual environment centered around accomplishing t... 1K
JupyterNotebook Flink magics 1K
QuarticSDK is the SDK package which exposes the APIs to the user 1K
Geniusrise: Framework for building geniuses 1K
Convenience functions for interacting with the OLIVER workspace 1K
Apache SystemDS - An open source ML system for the end-to-end data science lifec... 1K
Apache Flink Python API 1K
The core of Musket ML 943