Rl Python Packages | PyPI Stats

torchrl

A modular, primitive-first, python-first PyTorch library for Reinforcement Learning.

1.4M 3K 450

judgeval

The open source post-building layer for agents. Our environment data and evals power agent post-training (RL, SFT) and monitoring.

349K 1K 91

sb3-contrib

Contrib package for Stable-Baselines3 - Experimental reinforcement learning (RL) code

296K 712 240

neptune

📘 The experiment tracker for foundation model training

208K 622 75

hud-python

OSS RL environment + evals toolkit

96K 248 57

tianshou

An elegant PyTorch deep reinforcement learning library.

87K 11K 1K

neptune-client

📘 The experiment tracker for foundation model training

57K 622 75

torchrl-nightly

A modular, primitive-first, python-first PyTorch library for Reinforcement Learning.

37K 3K 450

dopamine-rl

Dopamine is a research framework for fast prototyping of reinforcement learning algorithms.

27K 11K 1K

rliable

[NeurIPS'21 Outstanding Paper] Library for reliable evaluation on RL and ML benchmarks, even with only a handful of seeds.

17K 872 49

torchstudio

Deep Learning Experiment

7K 2 0

flashbax

⚡ Flashbax: Accelerated Replay Buffers in JAX

7K 279 23

multi-agent-rlenv

Strongly typed reinforcement learning environment framework

6K 1 1

gem-llm

A Gym for Agentic LLMs

6K 478 32

rl-zoo3

A training framework for Stable Baselines3 reinforcement learning agents, with hyperparameter optimization and pre-trained agents included.

5K 3K 595

navix

Accelerated minigrid environments with JAX

4K 170 21

awex

A high-performance RL training-inference weight synchronization framework, designed to enable second-level parameter updates from training to inference in RL workflows

4K 150 17

dsse

The Drone Swarm Search project provides an environment for SAR missions built on PettingZoo, where agents, represented by drones, are tasked with locating targets identified as shipwrecked individuals.

4K 72 14

trianglengin

The core logic for the Triangle Puzzle game. Features a fast C++ backend, Pybind11 wrappers, and a Python API designed for AI/ML development and simulation.

3K 0 0