PyPI Stats

RL Python Packages

Python packages tagged with the GitHub topic rl, sorted by relevance and listed with their GitHub stars and monthly downloads.
pytorch
torchrl

A modular, primitive-first, python-first PyTorch library for Reinforcement Learning.

1.4M 3K 450
JudgmentLabs
judgeval

The open source post-building layer for agents. Our environment data and evals power agent post-training (RL, SFT) and monitoring.

349K 1K 91
Stable-Baselines-Team
sb3-contrib

Contrib package for Stable-Baselines3 - Experimental reinforcement learning (RL) code

296K 712 240
neptune-ai
neptune

📘 The experiment tracker for foundation model training

208K 622 75
hud-evals
hud-python

OSS RL environment + evals toolkit

96K 248 57
thu-ml
tianshou

An elegant PyTorch deep reinforcement learning library.

87K 11K 1K
neptune-ai
neptune-client

📘 The experiment tracker for foundation model training

57K 622 75
pytorch
torchrl-nightly

A modular, primitive-first, python-first PyTorch library for Reinforcement Learning.

37K 3K 450
google
dopamine-rl

Dopamine is a research framework for fast prototyping of reinforcement learning algorithms.

27K 11K 1K
google-research
rliable

[NeurIPS'21 Outstanding Paper] Library for reliable evaluation on RL and ML benchmarks, even with only a handful of seeds.

17K 872 49
jiauzhang
torchstudio

Deep Learning Experiment

7K 2 0
instadeepai
flashbax

⚡ Flashbax: Accelerated Replay Buffers in JAX

7K 279 23
yamoling
multi-agent-rlenv

Strongly typed reinforcement learning environment framework

6K 1 1
axon-rl
gem-llm

A Gym for Agentic LLMs

6K 478 32
DLR-RM
rl-zoo3

A training framework for Stable Baselines3 reinforcement learning agents, with hyperparameter optimization and pre-trained agents included.

5K 3K 595
epignatelli
navix

Accelerated minigrid environments with JAX

4K 170 21
inclusionAI
awex

A high-performance RL training-inference weight synchronization framework, designed to enable second-level parameter updates from training to inference in RL workflows

4K 150 17
pfeinsper
dsse

The Drone Swarm Search project provides an environment for SAR missions built on PettingZoo, where agents, represented by drones, are tasked with locating targets identified as shipwrecked individuals.

4K 72 14
lguibr
trianglengin

The core logic for the Triangle Puzzle game. Features a fast C++ backend, Pybind11 wrappers, and a Python API designed for AI/ML development and simulation.

3K 0 0
abundant-ai
oddish

Run Harbor tasks in the cloud with scheduling, monitoring, and persistent state

3K 7 1
luccabb
moonfish

~2000 Elo Python Chess Engine that implements: Negamax, PeSTO’s Evaluation, Null Move, Quiescence Search, Lazy SMP.

2K 25 4
sintefneodroid
neodroid

Python interface for the Neodroid platform 💻

1K 8 5
gbionics
amp-rsl-rl

🔁 AMP-RSL-RL: Adversarial Motion Priors for robotic RL (PPO + motion imitation)

1K 303 25
UniEnvOrg
unienv

Framework unifying robot environments and data APIs

1K 0 0
Data from PyPI, GitHub, ClickHouse, and BigQuery