PyPI Stats
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About
Home

Search Packages

Find Python packages by name, description, GitHub topic, or filter by metrics
pytorch
torchrl

A modular, primitive-first, python-first PyTorch library for Reinforcement Learning.

1.3M 3K 450
JudgmentLabs
judgeval

The open source post-building layer for agents. Our environment data and evals power agent post-training (RL, SFT) and monitoring.

330K 1K 91
Stable-Baselines-Team
sb3-contrib

Contrib package for Stable-Baselines3 - Experimental reinforcement learning (RL) code

301K 712 240
neptune-ai
neptune

📘 The experiment tracker for foundation model training

206K 622 75
hud-evals
hud-python

OSS RL environment + evals toolkit

100K 248 57
thu-ml
tianshou

An elegant PyTorch deep reinforcement learning library.

84K 11K 1K
neptune-ai
neptune-client

📘 The experiment tracker for foundation model training

57K 622 75
pytorch
torchrl-nightly

A modular, primitive-first, python-first PyTorch library for Reinforcement Learning.

37K 3K 450
google
dopamine-rl

Dopamine is a research framework for fast prototyping of reinforcement learning algorithms.

26K 11K 1K
google-research
rliable

[NeurIPS'21 Outstanding Paper] Library for reliable evaluation on RL and ML benchmarks, even with only a handful of seeds.

15K 872 49
jiauzhang
torchstudio

Deep Learning Experiment

7K 2 0
instadeepai
flashbax

⚡ Flashbax: Accelerated Replay Buffers in JAX

7K 279 23
yamoling
multi-agent-rlenv

Strongly typed reinforcement learning environment framework

6K 1 1
axon-rl
gem-llm

A Gym for Agentic LLMs

6K 478 32
DLR-RM
rl-zoo3

A training framework for Stable Baselines3 reinforcement learning agents, with hyperparameter optimization and pre-trained agents included.

6K 3K 595
epignatelli
navix

Accelerated minigrid environments with JAX

4K 170 21
pfeinsper
dsse

The Drone Swarm Search project provides an environment for SAR missions built on PettingZoo, where agents, represented by drones, are tasked with locating targets identified as shipwrecked individuals.

4K 72 14
inclusionAI
awex

A high-performance RL training-inference weight synchronization framework, designed to enable second-level parameter updates from training to inference in RL workflows

3K 150 17
abundant-ai
oddish

Run Harbor tasks in the cloud with scheduling, monitoring, and persistent state

3K 7 1
lguibr
trianglengin

The core logic for the Triangle Puzzle game. Features a fast C++ backend, Pybind11 wrappers, and a Python API designed for AI/ML development and simulation.

3K 0 0
luccabb
moonfish

~2000 Elo Python Chess Engine that implements: Negamax, PeSTO’s Evaluation, Null Move, Quiescence Search, Lazy SMP.

2K 25 4
sintefneodroid
neodroid

Python interface for the Neodroid platform 💻

1K 8 5
gbionics
amp-rsl-rl

🔁 AMP-RSL-RL: Adversarial Motion Priors for robotic RL (PPO + motion imitation)

1K 303 25
UniEnvOrg
unienv

Framework unifying robot environments and data APIs

1K 0 0
    • Data from PyPI, GitHub, ClickHouse, and BigQuery