15 dependents
Package Description Downloads/month
Combining advancements in discrete deep reinforcement learning 47K
Implementation of Danijar's latest iteration for his Dreamer line of work 18K
Implementation of π₀, the robotic foundation model architecture proposed by Phys... 9K
Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of th... 8K
Implementation of Soft Actor Critic and some of its improvements in Pytorch 5K
Implementation of a transformer for reinforcement learning using `x-transformers... 5K
Implementation of E2-TTS, "Embarrassingly Easy Fully Non-Autoregressive Zero-Sho... 5K
LocoFormer - Generalist Locomotion via Long-Context Adaptation 4K
Pytorch implementation of Evolutionary Policy Optimization, from Wang et al. of ... 4K
Implementation of Humanoid Standing Up, from the paper "Learning Humanoid Standi... 3K
Implementation of the new SOTA for model based RL, from the paper "Improving Tra... 3K
Contrastive Reinforcement Learning 3K
Implementation of Q-Transformer, Scalable Offline Reinforcement Learning via Aut... 3K
Implementation of ReWiND, "Language-Guided Rewards Teach Robot Policies without ... 983
Value networks 906