15 dependents
| Package | Description | Downloads/month |
|---|---|---|
| Combining advancements in discrete deep reinforcement learning | 47K | |
| Implementation of Danijar's latest iteration for his Dreamer line of work | 18K | |
| Implementation of π₀, the robotic foundation model architecture proposed by Phys... | 9K | |
| Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of th... | 8K | |
| Implementation of Soft Actor Critic and some of its improvements in Pytorch | 5K | |
| Implementation of a transformer for reinforcement learning using `x-transformers... | 5K | |
| Implementation of E2-TTS, "Embarrassingly Easy Fully Non-Autoregressive Zero-Sho... | 5K | |
| LocoFormer - Generalist Locomotion via Long-Context Adaptation | 4K | |
| Pytorch implementation of Evolutionary Policy Optimization, from Wang et al. of ... | 4K | |
| Implementation of Humanoid Standing Up, from the paper "Learning Humanoid Standi... | 3K | |
| Implementation of the new SOTA for model based RL, from the paper "Improving Tra... | 3K | |
| Contrastive Reinforcement Learning | 3K | |
| Implementation of Q-Transformer, Scalable Offline Reinforcement Learning via Aut... | 3K | |
| Implementation of ReWiND, "Language-Guided Rewards Teach Robot Policies without ... | 983 | |
| Value networks | 906 |