19 dependents
Package Description Downloads/month
Implementation of Danijar's latest iteration for his Dreamer line of work 19K
Implementation of TabTransformer, attention network for tabular data, in Pytorch 13K
Implementation of π₀, the robotic foundation model architecture proposed by Phys... 9K
Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of th... 8K
Implementation of the MetaController proposed in "Emergent temporal abstractions... 7K
Explorations into the proposed Streaming Deep Reinforcement Learning, from Unive... 7K
Pytorch implementation of Evolutionary Policy Optimization, from Wang et al. of ... 4K
LocoFormer - Generalist Locomotion via Long-Context Adaptation 4K
Implementation of a transformer for reinforcement learning using `x-transformers... 4K
Contrastive Reinforcement Learning 3K
x-evolution 2K
Implementation of Mimic-Video, Video-Action Models for SOTA Generalizable Robot ... 2K
Discrete Distribution Network 1K
Unofficial implementation of Hippoformer, Integrating Hippocampus-inspired Spati... 1K
Implementation of ReWiND, "Language-Guided Rewards Teach Robot Policies without ... 949
Value networks 863
Implementation of 2-simplicial attention proposed by Clift et al. (2019) and the... 800
ViLLa-X 422
Implementation of Dex1B: Learning with 1B Demonstrations for Dexterous Manipulat... 311