23 dependents
Package Description Downloads/month
An implementation of local windowed attention for language modeling 461K
Implementation of Alphafold 3 from Google Deepmind in Pytorch 28K
Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation... 18K
Implementation of TabTransformer, attention network for tabular data, in Pytorch 13K
Implementation of rectified flow and some of its followup research / improvement... 9K
Unofficial implementation of Titans, SOTA memory for transformers, in Pytorch 7K
Implementation of Band Split Roformer, SOTA Attention network for music source s... 7K
Implementation of E2-TTS, "Embarrassingly Easy Fully Non-Autoregressive Zero-Sho... 5K
LocoFormer - Generalist Locomotion via Long-Context Adaptation 4K
Pytorch implementation of Transfusion, "Predict the Next Token and Diffuse Image... 4K
Implementation of the new SOTA for model based RL, from the paper "Improving Tra... 3K
Implementation of Q-Transformer, Scalable Offline Reinforcement Learning via Aut... 2K
Implementation of Recurrent Memory Transformer, Neurips 2022 paper, in Pytorch 2K
Memory-Augmented Sequence Models in Pytorch 2K
Unofficial implementation of iTransformer - SOTA Time Series Forecasting using A... 2K
Implementation of 🥥 Coconut, Chain of Continuous Thought, in Pytorch 2K
Implementation of SoundStorm, Efficient Parallel Audio Generation from Google De... 2K
Gaia2 - Pytorch 2K
GotenNet in Pytorch 2K
MMDiT 1K
Quartic Transformer 961
Implementation of 2-simplicial attention proposed by Clift et al. (2019) and the... 800
Titans architecture and MIRAS framework for test-time memorization in long-conte... 66