60 dependents
Package Description Downloads/month
Generative models for conditional audio generation 118K
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech wi... 104K
Implementation of Alphafold 3 from Google Deepmind in Pytorch 28K
Imagen - unprecedented photorealism × deep level of language understanding 20K
Implementation of Danijar's latest iteration for his Dreamer line of work 19K
Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation... 18K
Implementation of Denoising Diffusion Probabilistic Model in Pytorch 18K
Implementation of MeshGPT, SOTA Mesh generation using Attention, in Pytorch 16K
Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis neural netw... 15K
Implementation of rectified flow and some of its followup research / improvement... 9K
Implementation of π₀, the robotic foundation model architecture proposed by Phys... 9K
Explorations into the proposed Streaming Deep Reinforcement Learning, from Unive... 7K
Implementation of HS-TasNet, "Real-time Low-latency Music Source Separation usin... 6K
Implementation of E2-TTS, "Embarrassingly Easy Fully Non-Autoregressive Zero-Sho... 5K
Implementation of MagViT2 Tokenizer in Pytorch 5K
Implementation of ETSformer, state of the art time-series Transformer, in Pytorc... 5K
Implementation of Soft Actor Critic and some of its improvements in Pytorch 5K
Pytorch implementation of Evolutionary Policy Optimization, from Wang et al. of ... 4K
Implementation of a transformer for reinforcement learning using `x-transformers... 4K
Pytorch implementation of Transfusion, "Predict the Next Token and Diffuse Image... 4K
Implementation and explorations into Blackbox Gradient Sensing (BGS), an evoluti... 3K
RIN - Recurrent Interface Network - Pytorch 3K
Implementation of the new SOTA for model based RL, from the paper "Improving Tra... 3K
Implementation of Phenaki Video, which uses Mask GIT to produce text guided vide... 3K
Contrastive Reinforcement Learning 3K
Crilla is a simple way to introduce optimized single-GPU training into your proj... 3K
Natural Speech 2 - Pytorch 3K
Implementation of Q-Transformer, Scalable Offline Reinforcement Learning via Aut... 2K
Implementation of the training framework proposed in Self-Rewarding Language Mod... 2K
Implementation of Muse: Text-to-Image Generation via Masked Generative Transform... 2K
Gaia2 - Pytorch 2K
An Implementation of Temporal Straightening for Latent Planning 2K
SILMA TTS v1 Official Repo — a Lightweight Open Bilingual Text to Speech Model 2K
Implementation of Autoregressive Diffusion in Pytorch 2K
MC Dropout (Gal & Ghahramani, 2016) - Pytorch 2K
Discrete Distribution Network 1K
Deep Ensembles - Pytorch 1K
Implementation of Parti, Google's pure attention-based text-to-image neural netw... 1K
Tiny Recursive Model 958
Value networks 863
Implementation of Bit Diffusion, Hinton's group's attempt at discrete denoising ... 771
SDFT - Pytorch 648
X-Voice 571
Starlight - unprecedented photorealism × deep level of language understanding 495
Superfeel adaptation of implementation of SoundStorm, Efficient Parallel Audio G... 472
Train and inference for Computer Vision models made easy. 446
Generative models for conditional audio generation 371
A simple, hackable text-to-speech system in PyTorch and MLX 342
[NeurIPS 2025] PyTorch implementation of [ThinkSound], a unified framework for g... 310
Opensynth is a library forsynthetic energy demand generation. 295