60 dependents
| Description | Downloads/month |
|---|---|
| Generative models for conditional audio generation | 118K |
| Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech wi... | 104K |
| Implementation of Alphafold 3 from Google Deepmind in Pytorch | 28K |
| Imagen - unprecedented photorealism × deep level of language understanding | 20K |
| Implementation of Danijar's latest iteration for his Dreamer line of work | 19K |
| Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation... | 18K |
| Implementation of Denoising Diffusion Probabilistic Model in Pytorch | 18K |
| Implementation of MeshGPT, SOTA Mesh generation using Attention, in Pytorch | 16K |
| Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis neural netw... | 15K |
| Implementation of rectified flow and some of its followup research / improvement... | 9K |
| Implementation of π₀, the robotic foundation model architecture proposed by Phys... | 9K |
| Explorations into the proposed Streaming Deep Reinforcement Learning, from Unive... | 7K |
| Implementation of HS-TasNet, "Real-time Low-latency Music Source Separation usin... | 6K |
| Implementation of E2-TTS, "Embarrassingly Easy Fully Non-Autoregressive Zero-Sho... | 5K |
| Implementation of MagViT2 Tokenizer in Pytorch | 5K |
| Implementation of ETSformer, state of the art time-series Transformer, in Pytorc... | 5K |
| Implementation of Soft Actor Critic and some of its improvements in Pytorch | 5K |
| Pytorch implementation of Evolutionary Policy Optimization, from Wang et al. of ... | 4K |
| Implementation of a transformer for reinforcement learning using `x-transformers... | 4K |
| Pytorch implementation of Transfusion, "Predict the Next Token and Diffuse Image... | 4K |
| Implementation and explorations into Blackbox Gradient Sensing (BGS), an evoluti... | 3K |
| RIN - Recurrent Interface Network - Pytorch | 3K |
| Implementation of the new SOTA for model based RL, from the paper "Improving Tra... | 3K |
| Implementation of Phenaki Video, which uses Mask GIT to produce text guided vide... | 3K |
| Contrastive Reinforcement Learning | 3K |
| Crilla is a simple way to introduce optimized single-GPU training into your proj... | 3K |
| Natural Speech 2 - Pytorch | 3K |
| Implementation of Q-Transformer, Scalable Offline Reinforcement Learning via Aut... | 2K |
| Implementation of the training framework proposed in Self-Rewarding Language Mod... | 2K |
| Implementation of Muse: Text-to-Image Generation via Masked Generative Transform... | 2K |
| Gaia2 - Pytorch | 2K |
| An Implementation of Temporal Straightening for Latent Planning | 2K |
| SILMA TTS v1 Official Repo — a Lightweight Open Bilingual Text to Speech Model | 2K |
| Implementation of Autoregressive Diffusion in Pytorch | 2K |
| MC Dropout (Gal & Ghahramani, 2016) - Pytorch | 2K |
| Discrete Distribution Network | 1K |
| Deep Ensembles - Pytorch | 1K |
| Implementation of Parti, Google's pure attention-based text-to-image neural netw... | 1K |
| Tiny Recursive Model | 958 |
| Value networks | 863 |
| Implementation of Bit Diffusion, Hinton's group's attempt at discrete denoising ... | 771 |
| SDFT - Pytorch | 648 |
| X-Voice | 571 |
| Starlight - unprecedented photorealism × deep level of language understanding | 495 |
| Superfeel adaptation of implementation of SoundStorm, Efficient Parallel Audio G... | 472 |
| Train and inference for Computer Vision models made easy. | 446 |
| Generative models for conditional audio generation | 371 |
| A simple, hackable text-to-speech system in PyTorch and MLX | 342 |
| [NeurIPS 2025] PyTorch implementation of [ThinkSound], a unified framework for g... | 310 |
| Opensynth is a library for synthetic energy demand generation. | 295 |