71 dependents
Package Description Downloads/month
A framework for efficient model inference with omni-modality models 477K
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech wi... 104K
Implementation of MeshGPT, SOTA Mesh generation using Attention, in Pytorch 16K
pix2tex: Using a ViT to convert images of equations into LaTeX code. 11K
Implementation of rectified flow and some of its followup research / improvement... 9K
Implementation of π₀, the robotic foundation model architecture proposed by Phys... 9K
Unofficial implementation of Titans, SOTA memory for transformers, in Pytorch 7K
Implementation of the MetaController proposed in "Emergent temporal abstractions... 7K
Implementation of E2-TTS, "Embarrassingly Easy Fully Non-Autoregressive Zero-Sho... 5K
Implementation of MagViT2 Tokenizer in Pytorch 5K
Implementation of CALM from the paper "LLM Augmented LLMs: Expanding Capabilitie... 4K
Implementation of a transformer for reinforcement learning using `x-transformers... 4K
Implementation of the Large Behavioral Model architecture for dexterous manipula... 3K
Implementation of Q-Transformer, Scalable Offline Reinforcement Learning via Aut... 2K
x-evolution 2K
Memory-Augmented Sequence Models in Pytorch 2K
Implementation of 🥥 Coconut, Chain of Continuous Thought, in Pytorch 2K
A python framework accelerating ML based discovery in the medical field by encou... 2K
Exploration into the proposed architecture from Sapient Intelligence of Singapor... 2K
Implementation of the proposed Spline-Based Transformer from Disney Research 2K
GotenNet in Pytorch 2K
SILMA TTS v1 Official Repo — a Lightweight Open Bilingual Text to Speech Model 2K
Implementation of Autoregressive Diffusion in Pytorch 2K
A simple and efficient Minecraft Agent development kit for AI research. 2K
VoiceRestore: Flow-Matching Transformers for Universal Speech Restoration 1K
Discrete Distribution Network 1K
Implementation of LVSM, SOTA Large View Synthesis with Minimal 3d Inductive Bias... 1K
Genie2 1K
A utilities library for model training subnets. 1K
Implementation of Lumiere, SOTA text-to-video generation from Google Deepmind, i... 1K
An easy-to-use library and command-line tool for TTS 1K
MMDiT 1K
Quartic Transformer 961
Tiny Recursive Model 958
Implementation of ReWiND, "Language-Guided Rewards Teach Robot Policies without ... 949
Implementation of the model architecture for SRT-H 942
Implementation of 🌻 Mirasol, SOTA Multimodal Autoregressive model out of Google ... 802
This repository contains the code to train and evaluate TRIBE v2, a multimodal m... 756
F5-TTS: Text-to-Speech (TTS) ภาษาไทย — เครื่องมือสร้างเสียงพูดจากข้อความด้วยเทคน... 749
Exploration into the Scaling Value Iteration Networks paper, from Schmidhuber's ... 653
A MEDS PyTorch Dataset, leveraging a on-the-fly retrieval strategy for flexible,... 635
X-Voice 571
Enhanced multimodal fMRI brain encoding toolkit built on Meta's TRIBE v2. Featur... 549
Explorations into adversarial losses on top of autoregressive loss for language ... 477
ViLLa-X 422
Experimental Manipulation with flow matching 416
Implementation of TiTok, proposed by Bytedance in "An Image is Worth 32 Tokens f... 403
Generative models for conditional audio generation 371
Amplify 360
Implementation of Dex1B: Learning with 1B Demonstrations for Dexterous Manipulat... 311