71 dependents
| Description | Downloads/month |
|---|---|
| A framework for efficient model inference with omni-modality models | 477K | |
| Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech wi... | 104K | |
| Implementation of MeshGPT, SOTA Mesh generation using Attention, in Pytorch | 16K | |
| pix2tex: Using a ViT to convert images of equations into LaTeX code. | 11K | |
| Implementation of rectified flow and some of its followup research / improvement... | 9K | |
| Implementation of π₀, the robotic foundation model architecture proposed by Phys... | 9K | |
| Unofficial implementation of Titans, SOTA memory for transformers, in Pytorch | 7K | |
| Implementation of the MetaController proposed in "Emergent temporal abstractions... | 7K | |
| Implementation of E2-TTS, "Embarrassingly Easy Fully Non-Autoregressive Zero-Sho... | 5K | |
| Implementation of MagViT2 Tokenizer in Pytorch | 5K | |
| Implementation of CALM from the paper "LLM Augmented LLMs: Expanding Capabilitie... | 4K | |
| Implementation of a transformer for reinforcement learning using `x-transformers... | 4K | |
| Implementation of the Large Behavioral Model architecture for dexterous manipula... | 3K | |
| Implementation of Q-Transformer, Scalable Offline Reinforcement Learning via Aut... | 2K | |
| x-evolution | 2K | |
| Memory-Augmented Sequence Models in Pytorch | 2K | |
| Implementation of 🥥 Coconut, Chain of Continuous Thought, in Pytorch | 2K | |
| A python framework accelerating ML based discovery in the medical field by encou... | 2K | |
| Exploration into the proposed architecture from Sapient Intelligence of Singapor... | 2K | |
| Implementation of the proposed Spline-Based Transformer from Disney Research | 2K | |
| GotenNet in Pytorch | 2K | |
| SILMA TTS v1 Official Repo — a Lightweight Open Bilingual Text to Speech Model | 2K | |
| Implementation of Autoregressive Diffusion in Pytorch | 2K | |
| A simple and efficient Minecraft Agent development kit for AI research. | 2K | |
| VoiceRestore: Flow-Matching Transformers for Universal Speech Restoration | 1K | |
| Discrete Distribution Network | 1K | |
| Implementation of LVSM, SOTA Large View Synthesis with Minimal 3d Inductive Bias... | 1K | |
| Genie2 | 1K | |
| A utilities library for model training subnets. | 1K | |
| Implementation of Lumiere, SOTA text-to-video generation from Google Deepmind, i... | 1K | |
| An easy-to-use library and command-line tool for TTS | 1K | |
| MMDiT | 1K | |
| Quartic Transformer | 961 | |
| Tiny Recursive Model | 958 | |
| Implementation of ReWiND, "Language-Guided Rewards Teach Robot Policies without ... | 949 | |
| Implementation of the model architecture for SRT-H | 942 | |
| Implementation of 🌻 Mirasol, SOTA Multimodal Autoregressive model out of Google ... | 802 | |
| This repository contains the code to train and evaluate TRIBE v2, a multimodal m... | 756 | |
| F5-TTS: Thai Text-to-Speech (TTS), a tool for generating speech from text using techn... | 749 | |
| Exploration into the Scaling Value Iteration Networks paper, from Schmidhuber's ... | 653 | |
| A MEDS PyTorch Dataset, leveraging an on-the-fly retrieval strategy for flexible,... | 635 | |
| X-Voice | 571 | |
| Enhanced multimodal fMRI brain encoding toolkit built on Meta's TRIBE v2. Featur... | 549 | |
| Explorations into adversarial losses on top of autoregressive loss for language ... | 477 | |
| ViLLa-X | 422 | |
| Experimental Manipulation with flow matching | 416 | |
| Implementation of TiTok, proposed by Bytedance in "An Image is Worth 32 Tokens f... | 403 | |
| Generative models for conditional audio generation | 371 | |
| Amplify | 360 | |
| Implementation of Dex1B: Learning with 1B Demonstrations for Dexterous Manipulat... | 311 |