17 dependents
| Package | Description | Downloads/month |
|---|---|---|
| Generative models for conditional audio generation | 118K | |
| Text-Acoustic Dual-Aligned Language Model | 12K | |
| Interface for OuteTTS models. | 7K | |
| Fish Speech | 6K | |
| Vox box | 3K | |
| Nodetool is a no-code development environment for Artificial Intelligence, enabl... | 1K | |
| Dia-JAX: A JAX port of Dia, the text-to-speech model for generating realistic di... | 778 | |
| Boson Multimodal - A multimodal AI framework | 657 | |
| Audio generation using diffusion models, in PyTorch. | 530 | |
| Generative models for conditional audio generation | 371 | |
| [NeurIPS 2025] PyTorch implementation of [ThinkSound], a unified framework for g... | 310 | |
| The official implementation of TokenSynth (ICASSP 2025) | 199 | |
| VoiceHub: A Unified Inference Interface for TTS Models | 179 | |
| SongBloom package for tts-webui | 122 | |
| VoiceStudio: A unified toolkit for text-style prompted speech synthesis, voice a... | 119 | |
| A benchmarking suite for robust audio watermarking. | 82 | |
| Tokenize once, train forever: High-performance latent data persistence for gener... | 6 |