17 dependents
| Package description | Downloads/month |
| --- | --- |
| Generative models for conditional audio generation | 118K |
| Text-Acoustic Dual-Aligned Language Model | 12K |
| Interface for OuteTTS models. | 7K |
| Fish Speech | 6K |
| Vox box | 3K |
| Nodetool is a no-code development environment for Artificial Intelligence, enabl... | 1K |
| Dia-JAX: A JAX port of Dia, the text-to-speech model for generating realistic di... | 778 |
| Boson Multimodal - A multimodal AI framework | 657 |
| Audio generation using diffusion models, in PyTorch. | 530 |
| Generative models for conditional audio generation | 371 |
| [NeurIPS 2025] PyTorch implementation of [ThinkSound], a unified framework for g... | 310 |
| The official implementation of TokenSynth (ICASSP 2025) | 199 |
| VoiceHub: A Unified Inference Interface for TTS Models | 179 |
| SongBloom package for tts-webui | 122 |
| VoiceStudio: A unified toolkit for text-style prompted speech synthesis, voice a... | 119 |
| A benchmarking suite for robust audio watermarking. | 82 |
| Tokenize once, train forever: High-performance latent data persistence for gener... | 6 |