24 dependents
Package Description Downloads/month
coqui-ai tts 🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and p... 202K
Audiocraft is a library for audio processing and generation with deep learning. ... 122K
Generative models for conditional audio generation 118K
Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation... 18K
Interface for OuteTTS models. 7K
A lightweight library for Frechet Audio Distance calculation. 3K
An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo ... 2K
🔊 Text-Prompted Generative Audio Model 2K
🔊 Text2Speech, Voice-Cloning and Voice2Voice conversion with the text-prompted g... 1K
A simple library for Fréchet Audio Distance (FAD) calculation 733
Superfeel adaptation of implementation of SoundStorm, Efficient Parallel Audio G... 472
minimal deep learning framework 420
Generative models for conditional audio generation 371
Audio tokenization, in the fastest way possible! 367
[NeurIPS 2025] PyTorch implementation of [ThinkSound], a unified framework for g... 310
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and p... 246
An open source implementation of Microsoft's VALL-E X zero-shot TTS 245
A toolkit library for Kernel Audio Distance. 206
MaskGCT model for TTSDB 189
Comprehensive bidirectional voice-text CLI tool with Whisper and VibeVoice 144
Audiocraft is a library for audio processing and generation with deep learning. ... 141
NViXTTS_pl 103
A benchmarking suite for robust audio watermarking. 82
Deep learning for Text to Speech. 63