24 dependents
| Package | Description | Downloads/month |
|---|---|---|
| 🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and p... | 202K | |
| Audiocraft is a library for audio processing and generation with deep learning. ... | 122K | |
| Generative models for conditional audio generation | 118K | |
| Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation... | 18K | |
| Interface for OuteTTS models. | 7K | |
| A lightweight library for Frechet Audio Distance calculation. | 3K | |
| An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo ... | 2K | |
| 🔊 Text-Prompted Generative Audio Model | 2K | |
| 🔊 Text2Speech, Voice-Cloning and Voice2Voice conversion with the text-prompted g... | 1K | |
| A simple library for Fréchet Audio Distance (FAD) calculation | 733 | |
| Superfeel adaptation of implementation of SoundStorm, Efficient Parallel Audio G... | 472 | |
| minimal deep learning framework | 420 | |
| Generative models for conditional audio generation | 371 | |
| Audio tokenization, in the fastest way possible! | 367 | |
| [NeurIPS 2025] PyTorch implementation of [ThinkSound], a unified framework for g... | 310 | |
| 🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and p... | 246 | |
| An open source implementation of Microsoft's VALL-E X zero-shot TTS | 245 | |
| A toolkit library for Kernel Audio Distance. | 206 | |
| MaskGCT model for TTSDB | 189 | |
| Comprehensive bidirectional voice-text CLI tool with Whisper and VibeVoice | 144 | |
| Audiocraft is a library for audio processing and generation with deep learning. ... | 141 | |
| NViXTTS_pl | 103 | |
| A benchmarking suite for robust audio watermarking. | 82 | |
| Deep learning for Text to Speech. | 63 |