24 dependents
| Package | Description | Downloads/month |
|---|---|---|
| Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech wi... | 104K | |
| endoreg-db | 25K | |
| senselab is a Python package that simplifies building pipelines for biometric (e... | 13K | |
| A generative speech model for daily dialogue. | 7K | |
| Implementation of E2-TTS, "Embarrassingly Easy Fully Non-Autoregressive Zero-Sho... | 5K | |
| An Open Source text-to-speech system built by inverting Whisper. | 5K | |
| Voicebox - Pytorch | 3K | |
| Vocos is a neural audio codec for high-quality audio compression and reconstruct... | 3K | |
| An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo ... | 2K | |
| SILMA TTS v1 Official Repo — a Lightweight Open Bilingual Text to Speech Model | 2K | |
| F5-TTS: Text-to-Speech (TTS) ภาษาไทย — เครื่องมือสร้างเสียงพูดจากข้อความด้วยเทคน... | 749 | |
| X-Voice | 571 | |
| An Open Source text-to-speech system built by inverting Whisper (fork of Whisper... | 410 | |
| ChatTTS is a generative speech model for daily dialogue. | 402 | |
| Resp-Agent: A multi-agent framework for respiratory sound diagnosis and generati... | 355 | |
| A simple, hackable text-to-speech system in PyTorch and MLX | 342 | |
| An open source implementation of Microsoft's VALL-E X zero-shot TTS | 245 | |
| LuxTTS MLX port for fast Apple Silicon inference. | 168 | |
| Streaming Vocos | 160 | |
| VoiceStudio: A unified toolkit for text-style prompted speech synthesis, voice a... | 119 | |
| F5-TTS model for TTSDB | 101 | |
| Prosody Modification Network | 94 | |
| E2 TTS model for TTSDB | 74 | |
| An unofficial reimplementation of F5TTS | 62 |