41 dependents
| Package | Description | Downloads/month |
|---|---|---|
| G2P mix | 187K | |
| Style-Bert-VITS2: Bert-VITS2 with more controllable voice styles. | 5K | |
| Implementation of E2-TTS, "Embarrassingly Easy Fully Non-Autoregressive Zero-Sho... | 5K | |
| Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Stream... | 3K | |
| 2K | ||
| The Smallest English TTS Model with only 1M parameters | 2K | |
| Inference code for GPT-SoVITS | 1K | |
| VoXtream is a Full-Stream Zero-shot TTS model with Extremely Low Latency and Spe... | 1K | |
| Local MCP voice coach with English pronunciation, grammar, fluency, phoneme-leve... | 1K | |
| Multilingual neural TTS (6 languages: JA/EN/ZH/ES/FR/PT, code supports SV) — C++... | 1K | |
| ONNX Wrapper for ESPnet | 1K | |
| ForceAlign is a Python library for forced alignment of English text to English a... | 1K | |
| Using LLMs and rules for a local personal agent | 960 | |
| PORORO: Platform Of neuRal mOdels for natuRal language prOcessing | 809 | |
| Korean to Katakana | 754 | |
| Automatic Speech Recognition(ASR), Text-To-Speech(TTS) engine. 中英语音识别、多角色语音合成,支... | 687 | |
| A python forced alignment package | 641 | |
| 424 | ||
| Python forced alignment | 383 | |
| The TTS Text Frontend for the use of own | 370 | |
| speech agent for 100 hours | 336 | |
| Python wrapper for fast inference with GPT-SoVITS | 328 | |
| Production-ready transcription and diarization pipeline with parallel processing | 286 | |
| Text-to-Speech module for Illufly AI | 248 | |
| MeloPlus: Advanced Python Library for MeloTts | 225 | |
| Indic Conformer ASR Lib | 195 | |
| MaskGCT model for TTSDB | 189 | |
| Multimodal emotion recognition framework for video analysis | 170 | |
| Meloplus: Advanced python library for Melotts | 165 | |
| SongBloom package for tts-webui | 122 | |
| Create pronuciation dictionary using g2p | 121 | |
| a simple audio feature extraction tool | 117 | |
| deepponies tts plugin for OpenVoiceOS | 105 | |
| Aggregate Linguistic Analysis of Speech Transcripts for Research | 102 | |
| 85 | ||
| Comprehensive Linguistic Analysis of Text for Research | 70 | |
| Ominix TTS: A multilingual TTS system | 58 | |
| 46 | ||
| 36 | ||
| Agent toolkit for 100 hours of speech and 10 GiB of text | 3 | |
| 1 |