41 dependents
Package Description Downloads/month
G2P mix 187K
Style-Bert-VITS2: Bert-VITS2 with more controllable voice styles. 5K
Implementation of E2-TTS, "Embarrassingly Easy Fully Non-Autoregressive Zero-Sho... 5K
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Stream... 3K
2K
The Smallest English TTS Model with only 1M parameters 2K
Inference code for GPT-SoVITS 1K
VoXtream is a Full-Stream Zero-shot TTS model with Extremely Low Latency and Spe... 1K
Local MCP voice coach with English pronunciation, grammar, fluency, phoneme-leve... 1K
Multilingual neural TTS (6 languages: JA/EN/ZH/ES/FR/PT, code supports SV) — C++... 1K
ONNX Wrapper for ESPnet 1K
ForceAlign is a Python library for forced alignment of English text to English a... 1K
Using LLMs and rules for a local personal agent 960
PORORO: Platform Of neuRal mOdels for natuRal language prOcessing 809
Korean to Katakana 754
Automatic Speech Recognition(ASR), Text-To-Speech(TTS) engine. Chinese-English speech recognition, multi-speaker speech synthesis, ... 687
A python forced alignment package 641
424
Python forced alignment 383
The TTS Text Frontend for the use of own 370
speech agent for 100 hours 336
Python wrapper for fast inference with GPT-SoVITS 328
Production-ready transcription and diarization pipeline with parallel processing 286
Text-to-Speech module for Illufly AI 248
MeloPlus: Advanced Python Library for MeloTts 225
Indic Conformer ASR Lib 195
MaskGCT model for TTSDB 189
Multimodal emotion recognition framework for video analysis 170
Meloplus: Advanced python library for Melotts 165
SongBloom package for tts-webui 122
Create pronunciation dictionary using g2p 121
a simple audio feature extraction tool 117
deepponies tts plugin for OpenVoiceOS 105
Aggregate Linguistic Analysis of Speech Transcripts for Research 102
85
Comprehensive Linguistic Analysis of Text for Research 70
Ominix TTS: A multilingual TTS system 58
46
36
Agent toolkit for 100 hours of speech and 10 GiB of text 3
1