25 dependents
Package Description Downloads/month
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrai... 357K
End-to-End Speech Processing Toolkit 30K
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Stream... 3K
Vox box 3K
A SOTA Industrial-Grade Voice Activity Detection & Audio Event Detection, suppor... 3K
FunCodec is a research-oriented toolkit for audio quantization and downstream ap... 2K
FireRed ASR for fasr (bundled fireredasr2 inference) 1K
FireRedVAD for fasr (bundled fireredvad inference) 1K
FireRedLID language identification model for fasr 1K
Speech audio tools based on Paddlepaddle 1K
Ukrainian TTS (text-to-speech) using ESPNET 712
Speech Diarization and Speaker Embedding 533
Voice conversion toolkit based on S3PRL: Self-Supervised Speech/Sound Pre-traini... 473
山东联通产互AI工具箱 390
Unofficial wespeaker pypi package 340
Librería para detectar emociones en imágenes y audio usando Robobo 321
Production-ready transcription and diarization pipeline with parallel processing 286
FireRed ASR 245
Indic Conformer ASR Lib 195
Speaker Embedding/Diarization, ASVSpoof, VAD, ASR, and more. 187
DELTA is a deep learning based natural language and speech processing platform. ... 153
FunCodec is a research-oriented toolkit for audio quantization and downstream ap... 138
Speaker Embedding 117
Speech Diarization and Speaker Embedding 73
NNSP: Neural network based end-to-end Speech Processing toolkit 54