25 dependents
| Package | Description | Downloads/month |
|---|---|---|
| A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrai... | 357K | |
| End-to-End Speech Processing Toolkit | 30K | |
| Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Stream... | 3K | |
| Vox box | 3K | |
| A SOTA Industrial-Grade Voice Activity Detection & Audio Event Detection, suppor... | 3K | |
| FunCodec is a research-oriented toolkit for audio quantization and downstream ap... | 2K | |
| FireRed ASR for fasr (bundled fireredasr2 inference) | 1K | |
| FireRedVAD for fasr (bundled fireredvad inference) | 1K | |
| FireRedLID language identification model for fasr | 1K | |
| Speech audio tools based on Paddlepaddle | 1K | |
| Ukrainian TTS (text-to-speech) using ESPNET | 712 | |
| Speech Diarization and Speaker Embedding | 533 | |
| Voice conversion toolkit based on S3PRL: Self-Supervised Speech/Sound Pre-traini... | 473 | |
| 山东联通产互AI工具箱 | 390 | |
| Unofficial wespeaker pypi package | 340 | |
| Librería para detectar emociones en imágenes y audio usando Robobo | 321 | |
| Production-ready transcription and diarization pipeline with parallel processing | 286 | |
| FireRed ASR | 245 | |
| Indic Conformer ASR Lib | 195 | |
| Speaker Embedding/Diarization, ASVSpoof, VAD, ASR, and more. | 187 | |
| DELTA is a deep learning based natural language and speech processing platform. ... | 153 | |
| FunCodec is a research-oriented toolkit for audio quantization and downstream ap... | 138 | |
| Speaker Embedding | 117 | |
| Speech Diarization and Speaker Embedding | 73 | |
| NNSP: Neural network based end-to-end Speech Processing toolkit | 54 |