18 dependents
| Package | Description | Downloads/month |
|---|---|---|
| An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, ... | 8K | |
| Femtosense Model Optimization Toolkit | 4K | |
| Modular media quality metrics toolkit. | 3K | |
| A Repository for Single- and Multi-modal Speaker Verification, Speaker Recogniti... | 2K | |
| Speech Toolkit for Malaysian language, https://malaya-speech.readthedocs.io/ | 1K | |
| SyncNet's modern implementation (Python 3.9~3.13) | 802 | |
| Distill the Automatic Speech Recognition (TensorFlow) | 643 | |
| A tool for automatic phoneme transcription | 491 | |
| Video Generator part of the Chatacter Backend | 420 | |
| 精度検証やパラメータチューニングで使用する関数群のライブラリ | 354 | |
| simple to use, pretrained/training-less models for speaker diarization | 319 | |
| DeepTalk Active Speaker Detection | 212 | |
| Femtosense Model Optimization Toolkit | 178 | |
| A convenience python wrapper for the TIMIT database. | 169 | |
| LASER ASD - Lip Landmark Assisted Speaker Detection for Active Speaker Detection | 141 | |
| A package designed to compose speaker verification systems | 112 | |
| Python Spoken Language Toolkit | 61 | |
| A research-based framework for exploring sound as well as machine learning in th... | 53 |