18 dependents
Package Description Downloads/month
An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, ... 8K
Femtosense Model Optimization Toolkit 4K
Modular media quality metrics toolkit. 3K
A Repository for Single- and Multi-modal Speaker Verification, Speaker Recogniti... 2K
Speech Toolkit for Malaysian language, https://malaya-speech.readthedocs.io/ 1K
SyncNet's modern implementation (Python 3.9~3.13) 802
Distill the Automatic Speech Recognition (TensorFlow) 643
A tool for automatic phoneme transcription 491
Video Generator part of the Chatacter Backend 420
精度検証やパラメータチューニングで使用する関数群のライブラリ 354
simple to use, pretrained/training-less models for speaker diarization 319
DeepTalk Active Speaker Detection 212
Femtosense Model Optimization Toolkit 178
A convenience python wrapper for the TIMIT database. 169
LASER ASD - Lip Landmark Assisted Speaker Detection for Active Speaker Detection 141
A package designed to compose speaker verification systems 112
Python Spoken Language Toolkit 61
A research-based framework for exploring sound as well as machine learning in th... 53