31 dependents
Package Description Downloads/month
senselab is a Python package that simplifies building pipelines for biometric (e... 13K
An Open Source text-to-speech system built by inverting Whisper. 5K
Speechlib is a library that unifies speaker diarization, transcription and speak... 3K
speech language detection plugin 3K
Simplified diarization pipeline using some pretrained models - audio file to dia... 2K
A Modular and Extensible Deep Learning Toolkit for Computer Audition Tasks. 1K
Templates to generate and/or extract text and image embeddings using HuggingFace 1K
Psychological and Social Interactions Feature Extraction 1K
An ML package for GStreamer 1K
Google EMEA gTech Ads Data Science Team's solution to automatically translate an... 1K
A CLI + Web tool for speaker enrollment and identification using SpeechBrain. 1K
Python library for digital measurement of health 918
User friendly toolkit for generating immersion language learning tools including... 704
A python forced alignment package 641
SONATA (SOund and Narrative Advanced Transcription Assistant): An advanced ASR s... 542
Speaker embedding for anime speech domain based on ECAPA_TDNN 480
ASR pipeline for the ASR project 326
Production-ready transcription and diarization pipeline with parallel processing 286
A standalone service for transcribing audio files using WhisperX 224
VocalID is an open-source Python library for voice authentication using ECAPA-TD... 223
Speech Recognition plus diarization 217
Speechlib is a library that unifies speaker diarization, transcription and speak... 208
Objective vocal fatigue scoring from speech using health-centric ECAPA-TDNN-VHE ... 168
A Whisper and ECAPA-TDNN based library for in-vehicle speaker identification and noise correction 147
offline realtime subtitle for mac 145
🎙️ Drop-in replacement for paid transcription APIs. Self-hosted, GPU-powered, sp... 119
a simple audio feature extraction tool 117
Audio noise reduction and enhancement library with multiple engines 113
Hey Buddy is a tool for training wake-word-detecting neural networks for use in ... 110
Spoof-Aware Speaker Verification System 103
Vectorization & RAG Toolkit 58