31 dependents
| Description | Downloads/month | |
|---|---|---|
| senselab is a Python package that simplifies building pipelines for biometric (e... | 13K | |
| An Open Source text-to-speech system built by inverting Whisper. | 5K | |
| Speechlib is a library that unifies speaker diarization, transcription and speak... | 3K | |
| speech language detection plugin | 3K | |
| Simplified diarization pipeline using some pretrained models - audio file to dia... | 2K | |
| A Modular and Extensible Deep Learning Toolkit for Computer Audition Tasks. | 1K | |
| Templates to generate and/or extract text and image embeddings using HuggingFace | 1K | |
| Psychological and Social Interactions Feature Extraction | 1K | |
| An ML package for GStreamer | 1K | |
| Google EMEA gTech Ads Data Science Team's solution to automatically translate an... | 1K | |
| A CLI + Web tool for speaker enrollment and identification using SpeechBrain. | 1K | |
| Python library for digital measurement of health | 918 | |
| User friendly toolkit for generating immersion language learning tools including... | 704 | |
| A python forced alignment package | 641 | |
| SONATA (SOund and Narrative Advanced Transcription Assistant): An advanced ASR s... | 542 | |
| Speaker embedding for anime speech domain based on ECAPA_TDNN | 480 | |
| ASR pipeline for the ASR project | 326 | |
| Production-ready transcription and diarization pipeline with parallel processing | 286 | |
| A standalone service for transcribing audio files using WhisperX | 224 | |
| VocalID is an open-source Python library for voice authentication using ECAPA-TD... | 223 | |
| Speech Recognition plus diarization | 217 | |
| Speechlib is a library that unifies speaker diarization, transcription and speak... | 208 | |
| Objective vocal fatigue scoring from speech using health-centric ECAPA-TDNN-VHE ... | 168 | |
| In-vehicle speaker identification and noise correction library based on Whisper and ECAPA-TDNN | 147 | |
| offline realtime subtitle for mac | 145 | |
| 🎙️ Drop-in replacement for paid transcription APIs. Self-hosted, GPU-powered, sp... | 119 | |
| a simple audio feature extraction tool | 117 | |
| Audio noise reduction and enhancement library with multiple engines | 113 | |
| Hey Buddy is a tool for training wake-word-detecting neural networks for use in ... | 110 | |
| Spoof-Aware Speaker Verification System | 103 | |
| Vectorization & RAG Toolkit | 58 | |