Dependents of silero-vad

54 dependents

Package	Description	Downloads/month
gamesentenceminer	An immersion toolkit for learning Languages through games and other visual media...	49K
wakewords	Tools for building wake-word and speech-command datasets and models.	7K
fish-speech	Fish Speech	6K
senko	Very fast, accurate speaker diarization	4K
diarize	Speaker diarization for Python — "who spoke when?" CPU-only, no API keys, Apache...	4K
aiavatar	🥰 Building AI-based conversational avatars lightning fast ⚡️💬	3K
rex-voice-assistant	Lightweight offline voice assistant for hands-free music control (YouTube Music ...	3K
spych	Use your voice to trigger events and communicate with AI Agents.	3K
verbatim	high quality multi-lingual speech to text	2K
easytranscriber	Speech recognition with accurate word-level timestamps.	2K
voxtream	VoXtream is a Full-Stream Zero-shot TTS model with Extremely Low Latency and Spe...	1K
realtime-client		1K
easyaligner	Forced alignment pipeline designed for efficiency and ease of use.	1K
oddasr	An ASR API server for FunASR	1K
dora-vad	Dora Node for Text translating using Argostranslate	1K
qwen3-asr-toolkit	Python toolkit for the Qwen3-ASR API—parallel high‑throughput calls, robust long...	1K
jusflaudio	Useful audio tools	959
voxlevel	Normalize WAV voice recordings to a consistent target dB level using AGC, VAD, a...	841
audiozen	Audio ZEN is a library for audio/speech signal processing.	809
vocal-core	Generic Speech AI Platform - Ollama for Voice Models	645
abcmrt16	Package to run ABC-MRT16 intelligibility tests	610
miya-speechless	Speechless repo for sales call analysis	440
ppp-svc-helper	Utility library for singing voice conversion work	426
chirp-notes-ai	Meeting recorder CLI that transcribes and generates AI notes	423
roohai	Modular real-time voice agent framework with swappable STT, LLM, TTS, and VAD co...	414
local-wake	Lightweight local wake word detection that recognizes phrases with just a few us...	393
o2-speechless	Speechless repo for sales call analysis	384
ttsds	The TTSDS benchmark evaluates synthetic speech quality by considering prosody, s...	370
razel-py-cli	Razel Python CLI (Typer): installable command-line tool	367
interweave	Voice I/O MCP server for Claude Code — speak and listen through your mic and spe...	365
input-audio	Python input audio.	347
wespeaker-unofficial	Unofficial wespeaker pypi package	343
vaani	macOS menu bar app that captures voice, transcribes, and enhances text with AI	329
localtalk	A local/offline-capable voice assistant with speech recognition, LLM processing,...	329
audiotool	audiotool is a DeepLearning utility library.	325
bilbo-audiobook	Bilingual audiobook interleaver.	319
livecaption	Real-time audio transcription for video streaming with Firefox browser integrati...	318
sceneflow	Smart video cut point detection for AI-generated talking head videos using multi...	304
tvas	Travel Vlog Automation System - Automate vlog ingestion, junk detection, and DaV...	254
humaware-vad	HumAwareVAD: A optimized voice activity detection model to better distinguish hu...	247
cascade-vad	Cascade is a production-ready, high-performance, and low-latency audio stream pr...	229
openwakewordlistener	A wake word listener for Rhasspy	205
moonshine-lite	Lite wrapper for the useful-moonshine speech to text models	188
iflow-mcp-mourad-ghafiri-youtube-mcp-server	A YouTube MCP Server for video information and transcription	172
locivox	Local Voice Transcription System - Privacy-first, model-agnostic speech-to-text	149
cjm-transcription-utils	Miscellaneous utilities for helping with audio transcription.	147
jadia	JaNet diarization package	120
dswed	A package for computing DS-WED.	118
goobits-stt	GOOBITS STT - Pure speech-to-text engine with multiple operation modes	112
viet-tts	VietTTS: An Open-Source Vietnamese Text to Speech	112