Dependents of webrtcvad

59 dependents

Package	Description	Downloads/month
pyvad	py-webrtcvad wrapper for trimming speech clips	28K
ffsubsync	Automagically synchronize subtitles with video.	13K
genai-processors	GenAI Processors is a lightweight Python library that enables efficient, paralle...	5K
asrp	ASR text preprocessing utility	4K
dnn	Machine Learning Utilities	4K
paddlespeech	Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Stream...	3K
snipsskillscore	Core Python utilities for the Snips Manager	3K
abstractvoice	A modular Python library for voice interactions with AI systems	2K
ovos-vad-plugin-webrtcvad	ovos plugin for voice activity detection using webrtcvad	2K
jarvismode	A Python package with a built-in web application	1K
maxs	minimalist ai agent	1K
stt-listen	Transcribe long audio files with ASR or use the streaming interface	1K
piper-sample-generator	Generate TTS audio samples for training wake word systems	899
py-nltools	A collection of basic python modules for spoken natural language processing	774
audioprocessor	A Python package for recording, transcribing, and converting audio	732
lemonpepper	A real-time audio transcription and AI interaction tool	616
asaca	Automatic Speech Analysis for Cognitive Assessment	599
openai-stt	Speech-to-Text tool using Whisper, PyAudio, and VAD.	511
snipsmanagercore	Core Python utilities for the Snips Manager	482
lightwhisperstt	LightWhisperSTT – Fast, lightweight STT using Whisper.cpp	472
voicellm	A modular Python library for voice interactions with AI systems, featuring high-...	427
multimodal-parsers	PDF processing pipeline: remove headers/footers, convert to markdown, and genera...	410
findsub	FindSub is an Application for automatically downloading and ranking subtitles ba...	348
dspeech	A Speech-to-Text toolkit with VAD, punctuation, and emotion classification	343
pydiar	simple to use, pretrained/training-less models for speaker diarization	319
realtime-mlx-stt	Real-time speech-to-text transcription optimized for Apple Silicon	302
jarvismode-base	A Python package with a built-in web application	300
hermes-audio-server	An open source implementation of the audio server part of the Hermes protocol	273
klaus-assistant	Voice-powered research assistant for physical books and papers	271
mmanalyser	a tool for multimedia	242
softvad	A simple mic utility, for streaming with vad. You can use it, but its not recomm...	230
echonanny	Remote access to your PC microphone & voice detection with Web UI	221
mockingbirdforuse		199
shruti	Indic Conformer ASR Lib	195
mixsim	An open-source dataset for multiple purposes, such as speaker localization/track...	185
radtts	Provides training, inference and voice conversion recipes for RADTTS and RADTTS+...	176
pyrtstools	Tools for speech processing, keyword spotting	170
emotion-framework	Multimodal emotion recognition framework for video analysis	170
servai-model	Whisper 및 ECAPA-TDNN 기반의 실차 화자 식별 및 노이즈 보정 라이브러리	147
snips-respeaker	To build voice enabled objects/applications with Python and ReSpeaker	147
holdtranscribe	Hotkey-Activated Voice-to-Clipboard Transcriber	139
tgear-sdk	Tactigon Gear SDK to connect to Tactigon Skin wereable platform	136
mohamedboualamallah	A modular application for audio processing and Finch robot control.	135
ipa-recognizer	A pretrained IPA recognizer	131
livekit-plugins-induslabs	Agent Framework plugin for services using IndusLabs API.	115
vocoder-dictation	Dictation for programmers	105
pyloom-asr	Advanced real-time voice processing library using Whisper and Silero models	105
verifyvoice	A package for verifying the voice of a person	103
dyana-annotate	DYadic Annotation of Naturalistic Audio	80
samvaad	Samvaad is a speech-driven AI tutor that transforms PDFs, articles, and notes in...	80