42 dependents
Package Description Downloads/month
7K
Ultimate RVC 4K
Software Decoder for raw rf captures of laserdisc, vhs and other analog video fo... 4K
Open source, scalable acoustic data analysis for ecology and conservation 3K
Convert Ebooks to Audiobooks with [custom] voice samples 2K
Detect silence segment from speech signal. 2K
AI outbound voice agent framework 1K
1K
ArtBox is a tool set for handling multimedia files. 993
Classification of Activities of Daily Living(ADL) using depth videos and audio 701
VAD-driven streaming voice dictation for macOS — local Whisper ASR + Silero VAD ... 683
A simple, high-quality voice conversion tool. 615
Python Passive Acoustic Analysis tool for Passive Acoustic Monitoring (PAM) 504
AI outbound voice agent framework 471
denoising methods used in animal vocalization denoising 460
Speechless repo for sales call analysis 444
Realtime-распознаватель речи на базе Vosk: управление микрофоном, детекция уровн... 409
TFM 375
Speechless repo for sales call analysis 363
Audio Base App and Framework 350
Python input audio. 338
A Python library for applying information theory and AI/ML models to animal comm... 303
Audio processing 275
Vaani is an open-source, AI-powered speech-to-text desktop app. Vaani (वाणी) ref... 271
Python Implementation for Handling Twilio Phone Calls 205
voice prompts, meeting minutes, voice journaling and more directly from your ter... 189
A diarization package 158
Whisper 및 ECAPA-TDNN 기반의 실차 화자 식별 및 노이즈 보정 라이브러리 147
Comprehensive bidirectional voice-text CLI tool with Whisper and VibeVoice 144
Real-Time Voice Conversion GUI 143
Deep Audio Segmenter, unsupervised 140
SonosphereAI is an AI-driven music creation suite that turns ideas into polished... 137
Private chat with local GPT with document, images, video, etc. 100% private, Apa... 136
Audio noise reduction and enhancement library with multiple engines 113
CNN network for speech onset time (SOT) detection of Mandarin speech 111
A Python library for Persian text-to-speech using Microsoft Azure service. 102
AI Voice Detection System 91
A cross-platform utility for capturing live audio from a microphone using FFmpeg... 81
catshand: a toolbox for podcast editing 77
A Python library for Persian text-to-speech using Microsoft Azure service. 66
11
Deep Audio Segmenter, unsupervised 4