42 dependents
| Package | Description | Downloads/month |
|---|---|---|
| 7K | ||
| Ultimate RVC | 4K | |
| Software Decoder for raw rf captures of laserdisc, vhs and other analog video fo... | 4K | |
| Open source, scalable acoustic data analysis for ecology and conservation | 3K | |
| Convert Ebooks to Audiobooks with [custom] voice samples | 2K | |
| Detect silence segment from speech signal. | 2K | |
| AI outbound voice agent framework | 1K | |
| 1K | ||
| ArtBox is a tool set for handling multimedia files. | 993 | |
| Classification of Activities of Daily Living(ADL) using depth videos and audio | 701 | |
| VAD-driven streaming voice dictation for macOS — local Whisper ASR + Silero VAD ... | 683 | |
| A simple, high-quality voice conversion tool. | 615 | |
| Python Passive Acoustic Analysis tool for Passive Acoustic Monitoring (PAM) | 504 | |
| AI outbound voice agent framework | 471 | |
| denoising methods used in animal vocalization denoising | 460 | |
| Speechless repo for sales call analysis | 444 | |
| Realtime-распознаватель речи на базе Vosk: управление микрофоном, детекция уровн... | 409 | |
| TFM | 375 | |
| Speechless repo for sales call analysis | 363 | |
| Audio Base App and Framework | 350 | |
| Python input audio. | 338 | |
| A Python library for applying information theory and AI/ML models to animal comm... | 303 | |
| Audio processing | 275 | |
| Vaani is an open-source, AI-powered speech-to-text desktop app. Vaani (वाणी) ref... | 271 | |
| Python Implementation for Handling Twilio Phone Calls | 205 | |
| voice prompts, meeting minutes, voice journaling and more directly from your ter... | 189 | |
| A diarization package | 158 | |
| Whisper 및 ECAPA-TDNN 기반의 실차 화자 식별 및 노이즈 보정 라이브러리 | 147 | |
| Comprehensive bidirectional voice-text CLI tool with Whisper and VibeVoice | 144 | |
| Real-Time Voice Conversion GUI | 143 | |
| Deep Audio Segmenter, unsupervised | 140 | |
| SonosphereAI is an AI-driven music creation suite that turns ideas into polished... | 137 | |
| Private chat with local GPT with document, images, video, etc. 100% private, Apa... | 136 | |
| Audio noise reduction and enhancement library with multiple engines | 113 | |
| CNN network for speech onset time (SOT) detection of Mandarin speech | 111 | |
| A Python library for Persian text-to-speech using Microsoft Azure service. | 102 | |
| AI Voice Detection System | 91 | |
| A cross-platform utility for capturing live audio from a microphone using FFmpeg... | 81 | |
| catshand: a toolbox for podcast editing | 77 | |
| A Python library for Persian text-to-speech using Microsoft Azure service. | 66 | |
| 11 | ||
| Deep Audio Segmenter, unsupervised | 4 |