Dependents of soxr - PyPI Stats

82 dependents

Package	Description	Downloads/month
librosa	Python library for audio and music analysis	9.7M
pipecat-ai	Open Source framework for voice and multimodal conversational AI	677K
audiomentations	A Python library for audio data augmentation. Useful for making audio ML models ...	178K
bithuman	Real-time avatar engine — 100+ FPS on CPU. Generate lip-synced video, stream liv...	104K
dv-pipecat-ai	Open Source framework for voice and multimodal conversational AI	11K
beat-this	Accurate and general beat tracker	11K
ubo-app	Ubo main app, running on device initialization. A platform for running other app...	8K
vox-runtime	Universal local runtime for STT and TTS models	6K
fasr	FASR: Fast Automatic Speech Recognition Pipeline	4K
ultimate-rvc	Ultimate RVC	4K
vhs-decode	Software Decoder for raw rf captures of laserdisc, vhs and other analog video fo...	4K
pyclarity	Clarity Challenge toolkit - software for building Clarity Challenge systems	3K
transkun	A simple yet effective Audio-to-Midi Automatic Piano Transcription system	3K
renumics-spotlight	Visualize and maintain datasets to develop and understand data-driven algorithms...	3K
bacpipe	Use bacpipe to streamline the process of generating embeddings and analysing you...	3K
whisper-key-local	Global hotkeys to record speech and transcribe directly to your cursor	2K
dorothy-cci	A Creative Computing Python Library for Interactive Audio Generation and Audio R...	2K
maelzel	A framework for computer music in python	2K
discophon	The Phoneme Discovery Benchmark	2K
deepfense	DeepFense: A Unified, Modular, and Extensible Framework for Robust Deepfake Audi...	1K
lfeats	A unified interface to extract hidden representations from speech foundation mod...	1K
illuminat	Illuminat: Revolutionizing Education through Personalization	1K
piopiy-ai	Build 📞 Telephonic-Grade Voice AI — 🌐 WebRTC-Ready Framework	1K
genie-tts	GPT-SoVITS ONNX Inference Engine & Model Converter	1K
str2speech	An easy-to-use library and command-line tool for TTS	1K
easy-audio-interfaces	Easy Audio Interfaces is a Python library that provides a simple and flexible wa...	1K
mimikit	Music Modeling Kit	966
osekit	OSEkit	889
minidic	Tiny macOS dictation tool on your menubar	762
wav2textgrid	A python forced alignment package	641
voxcpmane	VoxCPM TTS model with Apple Neural Engine backend server	641
audio-metrics	Metrics to measure the quality of audio	600
dnn-tts-torch	This is a library consisting of pre-trained models for the synthesis of Russian ...	532
backdoormbti	BackdoorMBTI is an open source project expanding the unimodal backdoor learning ...	531
ultrastar-score	Score UltraStar karaoke files against vocal audio using Vocaluxe pitch detection...	484
audio-transcript-mcp	Real-time audio transcription MCP server for Claude Code	483
mt3-infer	Unified, inference-only toolkit for MT3 model family (Magenta MT3, MR-MT3, MT3-P...	481
pysilero	Python Wrapper of Silero VAD	476
mlx-voxtral	Voxtral audio processing and model implementation for Apple Silicon using MLX	438
lunavox-tts	GPT-SoVITS ONNX Inference Engine & Model Converter	426
playbacker	Live music performance playback	416
multimodal-parsers	PDF processing pipeline: remove headers/footers, convert to markdown, and genera...	410
chinaunicom-ai	山东联通产互AI工具箱	390
audioevals	Effective evaluations for Text-to-Speech (TTS) systems	390
python-dataset		370
sox-tensorflow	tensorflow generation of SOX-style spectrograms on the GPU	359
fftrack	FFTrack is a Python-based music recognition tool that allows users to identify s...	341
robobo-emotion	Librería para detectar emociones en imágenes y audio usando Robobo	321
flashtts	基于SparkTTS、OrpheusTTS等模型，提供高质量中文语音合成与声音克隆服务。	303
chatter-pkg	A Python library for applying information theory and AI/ML models to animal comm...	303