Dependents of sox - PyPI Stats

40 dependents

Package	Description	Downloads/month
qwen-tts	Qwen-TTS python package	209K
resemble-perth	Open Audio Watermarking Tool	157K
qwen-asr	Qwen-ASR python package	127K
manim-voiceover	Manim plugin for all things voiceover	7K
sonusai	Framework for building deep neural network models for sound, speech, and voice A...	7K
magenta	Use machine learning to create art and music	5K
sonar-space	SONAR, a new multilingual and multimodal fixed-size sentence embedding space, wi...	5K
ultimate-rvc	Ultimate RVC	4K
transkun	A simple yet effective Audio-to-Midi Automatic Piano Transcription system	3K
forcealign	ForceAlign is a Python library for forced alignment of English text to English a...	1K
rvc-infer	Python wrapper for inference with rvc	769
nemo-asr	Collection of Neural Modules for Speech Recognition	698
aniemore	Emotions recognition from audio and text files (only Russian language)	635
iara-stt-training	🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT ...	608
magenta-gpu	Use machine learning to create art and music	604
dscaper	A library for soundscape synthesis and augmentation - extension to add speech, d...	530
manim-recorder	Manim plugin for recorder	481
manim-voiceover-plus	Manim plugin for all things voiceover (fork with updated ElevenLabs API)	461
multimodal-parsers	PDF processing pipeline: remove headers/footers, convert to markdown, and genera...	410
audaugio	Augments audio for machine learning	387
torchdataset	This is a package to handle various kinds of data in a unified way with Pytorch.	371
tts-middleware		338
brain-ai	BRAIN - a tool to Build projects and manage Resources for AI Newbies	286
whisperx-nemo-pipeline	Production-ready transcription and diarization pipeline with parallel processing	286
python-vcon	vCon conversational data container manipulation package	274
lecture-transcriber	A DeepSpeech-based transcriber using DeepSegment to separate sentences in a long...	246
shruti	Indic Conformer ASR Lib	195
soundbook	easily download and merge split online audiobooks	186
riffusion	Stable diffusion for real-time music generation.	186
iarahealth-stt-training	🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT ...	179
radtts	Provides training, inference and voice conversion recipes for RADTTS and RADTTS+...	176
manim-voiceover-fixed	manim-voiceover fixed for elevenlabs dependency	157
voicestudio	VoiceStudio: A unified toolkit for text-style prompted speech synthesis, voice a...	119
tubedreams	create dream-sequences from your video browsing history	117
voicegen		106
sonic-cipher	Spoof-Aware Speaker Verification System	103
manim-voiceover-ai	Manim plugin for all things voiceover	96
srvc	A simple RVC Inference Python wrapper.	73
manim-onvoice	Manim Onvoice Termux for Manim	65
pyt-rvc-infer	Python wrapper for simple inference with rvc v2	1