Dependents of soundfile

1,220 dependents

Package	Description	Downloads/month
sglang	SGLang is a high-performance serving framework for large language models and mul...	287.7M
librosa	Python library for audio and music analysis	9.7M
speechbrain	All-in-one speech toolkit in pure Python and Pytorch	1.6M
open-webui	Open WebUI	1.3M
aider-chat	aider is AI pair programming in your terminal	864K
lhotse	Data preparation for speech processing models training.	688K
vllm-omni	A framework for efficient model inference with omni-modality models	477K
genai-perf	GenAI Perf Analyzer CLI - CLI tool to simplify profiling LLMs and Generative AI ...	366K
audio-separator	Easy to use stem (e.g. instrumental/vocals) separation from CLI or as a python p...	364K
funasr	A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrai...	357K
coqui-tts	🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and p...	292K
aiperf	AIPerf is a package for performance testing of AI models	285K
wfdb	Native Python WFDB package	212K
qwen-tts	Qwen-TTS python package	209K
tts	🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and p...	202K
realtimestt	A robust, efficient, low-latency speech-to-text library with advanced voice acti...	177K
laion-clap	Contrastive Language-Audio Pretraining	175K
resemble-perth	Open Audio Watermarking Tool	157K
trainer	🐸 - A general purpose model trainer, as flexible as it gets	135K
qwen-asr	Qwen-ASR python package	127K
voxcpm	VoxCPM2: Tokenizer-Free TTS for Multilingual Speech Generation, Creative Voice D...	122K
f5-tts	Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech wi...	104K
bithuman	Real-time avatar engine — 100+ FPS on CPU. Generate lip-synced video, stream liv...	104K
omnivoice	High-Quality Voice Cloning TTS for 600+ Languages	103K
sprite-ai	Sprite AI is an AI companion for your desktop	83K
audiofile	Handling audio files in Python	72K
audiolab	A streaming audio reader, processor, and writer built on top of soundfile, and P...	66K
dreadnode	Dreadnode Strikes SDK	59K
bigvgan		56K
gamesentenceminer	An immersion toolkit for learning Languages through games and other visual media...	50K
pyrubberband	python wrapper for rubberband	45K
rvc-python	Using RVC via console or python scripts	39K
vmlx	vMLX - Home of JANG_Q - Cont Batch, Prefix, Paged, KV Cache Quant, VL - Powers M...	36K
espnet	End-to-End Speech Processing Toolkit	30K
not1mm	Not1MM != N1MM, An amateur radio contest logger for Linux.	25K
endoreg-db	endoreg-db	25K
psychopy	For running psychology and neuroscience experiments	22K
agentmake	AgentMake AI: a kit for developing agentic AI applications that support 24 AI ba...	22K
whisper-live	A nearly-live implementation of OpenAI's Whisper.	20K
pluto-ml-nightly	Pluto ML - Machine Learning Operations Framework	18K
scitex-audio	Text-to-Speech with Multiple Backend Fallback (elevenlabs → luxtts → gtts → pytt...	17K
minimax-mcp	Minimax MCP Server	17K
restai-core	RESTAI, so many 'A's and 'I's, so little time...	17K
agentcrew-ai	Chat application with multi-agents system supports multi-models and MCP	16K
elevenlabs-mcp	ElevenLabs MCP Server	16K
tensorrt-llm	TensorRT LLM provides users with an easy-to-use Python API to define Large Langu...	16K
faster-qwen3-tts	Real-time text-to-speech with Qwen3-TTS	15K
vox-voxtral	Mistral Voxtral STT/TTS adapter for Vox	15K
octoai-sdk	A runtime library for OctoAI.	15K
omnivad	OmniVAD — Cross-platform Voice Activity Detection and Audio Event Detection (bas...	14K