Dependents of pyloudnorm

57 dependents

Package	Description	Downloads/month
pipecat-ai	Open Source framework for voice and multimodal conversational AI	689K
resemble-perth	Open Audio Watermarking Tool	158K
chatterbox-tts	SoTA open-source TTS	98K
achatbot	An open source chat bot architecture for voice/vision (and multimodal) assistant...	14K
dv-pipecat-ai	Open Source framework for voice and multimodal conversational AI	12K
descript-audiotools-unofficial	Utilities for handling audio.	6K
pytimbre	Python conversion of Timbre Toolbox	6K
seed-vc	zero-shot voice conversion & singing voice conversion, with real-time support	5K
gailbot	GailBot API	4K
pyclarity	Clarity Challenge toolkit - software for building Clarity Challenge systems	4K
paddlespeech	Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Stream...	3K
ayase	Modular media quality metrics toolkit.	3K
palabra-ai	Python SDK for Palabra AI's real-time speech-to-speech translation API. Break do...	2K
ableton-for-ai	Ableton for AI is the bridge between your DAW and AI models. It makes Ableton pr...	2K
refmatch	AI-powered reference track suggestions for music producers	2K
revoxx	Speech Recording Application	2K
py-speech-gen	A Python library for generating synthetic speech datasets using TTS providers.	2K
fish-audio-preprocess	Preprocess audio data	1K
realtime-client		1K
piopiy-ai	Build 📞 Telephonic-Grade Voice AI — 🌐 WebRTC-Ready Framework	1K
mlx-audio-plus	Python tools for text to speech (TTS), speech to text (STT), and speech to speec...	1K
stemgen	Stemgen is a Stem file generator. Convert any track into a stem and have fun wit...	1K
auralis	This is a faster implementation for TTS models, to be used in highly async envir...	1K
gtech-ariel	Google EMEA gTech Ads Data Science Team's solution to automatically translate an...	1K
outspeed		1K
str2speech	An easy-to-use library and command-line tool for TTS	1K
audiozen	Audio ZEN is a library for audio/speech signal processing.	809
psyaitools	Loudness added.	648
audio-metrics	Metrics to measure the quality of audio	619
x-voice	X-Voice	591
dscaper	A library for soundscape synthesis and augmentation - extension to add speech, d...	531
fast-flashtalk	fast SoulX-FlashTalk for RTX 4090	466
multimodal-parsers	PDF processing pipeline: remove headers/footers, convert to markdown, and genera...	414
acids-dataset	data parsing / loading facility for handy audio machine learning. powered by lmd...	383
umik-base-app	Audio Base App and Framework	375
chatterbox-ng	Chatterbox: Open Source TTS and Voice Conversion by Resemble AI	370
reaper-mcp-server	A comprehensive Model Context Protocol (MCP) server that enables AI agents to cr...	360
seavad	SeaVAD: Voice Activity Detection module with silero and state machine.	329
phonepod	Local AI audio restoration. Phone recording to podcast quality. Zero cloud.	324
flashtts	Provides high-quality Chinese speech synthesis and voice cloning services based on models such as SparkTTS and OrpheusTTS.	323
whisperx-nemo-pipeline	Production-ready transcription and diarization pipeline with parallel processing	302
voxid	Voice Identity Management Platform	290
sounddiff	Structured audio comparison for producers and developers. Think git diff, but fo...	286
wavtoolkit	File operations and other tools for working with .WAV files	265
mixref	CLI Audio Analyzer for Music Producers - DnB, Techno, House	217
audioinfo-ecrit		210
shruti	Indic Conformer ASR Lib	201
pyneuralfx	A python package for neural audio effect	190
songmatch	Split to stems, match to reference, recombine and match as a whole.	167
pear-pipecat	Open Source framework for voice and multimodal conversational AI	161