[NeurIPS 2025] PyTorch implementation of [ThinkSound], a unified framework for generating audio from any modality, guided by Chain-of-Thought (CoT) reasoning.
Compact video-to-audio conversion tool with built-in YouTube video/audio download functionality.
Extensible framework for comparing methods of converting video to audio