40 dependents
Package Description Downloads/month
Qwen-TTS python package 209K
Open Audio Watermarking Tool 157K
Qwen-ASR python package 127K
Manim plugin for all things voiceover 7K
Framework for building deep neural network models for sound, speech, and voice A... 7K
Use machine learning to create art and music 5K
SONAR, a new multilingual and multimodal fixed-size sentence embedding space, wi... 5K
Ultimate RVC 4K
A simple yet effective Audio-to-Midi Automatic Piano Transcription system 3K
ForceAlign is a Python library for forced alignment of English text to English a... 1K
Python wrapper for inference with rvc 769
Collection of Neural Modules for Speech Recognition 698
Emotions recognition from audio and text files (only russian language) 635
🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT ... 608
Use machine learning to create art and music 604
A library for soundscape synthesis and augmentation - extension to add speech, d... 530
Manim plugin for recorder 481
Manim plugin for all things voiceover (fork with updated ElevenLabs API) 461
PDF processing pipeline: remove headers/footers, convert to markdown, and genera... 410
Augments audio for machine learning 387
This is a package to handle various kinds of data in a unified way with Pytorch. 371
338
BRAIN - a tool to Build projects and manage Resources for AI Newbies 286
Production-ready transcription and diarization pipeline with parallel processing 286
vCon conversational data container manipulation package 274
A DeepSpeech-based transcriber using DeepSegment to separate sentences in a long... 246
Indic Conformer ASR Lib 195
easily download and merge split online audiobooks 186
Stable diffusion for real-time music generation. 186
🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT ... 179
Provides training, inference and voice conversion recipes for RADTTS and RADTTS+... 176
manim-voiceover fixed for elevenlabs dependency 157
VoiceStudio: A unified toolkit for text-style prompted speech synthesis, voice a... 119
create dream-sequences from your video browsing history 117
106
Spoof-Aware Speaker Verification System 103
Manim plugin for all things voiceover 96
A simple RVC Inference Python wrapper. 73
Manim Onvoice Termux for Manim 65
Python wrapper for simple inference with rvc v2 1