40 dependents
| Package | Description | Downloads/month |
|---|---|---|
| Qwen-TTS python package | 209K | |
| Open Audio Watermarking Tool | 157K | |
| Qwen-ASR python package | 127K | |
| Manim plugin for all things voiceover | 7K | |
| Framework for building deep neural network models for sound, speech, and voice A... | 7K | |
| Use machine learning to create art and music | 5K | |
| SONAR, a new multilingual and multimodal fixed-size sentence embedding space, wi... | 5K | |
| Ultimate RVC | 4K | |
| A simple yet effective Audio-to-Midi Automatic Piano Transcription system | 3K | |
| ForceAlign is a Python library for forced alignment of English text to English a... | 1K | |
| Python wrapper for inference with rvc | 769 | |
| Collection of Neural Modules for Speech Recognition | 698 | |
| Emotions recognition from audio and text files (only russian language) | 635 | |
| 🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT ... | 608 | |
| Use machine learning to create art and music | 604 | |
| A library for soundscape synthesis and augmentation - extension to add speech, d... | 530 | |
| Manim plugin for recorder | 481 | |
| Manim plugin for all things voiceover (fork with updated ElevenLabs API) | 461 | |
| PDF processing pipeline: remove headers/footers, convert to markdown, and genera... | 410 | |
| Augments audio for machine learning | 387 | |
| This is a package to handle various kinds of data in a unified way with Pytorch. | 371 | |
| 338 | ||
| BRAIN - a tool to Build projects and manage Resources for AI Newbies | 286 | |
| Production-ready transcription and diarization pipeline with parallel processing | 286 | |
| vCon conversational data container manipulation package | 274 | |
| A DeepSpeech-based transcriber using DeepSegment to separate sentences in a long... | 246 | |
| Indic Conformer ASR Lib | 195 | |
| easily download and merge split online audiobooks | 186 | |
| Stable diffusion for real-time music generation. | 186 | |
| 🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT ... | 179 | |
| Provides training, inference and voice conversion recipes for RADTTS and RADTTS+... | 176 | |
| manim-voiceover fixed for elevenlabs dependency | 157 | |
| VoiceStudio: A unified toolkit for text-style prompted speech synthesis, voice a... | 119 | |
| create dream-sequences from your video browsing history | 117 | |
| 106 | ||
| Spoof-Aware Speaker Verification System | 103 | |
| Manim plugin for all things voiceover | 96 | |
| A simple RVC Inference Python wrapper. | 73 | |
| Manim Onvoice Termux for Manim | 65 | |
| Python wrapper for simple inference with rvc v2 | 1 |