12 dependents
| Package | Description | Downloads/month |
|---|---|---|
| 3K | ||
| A package for audio transcription and speaker diarization using Whisper and NeMo... | 2K | |
| Omni SenseVoice: High-Speed Speech Recognition with words timestamps 🗣️🎯 | 1K | |
| 930 | ||
| Torch Audio Forced Aligner for Mixed Chinese (Mandarin or Cantonese) and English... | 548 | |
| minimal deep learning framework | 420 | |
| PDF processing pipeline: remove headers/footers, convert to markdown, and genera... | 410 | |
| Production-ready transcription and diarization pipeline with parallel processing | 286 | |
| An open source implementation of Microsoft's VALL-E X zero-shot TTS | 245 | |
| Indic Conformer ASR Lib | 195 | |
| LuxTTS MLX port for fast Apple Silicon inference. | 168 | |
| A Python package for speech transcription and speaker diarization with speaker m... | 164 |