12 dependents
Package Description Downloads/month
3K
A package for audio transcription and speaker diarization using Whisper and NeMo... 2K
Omni SenseVoice: High-Speed Speech Recognition with words timestamps 🗣️🎯 1K
930
Torch Audio Forced Aligner for Mixed Chinese (Mandarin or Cantonese) and English... 548
minimal deep learning framework 420
PDF processing pipeline: remove headers/footers, convert to markdown, and genera... 410
Production-ready transcription and diarization pipeline with parallel processing 286
An open source implementation of Microsoft's VALL-E X zero-shot TTS 245
Indic Conformer ASR Lib 195
LuxTTS MLX port for fast Apple Silicon inference. 168
A Python package for speech transcription and speaker diarization with speaker m... 164