36 dependents
Package Description Downloads/month
Tools for building wake-word and speech-command datasets and models. 7K
6K
3K
Deep Learning based variant calling toolkit - https://clara-parabricks.github.io... 3K
Push-to-talk transcription 2K
NeMo ASR export to NNEF via torch-to-nnef. 2K
A package for audio transcription and speaker diarization using Whisper and NeMo... 2K
NeMo Model => Riva Deployment Converter 1K
Speech processing templates and pipelines for transcription, speaker diarization... 910
Collection of Neural Modules for Speech Recognition 698
A nemo stt plugin for OVOS 659
A user-friendly package for Thai speech recognition using the Typhoon ASR model. 637
Simple, powerful streaming transcription for Python using NVIDIA's Parakeet TDT 623
Constraint-aware audio resynthesis and distillation pipeline. 505
Speechless repo for sales call analysis 444
PyPi package for KaniTTS-2 model 432
Text-to-speech using neural audio codec and causal language models 414
PDF processing pipeline: remove headers/footers, convert to markdown, and genera... 410
BioNeMo Large Language Model Components using NeMo and Megatron 347
Production-ready transcription and diarization pipeline with parallel processing 286
Mono repo with support for speech processing sinapsis templates 255
A scalable generative AI framework built for researchers and developers working ... 254
Maivi - My AI Voice Input: Real-time voice-to-text local on cpu better than whis... 252
A library to standardize the usage of various machine learning models 214
Bangla Speech to Text & Text to Speech. 195
Scalable Data Preprocessing Tool for Training Large Language Models 185
9jaLingo TTS-2: Text-to-Speech for Nigerian Languages — English (Nigerian Accent... 172
A simple, developer-friendly Python package for creating AI workflows 133
The realtime communication library for Python - fastrtc with Nvidia's Canary STT 113
deepponies tts plugin for OpenVoiceOS 105
WavLM based diarization with MSDD 101
ASR based on NVIDIA parakeet model 100
A scalable generative AI framework built for researchers and developers working ... 82
Scalable data pre processing and curation toolkit for LLMs 71
Open source - Voice AI 19
Scalable Data Preprocessing Tool for Training Large Language Models 1