36 dependents
| Package | Description | Downloads/month |
|---|---|---|
| Tools for building wake-word and speech-command datasets and models. | 7K | |
| 6K | ||
| 3K | ||
| Deep Learning based variant calling toolkit - https://clara-parabricks.github.io... | 3K | |
| Push-to-talk transcription | 2K | |
| NeMo ASR export to NNEF via torch-to-nnef. | 2K | |
| A package for audio transcription and speaker diarization using Whisper and NeMo... | 2K | |
| ('NeMo Model => Riva Deployment Converter',) | 1K | |
| Speech processing templates and pipelines for transcription, speaker diarization... | 910 | |
| Collection of Neural Modules for Speech Recognition | 698 | |
| A nemo stt plugin for OVOS | 659 | |
| A user-friendly package for Thai speech recognition using the Typhoon ASR model. | 637 | |
| Simple, powerful streaming transcription for Python using NVIDIA's Parakeet TDT | 623 | |
| Constraint-aware audio resynthesis and distillation pipeline. | 505 | |
| Speechless repo for sales call analysis | 444 | |
| PyPi package for KaniTTS-2 model | 432 | |
| Text-to-speech using neural audio codec and causal language models | 414 | |
| PDF processing pipeline: remove headers/footers, convert to markdown, and genera... | 410 | |
| BioNeMo Large Language Model Components using NeMo and Megatron | 347 | |
| Production-ready transcription and diarization pipeline with parallel processing | 286 | |
| Mono repo with support for speech processing sinapsis templates | 255 | |
| A scalable generative AI framework built for researchers and developers working ... | 254 | |
| Maivi - My AI Voice Input: Real-time voice-to-text local on cpu better than whis... | 252 | |
| A library to standardize the usage of various machine learning models | 214 | |
| Bangla Speech to Text & Text to Speech. | 195 | |
| Scalable Data Preprocessing Tool for Training Large Language Models | 185 | |
| 9jaLingo TTS-2: Text-to-Speech for Nigerian Languages — English (Nigerian Accent... | 172 | |
| A simple, developer-friendly Python package for creating AI workflows | 133 | |
| The realtime communication library for Python - fastrtc with Nvidia's Canary STT | 113 | |
| deepponies tts plugin for OpenVoiceOS | 105 | |
| WavLM based diarization with MSDD | 101 | |
| ASR based on NVIDIA parakeet model | 100 | |
| A scalable generative AI framework built for researchers and developers working ... | 82 | |
| Scalable data pre processing and curation toolkit for LLMs | 71 | |
| Open source - Voice AI | 19 | |
| Scalable Data Preprocessing Tool for Training Large Language Models | 1 |