59 dependents
| Package | Description | Downloads/month |
|---|---|---|
| Faster Whisper transcription with CTranslate2 | 7.4M | |
| WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarizatio... | 1.1M | |
| Whisper command line client compatible with original OpenAI client based on CTra... | 38K | |
| Open Source Neural Machine Translation and (Large) Language Models in PyTorch | 23K | |
| Open-source offline translation library written in Python | 10K | |
| Fast, multimodal context for agents. | 6K | |
| A translator wrapper for open-source translation models with batch optimization.... | 4K | |
| 3K | ||
| Offline screen translator for Japanese retro games | 3K | |
| GPU-accelerated AI Subtitle Sync | 3K | |
| dgenerate is a scriptable command line tool (and library) for generating images ... | 2K | |
| Explore large language models in 512MB of RAM | 2K | |
| Global hotkeys to record speech and transcribe directly to your cursor | 2K | |
| Speech recognition with accurate word-level timestamps. | 2K | |
| A package for audio transcription and speaker diarization using Whisper and NeMo... | 2K | |
| Transcribe and translate voice into LRC file using Whisper and LLMs (GPT, Claude... | 1K | |
| Connecting Transfromers on HuggingfaceHub with CTranslate2. | 1K | |
| Open dubbing is an AI dubbing system which uses machine learning models to autom... | 1K | |
| No Language Left Behind | 1K | |
| Machine learning models optimized for robotics experimentation and deployment | 1K | |
| Natural Language Processing by the Exciton Research | 1K | |
| Google EMEA gTech Ads Data Science Team's solution to automatically translate an... | 1K | |
| Flavored fork of m-bain/WhisperX for LeGen better experience | 929 | |
| An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple I... | 916 | |
| Pipeline for querying and turning NASA's ADS publications metadata into curated,... | 890 | |
| A high-performance translation library using CTTranslate2 and vLLM. | 771 | |
| Open language modeling toolkit based on PyTorch | 752 | |
| Voice AI Platform - Local STT/TTS with Chinese Language Support | 711 | |
| User friendly toolkit for generating immersion language learning tools including... | 704 | |
| Modern speech recognition with word-level timestamps and speaker diarization. Fo... | 687 | |
| Translation models for 22 scheduled languages of India | 665 | |
| X-Voice | 571 | |
| A CLI tool for automatically translating manga pages from Japanese to English. D... | 528 | |
| Faster Whisper transcription with CTranslate2 | 514 | |
| Faster-whisper ASR tools for SonusAI | 501 | |
| Faster Whisper transcription with CTranslate2 | 418 | |
| Faster Whisper transcription with CTranslate2 with Live Capabilities for Edge De... | 399 | |
| ... | 390 | |
| Shunya Labs speech transcription package with ct2 and transformers backends | 385 | |
| Open-source neural machine translation library based on OpenNMT's CTranslate2 | 382 | |
| Faster Whisper transcription with CTranslate2 | 378 | |
| Simple and easy to use realtime speech to text | 353 | |
| A compatibility fix to for whisperx for use with gogadget | 310 | |
| 音声ファイルをテキストに変換するCLIツール | 302 | |
| Production-ready transcription and diarization pipeline with parallel processing | 286 | |
| Faster Whisper ASR transcription with CTranslate2 | 277 | |
| WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarizatio... | 259 | |
| A standalone service for transcribing audio files using WhisperX | 224 | |
| WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarizatio... | 200 | |
| Automatic Speech Recognition (ASR) SDK for Nigerian languages Yoruba, Igbo, Haus... | 165 |