Dependents of openai-whisper

275 dependents

Package	Description	Downloads/month
vllm-omni	A framework for efficient model inference with omni-modality models	477K
khoj	Your AI second brain. Self-hostable. Get answers from the web or your docs. Buil...	61K
stable-ts	Transcription, forced alignment, and audio indexing with OpenAI's Whisper	55K
whisper-timestamped	Multilingual Automatic Speech Recognition with word-level timestamps and confide...	49K
abstract-hugpy	A batteries-included bridge between your abstract_* ecosystem and popular Huggin...	38K
whisper-live	A nearly-live implementation of OpenAI's Whisper.	20K
spotify-translator	Generate lyric translations and transcriptions from Spotify URLs using OpenAI's ...	18K
rgwml	Manipulate data with code that is less a golden retriever, and more a Samurai's ...	15K
batchalign	Python Speech Language Sample Analysis	10K
batchalignhk	Python Speech Language Sample Analysis	8K
outetts	Interface for OuteTTS models.	7K
utilities-nlp		7K
freegenius	FreeGenius AI, an advanced AI assistant that is capable of engaging in conversat...	7K
vidchain	✅A Lightweight Video RAG Framework for Multimodal Reasoning	5K
gailbot	GailBot API	5K
buzz-captions		3K
speechlib	Speechlib is a library that unifies speaker diarization, transcription and speak...	3K
hamel	General Utilities	3K
stream-translator-gpt	A stream-translator fork with VAD based audio slicing & GPT / Gemini translation...	3K
vox-box	Vox box	3K
social-research-probe	Evidence-first social-media research CLI + Claude Code skill	3K
tts-webui-valle-x	An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo ...	2K
blindai	Confidential AI deployment with secure enclaves :lock:	2K
wa-transcriber	Automatically transcribe WhatsApp voice notes to your clipboard using OpenAI Whi...	2K
clipify	🎥 Turn one long video into 10 viral clips – 10x faster! 🚀 Make your content shar...	2K
kabigon		2K
clip2context	Extract frames and transcripts from video files for LLM context and multimodal p...	2K
s2t	Speech to Text (s2t): Record audio, run Whisper, export formats, and copy transc...	2K
iiiflow	Intergrating Arclight with Digital Content, IIIF, and ArchivesSpace	2K
audio-transcode-watcher	Watch a source folder and automatically transcode audio files to multiple format...	2K
onyx-ai-voice	Comprehensive STT and TTS Voice Engine for ONYX platform	2K
transcriber	A simple tool to transcribe audio files	2K
whisper-ui	A simple GUI to process a small number of audio files using OpenAI's Whisper mod...	1K
whisper-mic	Whisper for your microphone	1K
maxs	minimalist ai agent	1K
whisptray	Enter text using your voice.	1K
fish-audio-preprocess	Preprocess audio data	1K
voxtream	VoXtream is a Full-Stream Zero-shot TTS model with Extremely Low Latency and Spe...	1K
transcribetools	Transcribe and/ot translate all soundfiles in a folder using Whisper	1K
openartemis	Artemis CLI - AI-powered research and development tool	1K
microclaw-streams	Push-to-talk voice conversations powered by Whisper and Claude Code	1K
hspylib-askai	HomeSetup - AskAI	1K
str2speech	An easy-to-use library and command-line tool for TTS	1K
wenbi	A simple tool to make the video, audio, subtitle and video-url (especially youtu...	1K
gg-daigua	方便的工具	999
whisper-lm-transformers	Add language model support to HF Transformers' Whisper models	980
opendatagen	Data preparation system to build controllable AI system	929
cued-speech	Cued Speech Processing Tools - Decode and Generate cued speech videos	924
marketingtool	A tool module to help you do marketing	907
whisperflow	WhisperFlow: Real-Time Transcription Powered by OpenAI Whisper	878