Faster Whisper Python Packages

speechlib

Speechlib is a library that unifies speaker diarization, transcription and speaker recognition in a single pipeline to create transcripts for audio conversations with actual speaker names and time tags. This library also contains audio preprocessor functions.

3K 258 27

edgevox

Offline voice agent framework for robots.

3K 4 0

stream-translator-gpt

A stream-translator fork with VAD based audio slicing & GPT / Gemini translation.

3K 207 29

tafrigh

تفريغ النصوص وإنشاء ملفات SRT و VTT باستخدام نماذج Whisper وتقنية wit.ai.

2K 141 18

voicepad-core

Core audio processing for Voicepad.

2K 0 0

voicepad

Command-line interface for voice recording and GPU-accelerated transcription.

2K 0 0

openlrc

Transcribe and translate voice into LRC file using Whisper and LLMs (GPT, Claude, et,al). 使用whisper和LLM(GPT，Claude等)来转录、翻译你的音频为字幕文件。

1K 654 49

whisper-run

Faster Whisper with Speaker Diarization

755 9 1

casts-down

Cross-platform CLI for downloading and transcribing podcasts with local Whisper speech-to-text

558 2 0

whisper-autosrt

a command line utility for automatic speech recognition and subtitle generation

553 29 3

stttui

A local, fully offline speech-to-text terminal UI using your CPU

546 0 0

speech-mine

Turn audio into searchable, speaker-labeled transcripts. Built for iterative pipelines, not one-shot runs.

539 0 2

hns

hns is a speech-to-text CLI tool to transcribe your voice from your microphone directly to clipboard. Integrate hns with Claude Code, Ollama, LLM, and more CLI tools for powerful workflows.

449 100 14

whisper-streaming

Providing easy-to-use and extensible STT (Speech-To-Text) implementation with Whisper-like ASR (Automatic Speech Recognition) models.

320 0 1

aavt

A Python package for Video Translate and Some Tools for Video

283 3K 242

bmnspeechlib

208 258 27

whisper-telegram-mcp

Two-way voice for Claude via Telegram — Whisper transcription + Kokoro/OpenAI TTS as an MCP server

177 0 0

asrbench-cli

A command-line tool for the ASRBench framework, simplifying audio transcription system benchmarking with a single config file, supporting popular and custom transcription systems

132 0 0

aavt111

这是一个全自动（音频）视频翻译项目。利用Whisper识别声音，AI大模型翻译字幕，最后合并字幕视频，生成翻译后的视频。

5 3K 242

Search Packages