PyPI Stats
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About
Home

Search Packages

Find Python packages by name, description, GitHub topic, or filter by metrics
NavodPeiris
speechlib

Speechlib is a library that unifies speaker diarization, transcription and speaker recognition in a single pipeline to create transcripts for audio conversations with actual speaker names and time tags. This library also contains audio preprocessor functions.

3K 258 27
nrl-ai
edgevox

Offline voice agent framework for robots.

3K 4 0
ionic-bond
stream-translator-gpt

A stream-translator fork with VAD based audio slicing & GPT / Gemini translation.

3K 207 29
ieasybooks
tafrigh

تفريغ النصوص وإنشاء ملفات SRT و VTT باستخدام نماذج Whisper وتقنية wit.ai.

2K 141 18
HYP3R00T
voicepad-core

Core audio processing for Voicepad.

2K 0 0
HYP3R00T
voicepad

Command-line interface for voice recording and GPU-accelerated transcription.

2K 0 0
zh-plus
openlrc

Transcribe and translate voice into LRC file using Whisper and LLMs (GPT, Claude, et,al). 使用whisper和LLM(GPT,Claude等)来转录、翻译你的音频为字幕文件。

1K 654 49
gorkemkaramolla
whisper-run

Faster Whisper with Speaker Diarization

755 9 1
clemente0731
casts-down

Cross-platform CLI for downloading and transcribing podcasts with local Whisper speech-to-text

558 2 0
botbahlul
whisper-autosrt

a command line utility for automatic speech recognition and subtitle generation

553 29 3
aholten
stttui

A local, fully offline speech-to-text terminal UI using your CPU

546 0 0
BeckettFrey
speech-mine

Turn audio into searchable, speaker-labeled transcripts. Built for iterative pipelines, not one-shot runs.

539 0 2
primaprashant
hns

hns is a speech-to-text CLI tool to transcribe your voice from your microphone directly to clipboard. Integrate hns with Claude Code, Ollama, LLM, and more CLI tools for powerful workflows.

449 100 14
nkaaf
whisper-streaming

Providing easy-to-use and extensible STT (Speech-To-Text) implementation with Whisper-like ASR (Automatic Speech Recognition) models.

320 0 1
Chenyme
aavt

A Python package for Video Translate and Some Tools for Video

283 3K 242
NavodPeiris
bmnspeechlib

Speechlib is a library that unifies speaker diarization, transcription and speaker recognition in a single pipeline to create transcripts for audio conversations with actual speaker names and time tags. This library also contains audio preprocessor functions.

208 258 27
abid-mahdi
whisper-telegram-mcp

Two-way voice for Claude via Telegram — Whisper transcription + Kokoro/OpenAI TTS as an MCP server

177 0 0
ASRBench
asrbench-cli

A command-line tool for the ASRBench framework, simplifying audio transcription system benchmarking with a single config file, supporting popular and custom transcription systems

132 0 0
Chenyme
aavt111

这是一个全自动(音频)视频翻译项目。利用Whisper识别声音,AI大模型翻译字幕,最后合并字幕视频,生成翻译后的视频。

5 3K 242
    • Data from PyPI, GitHub, ClickHouse, and BigQuery