Audio Transcription Python Packages

whispercpp

Pybind11 bindings for Whisper.cpp

13K 345 68

linto

Transcription and annotation interface for recorded audio or video files

2K 53 6

cjm-transcription-plugin-system

A flexible plugin system for audio transcription intended to make it easy to add support for multiple backends.

2K 0 0

cjm-transcription-plugin-whisper

OpenAI Whisper plugin for the cjm-transcription-plugin-system library - provides local speech-to-text transcription with configurable model selection and parameter control.

856 0 0

cjm-transcription-plugin-voxtral-hf

Mistral Voxtral plugin for the cjm-transcription-plugin-system library - provides local speech-to-text transcription through 🤗 Transformers with configurable model selection and parameter control.

602 0 0

casts-down

Cross-platform CLI for downloading and transcribing podcasts with local Whisper speech-to-text

558 2 0

cjm-transcription-plugin-voxtral-vllm

Mistral Voxtral plugin for the cjm-transcription-plugin-system library - provides local speech-to-text transcription through vLLM with configurable model selection and parameter control.

546 0 0

cjm-transcription-plugin-gemini

Google Gemini API plugin for the cjm-transcription-plugin-system library - provides speech-to-text transcription with configurable model selection and parameter control.

502 0 0

openai-game-translator

an openai based game audio translator

375 5 1

precisetranscribe

Utilities for transcribing audio files using the Whisper API.

324 1 0

ur-audio-sub

Generate text captions for audio files & youtube video using OpenAI Whisper on Google Colab. Multiple languages support.

255 16 2

voicetag

Speaker identification powered by pyannote and resemblyzer

250 49 4

openscenesense

OpenSceneSense is a Python library that harnesses AI for advanced video analysis, offering customizable frame and audio insights for dynamic applications in media, education, and content moderation.

196 22 1

podscript

CLI tool to transcribe podcasts and YouTube videos into clean markdown with speaker diarization and timestamps

146 41 1

trainscribe

A command-line tool for transcribing audio files in a folder to a metadata.csv file, using OpenAI's Whisper.

130 0 0

vayu-whisper

Vayu (وایو) - The fastest Whisper implementation on Apple Silicon

121 1 0

Search Packages