PyPI Stats
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About
Home

Search Packages

Find Python packages by name, description, GitHub topic, or filter by metrics
aarnphm
whispercpp

Pybind11 bindings for Whisper.cpp

13K 345 68
linto-ai
linto

Transcription and annotation interface for recorded audio or video files

2K 53 6
cj-mills
cjm-transcription-plugin-system

A flexible plugin system for audio transcription intended to make it easy to add support for multiple backends.

2K 0 0
cj-mills
cjm-transcription-plugin-whisper

OpenAI Whisper plugin for the cjm-transcription-plugin-system library - provides local speech-to-text transcription with configurable model selection and parameter control.

856 0 0
cj-mills
cjm-transcription-plugin-voxtral-hf

Mistral Voxtral plugin for the cjm-transcription-plugin-system library - provides local speech-to-text transcription through 🤗 Transformers with configurable model selection and parameter control.

602 0 0
clemente0731
casts-down

Cross-platform CLI for downloading and transcribing podcasts with local Whisper speech-to-text

558 2 0
cj-mills
cjm-transcription-plugin-voxtral-vllm

Mistral Voxtral plugin for the cjm-transcription-plugin-system library - provides local speech-to-text transcription through vLLM with configurable model selection and parameter control.

546 0 0
cj-mills
cjm-transcription-plugin-gemini

Google Gemini API plugin for the cjm-transcription-plugin-system library - provides speech-to-text transcription with configurable model selection and parameter control.

502 0 0
Erisae
openai-game-translator

an openai based game audio translator

375 5 1
aeronjl
precisetranscribe

Utilities for transcribing audio files using the Whisper API.

324 1 0
thinh-vu
ur-audio-sub

Generate text captions for audio files & youtube video using OpenAI Whisper on Google Colab. Multiple languages support.

255 16 2
Gr122lyBr
voicetag

Speaker identification powered by pyannote and resemblyzer

250 49 4
ymrohit
openscenesense

OpenSceneSense is a Python library that harnesses AI for advanced video analysis, offering customizable frame and audio insights for dynamic applications in media, education, and content moderation.

196 22 1
timf34
podscript

CLI tool to transcribe podcasts and YouTube videos into clean markdown with speaker diarization and timestamps

146 41 1
veralvx
trainscribe

A command-line tool for transcribing audio files in a folder to a metadata.csv file, using OpenAI's Whisper.

130 0 0
CodeWithBehnam
vayu-whisper

Vayu (وایو) - The fastest Whisper implementation on Apple Silicon

121 1 0
    • Data from PyPI, GitHub, ClickHouse, and BigQuery