PyPI Stats
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About
Home

Transcription Python Packages

Python packages with the GitHub topic transcription. Sorted by relevance, with stars and monthly downloads.
speechmatics
speechmatics-python

Python library and CLI for Speechmatics

51K 75 23
neocl
speach

🐍🍑 Python 3 library for managing, annotating, and converting natural language corpuses using popular formats (CoNLL, ELAN, Praat, CSV, JSON, SQLite, VTT, Audacity, TTL, TIG, ISF, etc.)

30K 21 6
hajin-park
spotify-translator

Generate lyric translations and transcriptions from Spotify URLs using OpenAI's Whisper model.

18K 0 0
Picovoice
pvcheetah

On-device streaming speech-to-text engine powered by deep learning

7K 662 77
juanmc2005
diart

A python package to build AI-powered real-time audio applications

6K 2K 161
Picovoice
pvleopard

On-device speech-to-text engine powered by deep learning

5K 481 29
baxtree
subaligner

Automatically synchronize and translate subtitles, or create new ones by transcribing, using pre-trained DNNs, Forced Alignments and Transformers. https://subaligner.readthedocs.io/

4K 506 24
Yujia-Yan
transkun

A simple yet effective Audio-to-Midi Automatic Piano Transcription system

3K 335 33
NavodPeiris
speechlib

Speechlib is a library that unifies speaker diarization, transcription and speaker recognition in a single pipeline to create transcripts for audio conversations with actual speaker names and time tags. This library also contains audio preprocessor functions.

3K 258 27
deepgram
deepctl

Official Deepgram CLI — speech-to-text, text-to-speech, and audio intelligence from your terminal

3K 3 2
nomadkaraoke
lyrics-transcriber

Automatically create synchronised lyrics files in ASS and LRC with word-level timestamps, using Whisper and lyrics from online sources, with anchor sequences and LLMs to auto-correct transcription

3K 91 17
plribeiro3000
meeting-hive

Local-first, AI-queryable meeting archive. Ingests from meeting tools, applies personal vocabulary corrections, and stores in portable markdown.

2K 0 0
patrikx3
p3x-meet-assistant

Real-time AI speech-to-text for meetings with GPT-4o Transcribe and GPU speaker diarization

2K 0 0
kristofferv98
voiceprocessingtoolkit

The VoiceProcessingToolkit is an all-encompassing suite designed for sophisticated voice detection, wake word recognition, text-to-speech synthesis, and advanced audio processing. It offers intuitive interfaces to streamline the integration of voice processing capabilities into your applications

2K 4 1
HYP3R00T
voicepad-core

Core audio processing for Voicepad.

2K 0 0
HYP3R00T
voicepad

Command-line interface for voice recording and GPU-accelerated transcription.

2K 0 0
andbue
nashi

Some bits of javascript to transcribe scanned pages using PageXML

2K 17 4
d-flood
criticus

A collection of computer tools for aiding the text critical workflow from transcription to collation to analysis.

1K 26 2
grahamrowe82
slurpai

Convert voice notes, videos, and audio files into AI-ready text and images

1K 1 0
joinly-ai
joinly-client

Client for joinly: Make your meetings accessible to AI Agents

982 497 81
tiroq
mnemofy

mnemofy extracts audio from media files, transcribes speech, and produces documented meeting notes with topics, decisions, and concrete mentions.

977 0 0
dimastatz
whisperflow

WhisperFlow: Real-Time Transcription Powered by OpenAI Whisper

911 696 101
Picovoice
pvcheetahdemo

On-device streaming speech-to-text engine powered by deep learning

910 662 77
Picovoice
pvleoparddemo

On-device speech-to-text engine powered by deep learning

875 481 29
    • Data from PyPI, GitHub, ClickHouse, and BigQuery