PyPI Stats

Search Packages

Find Python packages by name, description, or GitHub topic, or filter by metrics
speechmatics
speechmatics-python

Python library and CLI for Speechmatics

50K 75 23
neocl
speach

🐍🍑 Python 3 library for managing, annotating, and converting natural language corpuses using popular formats (CoNLL, ELAN, Praat, CSV, JSON, SQLite, VTT, Audacity, TTL, TIG, ISF, etc.)

29K 21 6
hajin-park
spotify-translator

Generate lyric translations and transcriptions from Spotify URLs using OpenAI's Whisper model.

18K 0 0
Picovoice
pvcheetah

On-device streaming speech-to-text engine powered by deep learning

6K 662 77
juanmc2005
diart

A Python package to build AI-powered real-time audio applications

6K 2K 161
Picovoice
pvleopard

On-device speech-to-text engine powered by deep learning

5K 481 29
baxtree
subaligner

Automatically synchronize and translate subtitles, or create new ones by transcribing, using pre-trained DNNs, Forced Alignments and Transformers. https://subaligner.readthedocs.io/

3K 506 24
Yujia-Yan
transkun

A simple yet effective Audio-to-Midi Automatic Piano Transcription system

3K 335 33
NavodPeiris
speechlib

Speechlib is a library that unifies speaker diarization, transcription and speaker recognition in a single pipeline to create transcripts for audio conversations with actual speaker names and time tags. This library also contains audio preprocessor functions.

3K 258 27
deepgram
deepctl

Official Deepgram CLI — speech-to-text, text-to-speech, and audio intelligence from your terminal

3K 3 2
nomadkaraoke
lyrics-transcriber

Automatically create synchronised lyrics files in ASS and LRC with word-level timestamps, using Whisper and lyrics from online sources, with anchor sequences and LLMs to auto-correct transcription

3K 91 17
plribeiro3000
meeting-hive

Local-first, AI-queryable meeting archive. Ingests from meeting tools, applies personal vocabulary corrections, and stores in portable markdown.

2K 0 0
patrikx3
p3x-meet-assistant

Real-time AI speech-to-text for meetings with GPT-4o Transcribe and GPU speaker diarization

2K 0 0
kristofferv98
voiceprocessingtoolkit

The VoiceProcessingToolkit is an all-encompassing suite designed for sophisticated voice detection, wake word recognition, text-to-speech synthesis, and advanced audio processing. It offers intuitive interfaces to streamline the integration of voice processing capabilities into your applications

2K 4 1
HYP3R00T
voicepad-core

Core audio processing for Voicepad.

2K 0 0
HYP3R00T
voicepad

Command-line interface for voice recording and GPU-accelerated transcription.

2K 0 0
andbue
nashi

Some bits of JavaScript to transcribe scanned pages using PageXML

1K 17 4
d-flood
criticus

A collection of computer tools for aiding the text critical workflow from transcription to collation to analysis.

1K 26 2
grahamrowe82
slurpai

Convert voice notes, videos, and audio files into AI-ready text and images

1K 1 0
tiroq
mnemofy

mnemofy extracts audio from media files, transcribes speech, and produces documented meeting notes with topics, decisions, and concrete mentions.

1K 0 0
joinly-ai
joinly-client

Client for joinly: Make your meetings accessible to AI Agents

1K 497 81
Picovoice
pvcheetahdemo

On-device streaming speech-to-text engine powered by deep learning

911 662 77
dimastatz
whisperflow

WhisperFlow: Real-Time Transcription Powered by OpenAI Whisper

878 696 101
Picovoice
pvleoparddemo

On-device speech-to-text engine powered by deep learning

869 481 29
    • Data from PyPI, GitHub, ClickHouse, and BigQuery