Dependents of pyannote-audio

80 dependents

Package	Description	Downloads/month
whisperx	WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarizatio...	1.1M
senselab	senselab is a Python package that simplifies building pipelines for biometric (e...	13K
batchalign	Python Speech Language Sample Analysis	10K
batchalignhk	Python Speech Language Sample Analysis	8K
insanely-fast-whisper	An insanely fast whisper CLI	7K
diart	A python package to build AI-powered real-time audio applications	6K
speechlib	Speechlib is a library that unifies speaker diarization, transcription and speak...	3K
whisply	💬 Fast, cross-platform CLI and GUI for batch transcription, translation, speaker...	3K
whispermlx	Time-Accurate Automatic Speech Recognition using Whisper.	2K
easytranscriber	Speech recognition with accurate word-level timestamps.	2K
turnvoice	Voice Transformation for Videos. 🎤👄🎬	1K
citeindex	Ingest sources with proper citation — PDF, URL, media, Office, DJVU	1K
clipsai	Clips AI is an open-source Python library that automatically converts long video...	1K
voxscriber	Local speaker diarization using MLX Whisper (macOS) or faster-whisper (Linux/CUD...	1K
pelican-nlp	Preprocessing and Extraction of Linguistic Information for Computational Analysi...	1K
open-dubbing	Open dubbing is an AI dubbing system which uses machine learning models to autom...	1K
psifx	Psychological and Social Interactions Feature Extraction	1K
easyaligner	Forced alignment pipeline designed for efficiency and ease of use.	1K
gtech-ariel	Google EMEA gTech Ads Data Science Team's solution to automatically translate an...	1K
openwillis-transcribe	Python library for digital measurement of health	1K
wenbi	A simple tool to make the video, audio, subtitle and video-url (especially youtu...	1K
whisperx-legen-fork	Flavored fork of m-bain/WhisperX for LeGen better experience	929
wishcribe	Fast multi-speaker audio/video transcription — faster-whisper + pyannote.audio	874
clipsai-jp	このパッケージはClipsAIの日本語専用フォーク版です。whisperxをfaster-whisperに置き換え、依存関係の問題を解決しています。	783
jarvis-conversationalist	A voice assistant for the command line	782
zagency	An agentic framework for building AI agents with LLM integration	711
gogadget	User friendly toolkit for generating immersion language learning tools including...	704
transcribe-with-whisper	Add your description here	697
murmurai-core	Modern speech recognition with word-level timestamps and speaker diarization. Fo...	687
asaca	Automatic Speech Analysis for Cognitive Assessment	599
speakerscribe	Speech-to-text with speaker diarization — Whisper + pyannote.audio, optimized fo...	552
sonata-asr	SONATA (SOund and Narrative Advanced Transcription Assistant): An advanced ASR s...	542
scraibe	Transcription tool for audio files based on Whisper and Pyannote	541
gryannote	Provide Gradio custom components to make the diarization-based audio labeling pr...	511
whisperer-ml	Go from raw audio to a text-audio dataset with OpenAI's Whisper	484
miya-speechless	Speechless repo for sales call analysis	444
diarize-whisper	Librairie pour la transcription ASR et la diarisation	405
localtranscribe	A lightweight, offline-first transcription utility for audio and video files wit...	404
captionalchemy	A Python package to create closed captions with face detection and recognition.	399
topai-faster-whisper	Faster Whisper transcription with CTranslate2	378
o2-speechless	Speechless repo for sales call analysis	363
audio-metrics-cli	Voice Acoustic Analyzer - Professional audio metrics extraction	361
trnscrb	Offline meeting transcription for macOS — auto-detects meetings, transcribes loc...	349
ttsds	The TTSDS benchmark evaluates synthetic speech quality by considering prosody, s...	346
audio-scribe	A command-line tool for audio transcription with Whisper and Pyannote.	344
pickpod	Integrated tools to transfer the internet audio to text, extract unpopular views...	336
audio-transcribing	A toolkit for audio transcription, speaker diarization, and text processing	332
audiotextspeakerchangedetect	Detect Speaker Change based on Textual Features via LLMs & Rule-Based NLP and Au...	330
audiotool	audiotool is a DeepLearning utility library.	330
whisperjf	A compatibility fix to for whisperx for use with gogadget	310