Speaker Recognition Python Packages

nemo-toolkit

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

798K 17K 3K

wespeakerruntime

Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit

10K 1K 192

diarize

Speaker diarization for Python — "who spoke when?" CPU-only, no API keys, Apache 2.0. ~10.8% DER on VoxConverse, 8x faster than real-time.

4K 62 7

speechlib

Speechlib is a library that unifies speaker diarization, transcription and speaker recognition in a single pipeline to create transcripts for audio conversations with actual speaker names and time tags. This library also contains audio preprocessor functions.

3K 258 27

pveagle

On-device speaker recognition engine powered by deep learning

2K 42 6

nemo-toolkit-asr

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

1K 17K 3K

pveagledemo

On-device speaker recognition engine powered by deep learning

885 42 6

diarizationlm

This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team at Google.

704 449 38

nemo-asr

Collection of Neural Modules for Speech Recognition

698 17K 3K

pvfalcon

On-device speaker diarization powered by deep learning

666 71 7

ppvector

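This project uses several advanced voiceprint recognition models, including EcapaTdnn, ResNetSE, ERes2Net, and CAM++, and also supports multiple data preprocessing methods, including MelSpectrogram, Spectrogram, MFCC, and Fbank.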

599 312 50

mvector

Voice Print Recognition toolkit on PyTorch

583 1K 167

sidlingvo

This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team at Google.

466 449 38

uisrnn

This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Supervised Speaker Diarization.

429 2K 319

pvfalcondemo

On-device speaker diarization powered by deep learning

327 71 7

audioperm

A Python library for generating different permutations of audible segments from audio files.

288 13 2

hyperion-ml

Python toolkit for speech processing

279 72 21

nemo-nlp

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

254 17K 3K

voicetag

Speaker identification powered by pyannote and resemblyzer

250 49 4

wavencoder

WavEncoder is a Python library for encoding audio signals, applying transforms for audio augmentation, and training audio classification models with a PyTorch backend.

246 92 14

bmnspeechlib

208 258 27

whisper-speaker-id

Whisper Speaker Identification (WSI), a cutting-edge model for multilingual speaker identification.

130 26 1

piwho

Speaker recognition library based on MARF for Raspberry Pi and other SBCs.

94 57 19

voxceleb-luigi

Luigi pipeline to download VoxCeleb(2) audio from YouTube and extract speaker segments

86 43 4
