PyPI Stats

Speaker Recognition Python Packages

Python packages with the GitHub topic speaker-recognition. Sorted by relevance, with stars and monthly downloads.
NVIDIA
nemo-toolkit

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

823K 17K 3K
wenet-e2e
wespeakerruntime

Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit

10K 1K 192
FoxNoseTech
diarize

Speaker diarization for Python — "who spoke when?" CPU-only, no API keys, Apache 2.0. ~10.8% DER on VoxConverse, 8x faster than real-time.

4K 62 7
NavodPeiris
speechlib

Speechlib is a library that unifies speaker diarization, transcription and speaker recognition in a single pipeline to create transcripts for audio conversations with actual speaker names and time tags. This library also contains audio preprocessor functions.

3K 258 27
Picovoice
pveagle

On-device speaker recognition engine powered by deep learning

1K 42 6
NVIDIA
nemo-toolkit-asr

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

1K 17K 3K
Picovoice
pveagledemo

On-device speaker recognition engine powered by deep learning

954 42 6
nvidia
nemo-asr

Collection of Neural Modules for Speech Recognition

693 17K 3K
Picovoice
pvfalcon

On-device speaker diarization powered by deep learning

686 71 7
google
diarizationlm

This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team at Google.

673 450 38
yeyupiaoling
mvector

Voiceprint recognition toolkit built on PyTorch

651 1K 167
yeyupiaoling
ppvector

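This project uses several advanced voiceprint recognition models, including EcapaTdnn, ResNetSE, ERes2Net, and CAM++, and also supports several data preprocessing methods, including MelSpectrogram, Spectrogram, MFCC, and Fbank.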

618 312 50
google
sidlingvo

This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team at Google.

471 450 38
google
uisrnn

This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Supervised Speaker Diarization.

460 2K 319
Picovoice
pvfalcondemo

On-device speaker diarization powered by deep learning

370 71 7
nvidia
nemo-nlp

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

344 17K 3K
hyperion-ml
hyperion-ml

Python toolkit for speech processing

296 72 21
zabir-nabil
audioperm

A Python library for generating different permutations of audible segments from audio files.

278 13 2
shangeth
wavencoder

WavEncoder is a Python library for encoding audio signals, transforms for audio augmentation, and training audio classification models with PyTorch backend.

261 92 14
Gr122lyBr
voicetag

Speaker identification powered by pyannote and resemblyzer

246 49 4
NavodPeiris
bmnspeechlib

Speechlib is a library that unifies speaker diarization, transcription and speaker recognition in a single pipeline to create transcripts for audio conversations with actual speaker names and time tags. This library also contains audio preprocessor functions.

216 258 27
jakariaemon
whisper-speaker-id

Whisper Speaker Identification (WSI), a cutting-edge model for multilingual speaker identification.

143 26 1
Adirockzz95
piwho

Speaker recognition library based on MARF for Raspberry Pi and other SBCs.

96 57 19
maxhollmann
voxceleb-luigi

Luigi pipeline to download VoxCeleb(2) audio from YouTube and extract speaker segments

95 43 4
Data from PyPI, GitHub, ClickHouse, and BigQuery