PyPI Stats

Search Packages

Find Python packages by name, description, GitHub topic, or filter by metrics
NVIDIA
nemo-toolkit

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

798K 17K 3K
wenet-e2e
wespeakerruntime

Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit

10K 1K 192
FoxNoseTech
diarize

Speaker diarization for Python — "who spoke when?" CPU-only, no API keys, Apache 2.0. ~10.8% DER on VoxConverse, 8x faster than real-time.

4K 62 7
NavodPeiris
speechlib

Speechlib is a library that unifies speaker diarization, transcription and speaker recognition in a single pipeline to create transcripts for audio conversations with actual speaker names and time tags. This library also contains audio preprocessor functions.

3K 258 27
Picovoice
pveagle

On-device speaker recognition engine powered by deep learning

2K 42 6
NVIDIA
nemo-toolkit-asr

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

1K 17K 3K
Picovoice
pveagledemo

On-device speaker recognition engine powered by deep learning

885 42 6
google
diarizationlm

This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team at Google.

704 449 38
nvidia
nemo-asr

Collection of Neural Modules for Speech Recognition

698 17K 3K
Picovoice
pvfalcon

On-device speaker diarization powered by deep learning

666 71 7
yeyupiaoling
ppvector

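This project uses several advanced voiceprint recognition models, including EcapaTdnn, ResNetSE, ERes2Net, and CAM++, and also supports multiple data preprocessing methods such as MelSpectrogram, Spectrogram, MFCC, and Fbank.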

599 312 50
yeyupiaoling
mvector

Voice Print Recognition toolkit on PyTorch

583 1K 167
google
sidlingvo

This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team at Google.

466 449 38
google
uisrnn

This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Supervised Speaker Diarization.

429 2K 319
Picovoice
pvfalcondemo

On-device speaker diarization powered by deep learning

327 71 7
zabir-nabil
audioperm

A Python library for generating different permutations of audible segments from audio files.

288 13 2
hyperion-ml
hyperion-ml

Python toolkit for speech processing

279 72 21
nvidia
nemo-nlp

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

254 17K 3K
Gr122lyBr
voicetag

Speaker identification powered by pyannote and resemblyzer

250 49 4
shangeth
wavencoder

WavEncoder is a Python library for encoding audio signals, transforms for audio augmentation, and training audio classification models with PyTorch backend.

246 92 14
NavodPeiris
bmnspeechlib

Speechlib is a library that unifies speaker diarization, transcription and speaker recognition in a single pipeline to create transcripts for audio conversations with actual speaker names and time tags. This library also contains audio preprocessor functions.

208 258 27
jakariaemon
whisper-speaker-id

Whisper Speaker Identification (WSI), a cutting-edge model for multilingual speaker identification.

130 26 1
Adirockzz95
piwho

Speaker recognition library based on MARF for Raspberry Pi and other SBCs.

94 57 19
maxhollmann
voxceleb-luigi

Luigi pipeline to download VoxCeleb(2) audio from YouTube and extract speaker segments

86 43 4
    • Data from PyPI, GitHub, ClickHouse, and BigQuery