PyPI Stats

Search Packages

Find Python packages by name, description, GitHub topic, or filter by metrics
NVIDIA
nemo-toolkit

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

798K 17K 3K
espnet
espnet

End-to-End Speech Processing Toolkit

30K 10K 2K
KevKibe
africanwhisper

🚀 Framework for seamless fine-tuning of Whisper model on a multi-lingual dataset and deployment to prod.

3K 38 6
PaddlePaddle
paddlespeech-feat

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.

3K 13K 2K
PaddlePaddle
paddlespeech

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.

3K 13K 2K
PalabraAI
palabra-ai

Python SDK for Palabra AI's real-time speech-to-speech translation API. Break down language barriers and enable seamless communication across 25+ languages.

2K 37 6
NVIDIA
nemo-toolkit-asr

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

1K 17K 3K
PaddlePaddle
paddleaudio

Speech audio tools based on PaddlePaddle

1K 13K 2K
PaddlePaddle
paddlespeech-ctcdecoders

CTC decoders in paddlespeech

1K 13K 2K
nvidia
nemo-asr

Collection of Neural Modules for Speech Recognition

698 17K 3K
nvidia
nemo-nlp

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

254 17K 3K
George0828Zhang
torch-cif

A fast parallel implementation of continuous integrate-and-fire (CIF) https://arxiv.org/abs/1905.11235

196 36 4
nvidia
nemo-tts

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

82 17K 3K
PaddlePaddle
paddlespeech-ldd-ctcdecoders

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.

59 13K 2K
    • Data from PyPI, GitHub, ClickHouse, and BigQuery