PyPI Stats

Speech Translation Python Packages

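Python packages tagged with the GitHub topic speech-translation, sorted by relevance. Each entry gives the GitHub owner, the package name, the repository description, and a row of counts that includes monthly downloads and GitHub stars.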
NVIDIA
nemo-toolkit

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

900K 17K 3K
espnet
espnet

End-to-End Speech Processing Toolkit

29K 10K 2K
PaddlePaddle
paddlespeech-feat

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.

3K 13K 2K
PaddlePaddle
paddlespeech

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.

3K 13K 2K
KevKibe
africanwhisper

🚀 Framework for seamless fine-tuning of Whisper model on a multi-lingual dataset and deployment to prod.

3K 38 6
huggingface
speech-to-speech

Build local voice agents with open-source models

2K 5K 561
PalabraAI
palabra-ai

Python SDK for Palabra AI's real-time speech-to-speech translation API. Break down language barriers and enable seamless communication across 25+ languages

2K 37 6
PaddlePaddle
paddleaudio

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.

1K 13K 2K
NVIDIA
nemo-toolkit-asr

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

1K 17K 3K
PaddlePaddle
paddlespeech-ctcdecoders

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.

1K 13K 2K
nvidia
nemo-asr

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

699 17K 3K
nvidia
nemo-nlp

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

372 17K 3K
George0828Zhang
torch-cif

A fast parallel implementation of continuous integrate-and-fire (CIF) https://arxiv.org/abs/1905.11235

273 36 4
nvidia
nemo-tts

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

95 17K 3K
PaddlePaddle
paddlespeech-ldd-ctcdecoders

CTC decoders in paddlespeech

64 13K 2K

Data from PyPI, GitHub, ClickHouse, and BigQuery.