PyPI Stats
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About
Home

Text To Speech Python Packages

Python packages with the GitHub topic text-to-speech. Sorted by relevance, with stars and monthly downloads.
elevenlabs
elevenlabs

The official Python SDK for the ElevenLabs API.

9.4M 3K 413
rany2
edge-tts

Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key

4.6M 11K 1K
deepgram
deepgram-sdk

Official Python SDK for Deepgram.

2.2M 424 127
unslothai
unsloth

Web UI for training and running open models like Gemma 4, Qwen3.6, DeepSeek, gpt-oss locally.

1.9M 64K 6K
pndurette
gtts

Python library and CLI tool to interface with Google Translate's text-to-speech API

1.5M 3K 383
unslothai
unsloth-zoo

Web UI for training and running open models like Gemma 4, Qwen3.6, DeepSeek, gpt-oss locally.

1.4M 64K 6K
nateshmbhat
pyttsx3

Offline Text To Speech synthesis for python

789K 3K 360
k2-fsa
sherpa-onnx

Speech-to-text, text-to-speech, speaker diarization, speech enhancement, source separation, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, HarmonyOS, Raspberry Pi, RISC-V, RK NPU, Axera NPU, Ascend NPU, x86_64 servers, websocket server/client, support 12 programming languages

244K 12K 1K
fishaudio
fish-audio-sdk

The official Python library for the Fish Audio API.

214K 173 31
coqui-ai
tts

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

204K 45K 6K
eginhard
monotonic-alignment-search

Monotonically align text and speech

191K 4 1
snakers4
silero

Silero Models: pre-trained text-to-speech models made embarrassingly simple

175K 6K 363
k2-fsa
sherpa-onnx-core

Speech-to-text, text-to-speech, speaker diarization, speech enhancement, source separation, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, HarmonyOS, Raspberry Pi, RISC-V, RK NPU, Axera NPU, Ascend NPU, x86_64 servers, websocket server/client, support 12 programming languages

147K 12K 1K
OpenBMB
voxcpm

VoxCPM2: Tokenizer-Free TTS for Multilingual Speech Generation, Creative Voice Design, and True-to-Life Cloning

125K 17K 2K
Blaizzy
mlx-audio

A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS) library built on Apple's MLX framework, providing efficient speech analysis on Apple Silicon.

78K 7K 578
playht
pyht

PlayHT Python SDK - AI Text-to-Speech Streaming & Voice Cloning API

60K 219 33
k2-fsa
sherpa-onnx-bin

Speech-to-text, text-to-speech, speaker diarization, speech enhancement, source separation, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, HarmonyOS, Raspberry Pi, RISC-V, RK NPU, Axera NPU, Ascend NPU, x86_64 servers, websocket server/client, support 12 programming languages

56K 12K 1K
gradio-app
fastrtc

The python library for real-time communication

51K 5K 430
espnet
espnet

End-to-End Speech Processing Toolkit

29K 10K 2K
jeroenterheerdt
pycsspeechtts

Python (py) library to use Microsofts Cognitive Services Speech (csspeech) Text to Speech (tts) API.

29K 5 8
collabora
whisper-live

A nearly-live implementation of OpenAI's Whisper.

20K 4K 549
ywatanabe1989
scitex-audio

Text-to-Speech with Multiple Backend Fallback (elevenlabs → luxtts → gtts → pyttsx3)

20K 0 0
ywatanabe1989
scitex-notification

Give your AI agents a voice — TTS, phone calls, SMS, email, webhooks. One MCP server, 8 backends.

20K 2 0
Capsize-Games
airunner

Offline inference engine for art, real-time voice conversations, LLM powered chatbots and automated workflows

14K 1K 97
    • Data from PyPI, GitHub, ClickHouse, and BigQuery