PyPI Stats

TTS Python Packages

Python packages with the GitHub topic tts. Sorted by relevance, with stars and monthly downloads.
rany2
edge-tts

Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key

4.6M 11K 1K
unslothai
unsloth

Web UI for training and running open models like Gemma 4, Qwen3.6, DeepSeek, gpt-oss locally.

1.9M 64K 6K
pndurette
gtts

Python library and CLI tool to interface with Google Translate's text-to-speech API

1.5M 3K 383
unslothai
unsloth-zoo

Web UI for training and running open models like Gemma 4, Qwen3.6, DeepSeek, gpt-oss locally.

1.4M 64K 6K
NVIDIA
nemo-toolkit

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

806K 17K 3K
thewh1teagle
kokoro-onnx

TTS with kokoro and onnx runtime

278K 3K 268
coqui-ai
tts

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

204K 45K 6K
eginhard
monotonic-alignment-search

Monotonically align text and speech

191K 4 1
snakers4
silero

Silero Models: pre-trained text-to-speech models made embarrassingly simple

175K 6K 363
OpenBMB
voxcpm

VoxCPM2: Tokenizer-Free TTS for Multilingual Speech Generation, Creative Voice Design, and True-to-Life Cloning

125K 17K 2K
thewh1teagle
phonikud

Hebrew grapheme to phoneme (G2P)

50K 95 12
tsukumijima
pyopenjtalk-plus

pyopenjtalk-plus: A Python wrapper for OpenJTalk with additional improvements

28K 57 4
GetStream
vision-agents

Open Vision Agents by Stream. Build voice and vision agents quickly with any model or video provider. Uses Stream's edge network for ultra-low latency.

27K 8K 637
moonshine-ai
moonshine-voice

Very low latency speech to text, intent recognition, and text to speech, for building voice agents and interfaces

26K 8K 408
mbailey
voice-mode

Natural (2-way) voice conversations with Claude Code

25K 1K 155
DoodleBears
split-lang

✨ Split text by languages (e.g. 你喜欢看アニメ吗 -> 你喜欢看 | アニメ | 吗) for NLP tasks (e.g. parse, TTS). Powered by fasttext and budoux

25K 74 11
daswer123
xtts-api-server

A simple FastAPI Server to run XTTSv2

20K 584 156
ywatanabe1989
scitex-audio

Text-to-Speech with Multiple Backend Fallback (elevenlabs → luxtts → gtts → pyttsx3)

20K 0 0
ywatanabe1989
scitex-notification

Give your AI agents a voice — TTS, phone calls, SMS, email, webhooks. One MCP server, 8 backends.

20K 2 0
pnnbao97
sea-g2p

Fast multilingual text-to-phoneme converter for South East Asian languages.

19K 94 21
mosecorg
mosec

A high-performance ML model serving framework, offers dynamic batching and CPU/GPU pipelines to fully exploit your compute machine

18K 899 72
ai-bot-pro
achatbot

An open source chat bot architecture for voice/vision (and multimodal) assistants that can run locally (CPU/GPU bound) and remotely (I/O bound).

14K 89 18
holgern
kokorog2p

A unified multi-language G2P (Grapheme-to-Phoneme) library for Kokoro TTS.

11K 3 1
foyoux
pygtrans

Google Translate; supports APIKEY and translates 100,000 entries in one go

11K 245 43
Data from PyPI, GitHub, ClickHouse, and BigQuery