PyPI Stats
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About
Home

Search Packages

Find Python packages by name, description, GitHub topic, or filter by metrics
SYSTRAN
faster-whisper

Faster Whisper transcription with CTranslate2

7.4M 23K 2K
m-bain
whisperx

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

1.1M 22K 2K
basetenlabs
truss

The simplest way to serve AI/ML models in production

632K 1K 102
kurianbenoy
whisper-normalizer

A python package for whisper normalizer

454K 76 17
alibaba-damo-academy
funasr

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

357K 16K 2K
microsoft
foundry-local-sdk

Foundry Local Manager Python SDK: Control-plane SDK for Foundry Local.

319K 2K 307
basetenlabs
truss-transfer

The simplest way to serve AI/ML models in production

300K 1K 102
basetenlabs
baseten-performance-client

The simplest way to serve AI/ML models in production

182K 1K 102
linto-ai
whisper-timestamped

Multilingual Automatic Speech Recognition with word-level timestamps and confidence

49K 3K 210
xorbitsai
xinference

Swap GPT for any LLM by changing a single line of code. Xinference lets you run open-source, speech, and multimodal models on cloud, on-prem, or your laptop — all through one unified, production-ready inference API.

43K 9K 824
Softcatala
whisper-ctranslate2

Whisper command line client compatible with original OpenAI client based on CTranslate2.

38K 1K 124
mbailey
voice-mode

Natural (2-way) voice conversations with Claude Code

26K 1K 155
cubist38
mlx-openai-server

A high-performance API server that provides OpenAI-compatible endpoints for MLX models. Developed using Python and powered by the FastAPI framework, it provides an efficient, scalable, and user-friendly solution for running MLX-based vision and language models locally with an OpenAI-compatible interface.

22K 325 58
istupakov
onnx-asr

A lightweight Python package for Automatic Speech Recognition using ONNX models

21K 311 30
collabora
whisper-live

A nearly-live implementation of OpenAI's Whisper.

20K 4K 549
ARahim3
mlx-tune

Fine-tune LLMs on your Mac with Apple Silicon. SFT, DPO, GRPO, Vision, TTS, STT, Embedding, and OCR fine-tuning — natively on MLX. Unsloth-compatible API.

14K 1K 79
aarnphm
whispercpp

Pybind11 bindings for Whisper.cpp

13K 345 68
peterk
srt-equalizer

A Python module to transform subtitle line lengths, splitting into multiple subtitle fragments if necessary.

11K 41 6
alibaba-damo-academy
funasr-onnx

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

7K 16K 2K
rahulsiiitm
vidchain

✅A Lightweight Video RAG Framework for Multimodal Reasoning

5K 1 0
shashikg
whisper-s2t

An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Engine

5K 568 76
mbailey
voice-mode-install

Natural (2-way) voice conversations with Claude Code

4K 1K 155
KevKibe
africanwhisper

🚀 Framework for seamless fine-tuning of Whisper model on a multi-lingual dataset and deployment to prod.

3K 38 6
shhossain
banglaspeech2text

BanglaSpeech2Text: An open-source offline speech-to-text package for Bangla language. Fine-tuned on the latest whisper speech to text model for optimal performance.

3K 124 18
    • Data from PyPI, GitHub, ClickHouse, and BigQuery