PyPI Stats

Search Packages

Find Python packages by name, description, or GitHub topic, or filter by metrics
alphacep
vosk

Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node

458K 15K 2K
speechmatics
speechmatics-rt

Python SDKs for Speechmatics APIs

144K 18 7
speechmatics
speechmatics-voice

Python SDKs for Speechmatics APIs

103K 18 7
khoj-ai
khoj

Your AI second brain. Self-hostable. Get answers from the web or your docs. Build custom agents, schedule automations, do deep research. Turn any online or local LLM into your personal, autonomous AI (gpt, claude, gemini, llama, qwen, mistral). Get started - free.

61K 34K 2K
khoj-ai
khoj-assistant

Your AI second brain. Self-hostable. Get answers from the web or your docs. Build custom agents, schedule automations, do deep research. Turn any online or local LLM into your personal, autonomous AI (gpt, claude, gemini, llama, qwen, mistral). Get started - free.

48K 34K 2K
speechmatics
speechmatics-batch

Python SDKs for Speechmatics APIs

29K 18 7
GetStream
vision-agents

Open Vision Agents by Stream. Build voice and vision agents quickly with any model or video provider. Uses Stream's edge network for ultra-low latency.

26K 8K 637
moonshine-ai
moonshine-voice

Very low latency speech to text, intent recognition, and text to speech, for building voice agents and interfaces

25K 8K 408
istupakov
onnx-asr

A lightweight Python package for Automatic Speech Recognition using ONNX models

21K 311 30
analyticsinmotion
werpy

🐍📦 Ultra-fast Python package for calculating and analyzing the Word Error Rate (WER). Built for the scalable evaluation of speech and transcription accuracy.

15K 26 6
coqui-ai
stt

🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.

9K 3K 300
GetStream
vision-agents-plugins-getstream

Open Vision Agents by Stream. Build voice and vision agents quickly with any model or video provider. Uses Stream's edge network for ultra-low latency.

8K 8K 637
GetStream
vision-agents-plugins-deepgram

Open Vision Agents by Stream. Build voice and vision agents quickly with any model or video provider. Uses Stream's edge network for ultra-low latency.

7K 8K 637
GetStream
vision-agents-plugins-cartesia

Open Vision Agents by Stream. Build voice and vision agents quickly with any model or video provider. Uses Stream's edge network for ultra-low latency.

7K 8K 637
GetStream
vision-agents-plugins-openai

Open Vision Agents by Stream. Build voice and vision agents quickly with any model or video provider. Uses Stream's edge network for ultra-low latency.

7K 8K 637
Picovoice
pvcheetah

On-device streaming speech-to-text engine powered by deep learning

6K 662 77
GetStream
vision-agents-plugins-gemini

Open Vision Agents by Stream. Build voice and vision agents quickly with any model or video provider. Uses Stream's edge network for ultra-low latency.

6K 8K 637
GetStream
vision-agents-plugins-openrouter

Open Vision Agents by Stream. Build voice and vision agents quickly with any model or video provider. Uses Stream's edge network for ultra-low latency.

6K 8K 637
GetStream
vision-agents-plugins-smart-turn

Smart Turn detection plugin for Vision Agents

6K 8K 637
GetStream
vision-agents-plugins-elevenlabs

Open Vision Agents by Stream. Build voice and vision agents quickly with any model or video provider. Uses Stream's edge network for ultra-low latency.

5K 8K 637
GetStream
vision-agents-plugins-ultralytics

Open Vision Agents by Stream. Build voice and vision agents quickly with any model or video provider. Uses Stream's edge network for ultra-low latency.

5K 8K 637
GetStream
vision-agents-plugins-kokoro

Open Vision Agents by Stream. Build voice and vision agents quickly with any model or video provider. Uses Stream's edge network for ultra-low latency.

5K 8K 637
GetStream
vision-agents-plugins-anthropic

Open Vision Agents by Stream. Build voice and vision agents quickly with any model or video provider. Uses Stream's edge network for ultra-low latency.

5K 8K 637
GetStream
vision-agents-plugins-fast-whisper

Open Vision Agents by Stream. Build voice and vision agents quickly with any model or video provider. Uses Stream's edge network for ultra-low latency.

5K 8K 637
Data from PyPI, GitHub, ClickHouse, and BigQuery