464 dependents
| Package | Description | Downloads/month |
|---|---|---|
| A robust, efficient, low-latency speech-to-text library with advanced voice acti... | 177K | |
| 删库 | 150K | |
| Sprite AI is an AI companion for your desktop | 83K | |
| Desktop AI Assistant powered by GPT-5, GPT-4, o1, o3, Gemini, Claude, Ollama, De... | 52K | |
| A nearly-live implementation of OpenAI's Whisper. | 20K | |
| A simple FastAPI Server to run XTTSv2 | 20K | |
| Python packages to interact with the hardware of the Stretch mobile manipulators... | 14K | |
| Converts text to speech in realtime | 12K | |
| Ubo main app, running on device initialization. A platform for running other app... | 8K | |
| All-Python FT8 Transceiver GUI / Command Line Modem | 8K | |
| XGO Blockly - 图形化和 AI 编程 Web 服务器 | 7K | |
| Framework for building deep neural network models for sound, speech, and voice A... | 7K | |
| Fish Speech | 6K | |
| Transcribe your .wav .mp4 .mp3 .flac files to text or record your own audio! | 6K | |
| XGO Blockly CM4 - 图形化和 AI 编程 Web 服务器 (CM4精简版) | 6K | |
| مستندات کتابخانه تبدیل متن به خطوط باستانی که میتونید از این سورس های متن بازاست... | 5K | |
| Python bindings for using SoundFonts (sf2/sf3/sfo formats), generating audio sam... | 5K | |
| A real-time software for turn-taking, backchannel, and head-nodding prediction | 5K | |
| This package lets you start Vapi calls directly from Python. | 5K | |
| data over sound plugin | 5K | |
| 5K | ||
| 🥰 Building AI-based conversational avatars lightning fast ⚡️💬 | 3K | |
| 3K | ||
| A PyAudio wrapper for aiovban | 3K | |
| Core Python utilities for the Snips Manager | 3K | |
| Connect to the Unitree Go2 and G1 with WebRTC | 2K | |
| Let's make the OpenAI module easy to use | 2K | |
| high quality multi-lingual speech to text | 2K | |
| World's first 'Text to ML model' support library. Also comes with an analysis ch... | 2K | |
| voice_code project | 2K | |
| Everything to control and customize Tello | 2K | |
| AI voice chatbot using Hack Club AI | 2K | |
| The VoiceProcessingToolkit is an all-encompassing suite designed for sophisticat... | 2K | |
| 2K | ||
| An OpenSesame Plug-in for playing and recording audio files with low latency on ... | 2K | |
| A versatile local LLM framework for chat, image, video, machine learning modelli... | 2K | |
| A Python library for Real-time Music Alignment | 1K | |
| Whisper for your microphone | 1K | |
| Transcribe long audio files with ASR or use the streaming interface | 1K | |
| Official Python SDK for Lokutor - Real-time AI voice and TTS | 1K | |
| Connect to the Unitree Go2 and G1 with WebRTC | 1K | |
| GitBase is a custom database system built with Python and powered by GitHub, tre... | 1K | |
| A terminal-based random chord progression generator | 1K | |
| 1K | ||
| voice-presentation-control is a tool that allows you to control your presentatio... | 1K | |
| Command-line tool for learning foreign languages through gradual exposure to new... | 1K | |
| A seamless voice dictation system for Linux | 1K | |
| A Scrcpy (V3.3) client implemented in Python. Gui with Kivy/KivyMD. With Video, ... | 1K | |
| Library for Unihiker M10 | 1K | |
| Modern Speech Recognition with both active and ambient listening and keyboard | 1K |