464 dependents
Package Description Downloads/month
A robust, efficient, low-latency speech-to-text library with advanced voice acti... 177K
Delete the database 150K
Sprite AI is an AI companion for your desktop 83K
Desktop AI Assistant powered by GPT-5, GPT-4, o1, o3, Gemini, Claude, Ollama, De... 52K
A nearly-live implementation of OpenAI's Whisper. 20K
A simple FastAPI Server to run XTTSv2 20K
Python packages to interact with the hardware of the Stretch mobile manipulators... 14K
Converts text to speech in realtime 12K
Ubo main app, running on device initialization. A platform for running other app... 8K
All-Python FT8 Transceiver GUI / Command Line Modem 8K
XGO Blockly - Graphical and AI programming web server 7K
Framework for building deep neural network models for sound, speech, and voice A... 7K
Fish Speech 6K
Transcribe your .wav .mp4 .mp3 .flac files to text or record your own audio! 6K
XGO Blockly CM4 - Graphical and AI programming web server (CM4 lite edition) 6K
Documentation for the library that converts text to ancient scripts, whose open-source sources you can... 5K
Python bindings for using SoundFonts (sf2/sf3/sfo formats), generating audio sam... 5K
A real-time software for turn-taking, backchannel, and head-nodding prediction 5K
This package lets you start Vapi calls directly from Python. 5K
data over sound plugin 5K
5K
🥰 Building AI-based conversational avatars lightning fast ⚡️💬 3K
3K
A PyAudio wrapper for aiovban 3K
Core Python utilities for the Snips Manager 3K
Connect to the Unitree Go2 and G1 with WebRTC 2K
Let's make the OpenAI module easy to use 2K
high quality multi-lingual speech to text 2K
World's first 'Text to ML model' support library. Also comes with an analysis ch... 2K
voice_code project 2K
Everything to control and customize Tello 2K
AI voice chatbot using Hack Club AI 2K
The VoiceProcessingToolkit is an all-encompassing suite designed for sophisticat... 2K
2K
An OpenSesame Plug-in for playing and recording audio files with low latency on ... 2K
A versatile local LLM framework for chat, image, video, machine learning modelli... 2K
A Python library for Real-time Music Alignment 1K
Whisper for your microphone 1K
Transcribe long audio files with ASR or use the streaming interface 1K
Official Python SDK for Lokutor - Real-time AI voice and TTS 1K
Connect to the Unitree Go2 and G1 with WebRTC 1K
GitBase is a custom database system built with Python and powered by GitHub, tre... 1K
A terminal-based random chord progression generator 1K
1K
voice-presentation-control is a tool that allows you to control your presentatio... 1K
Command-line tool for learning foreign languages through gradual exposure to new... 1K
A seamless voice dictation system for Linux 1K
A Scrcpy (V3.3) client implemented in Python. Gui with Kivy/KivyMD. With Video, ... 1K
Library for Unihiker M10 1K
Modern Speech Recognition with both active and ambient listening and keyboard 1K