701 dependents
| Package | Description | Downloads/month |
|---|---|---|
| A framework for building realtime voice AI agents 🤖🎙️📹 | 3.4M | |
| Minimal CLI coding agent by Mistral | 3.2M | |
| Cross-platform, customizable ML solutions for live and streaming media. | 3.1M | |
| aider is AI pair programming in your terminal | 864K | |
| Build and run agents you can see, understand and trust. | 202K | |
| A modular Python synthesizer and sequencer | 195K | |
| A library containing components related to model inferences in Gen AI applicatio... | 167K | |
| A small speech recognizer | 113K | |
| Sprite AI is an AI companion for your desktop | 83K | |
| A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS) library ... | 78K | |
| An immersion toolkit for learning Languages through games and other visual media... | 50K | |
| Whisper command line client compatible with original OpenAI client based on CTra... | 38K | |
| TFLite Support is a toolkit that helps users to develop ML and deploy TFLite mod... | 30K | |
| Moshi is moshi | 28K | |
| Natural (2-way) voice conversations with Claude Code | 26K | |
| Very low latency speech to text, intent recognition, and text to speech, for bui... | 25K | |
| Not1MM != N1MM, An amateur radio contest logger for Linux. | 25K | |
| A terminal frontend for gambatte game boy color emulator | 23K | |
| AgentMake AI: a kit for developing agentic AI applications that support 24 AI ba... | 22K | |
| Native Linux port of Thermalright LCD Control Center (TRCC 2.1.2). Themes, video... | 20K | |
| Open-source, AI-enhanced CAT tool with multi-LLM support, translation memory, gl... | 18K | |
| Minimax MCP Server | 17K | |
| Cross-platform, customizable ML solutions for live and streaming media. | 17K | |
| Chat application with multi-agents system supports multi-models and MCP | 16K | |
| ElevenLabs MCP Server | 16K | |
| Synchronized audio player for Sendspin servers | 15K | |
| A patching tool to remove the numpy<2 constraint from official mediapipe wheels. | 12K | |
| Software Project Infraestructure | 11K | |
| so-vits-svc fork with realtime support, improved interface and more features. | 11K | |
| Video SDK Agents | 11K | |
| Userful tools for linux life | 11K | |
| AI meeting assistant for macOS — auto-record, live transcription, Claude-powered... | 11K | |
| This is a auto-testing framework of audio functions for Android devices. | 10K | |
| Music Theory for Humans. | 9K | |
| cecli - a neat cli assistant | 9K | |
| Machine learning audio prediction experiments based on templates | 9K | |
| A modular signal analysis python library. | 8K | |
| Interface for OuteTTS models. | 7K | |
| Spatial Audio Python Package | 7K | |
| Some utilities that build on python-sounddevice | 7K | |
| A network based light effect controller | 7K | |
| FreeGenius AI, an advanced AI assistant that is capable of engaging in conversat... | 7K | |
| A CLI text-to-speech tool using the Kokoro model, supporting multiple languages,... | 6K | |
| Implementation of HS-TasNet, "Real-time Low-latency Music Source Separation usin... | 6K | |
| Data format and GUI for working with multi-modal behavioral data | 6K | |
| Ghost background AI assistant for live code challenges | 6K | |
| Python library for controlling Icom transceivers over LAN (UDP) — no wfview/haml... | 6K | |
| The all-in-one voice SDK | 6K | |
| An open-source evaluation framework for voice agents | 6K | |
| A python package to build AI-powered real-time audio applications | 6K |