Very fast, accurate speaker diarization
Real-time speech-to-text clipboard tool with Silero VAD and local ASR support
Real-time speech-to-text translation over WebSocket. Streams Opus or raw PCM audio from client to server for live transcription and optional translation. Supports CLI and Python API.