Speaker diarization for Python — "who spoke when?" CPU-only, no API keys, Apache 2.0. ~10.8% DER on VoxConverse, 8x faster than real-time.
Python package to trim RTTM diarization files and optionally audio files to a user-specified time range.