Two modes, one binary — a fleet orchestrator that clones repos and ships PRs via Claude Code, OR a drop-in OpenAI-compatible chat completion endpoint for customized Claude agents with real-time token-by-token thinking + tool-call streaming. LibreChat/OpenWebUI/LiteLLM-ready. Helm + KEDA + OTEL.