Catch silent agent failures in CI. Pytest plugin with regression detection for LLM agents.
Multi-agent failure detection for production AI systems
Single-master group membership framework with failure detection, useful for stateless partitioning