Who evaluates the evaluator? Judicator audits LLM-as-a-Judge systems for 7 documented bias types. Zero config. Works with any LLM.
Local Ollama experiment pipeline for source-selection behavior analysis