RAG Benchmarking — Framework-agnostic RAG/agentic-AI evaluation harness. Faithfulness, agentic metrics, EU AI Act Article 15 accuracy evidence. Apache 2.0.
FaithScore: Fine-grained Evaluations of Hallucinations in Large Vision-Language Models