Faithfulness Python Packages

rag-benchmarking

RAG Benchmarking — Framework-agnostic RAG/agentic-AI evaluation harness. Faithfulness, agentic metrics, EU AI Act Article 15 accuracy evidence. Apache 2.0.

503 0 0

faithscore

FaithScore: Fine-grained Evaluations of Hallucinations in Large Vision-Language Models

78 33 7

Search Packages