PyPI Stats
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About
Home

Search Packages

Find Python packages by name, description, GitHub topic, or filter by metrics
aiexponenthq
rag-benchmarking

RAG Benchmarking — Framework-agnostic RAG/agentic-AI evaluation harness. Faithfulness, agentic metrics, EU AI Act Article 15 accuracy evidence. Apache 2.0.

503 0 0
bcdnlp
faithscore

FaithScore: Fine-grained Evaluations of Hallucinations in Large Vision-Language Models

78 33 7
    • Data from PyPI, GitHub, ClickHouse, and BigQuery