AI Evaluation Tools Python Packages

eval-ai-library

Comprehensive AI Model Evaluation Framework with support for multiple LLM providers

6K 33 3

agentneo

Python SDK for Agent AI Observability, Monitoring and Evaluation Framework. Features include agent, LLM, and tool tracing, debugging of multi-agent systems, a self-hosted dashboard, and advanced analytics with timeline and execution graph views.

2K 16K 4K

evalstats

Statistical analysis methods for comparing prompt and model performance in LLM evaluations.

1K 101 2

promptstats

Statistical analysis methods for comparing prompt and model performance in LLM evaluations.

1K 101 2
