PyPI Stats
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About
Home

Search Packages

Find Python packages by name, description, GitHub topic, or filter by metrics
Giskard-AI
giskard

🐢 Open-Source Evaluation & Testing library for LLM Agents

40K 5K 446
Oncoshot
llmvalidate

Oncoshot LLM validation framework

746 20 1
moonwatcher-ai
moonwatcher

No description available

70 16 2
Oncoshot
oncoshot-llm-validation-framework

A comprehensive Python framework for evaluating LLM-extracted structured data against ground truth labels. Supports binary classification, scalar values, and list fields with detailed performance metrics, confidence-based evaluation, and statistical uncertainty quantification via non-parametric bootstrap confidence intervals.

45 20 1
    • Data from PyPI, GitHub, ClickHouse, and BigQuery