PyPI Stats
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About
Home

Eval Python Packages

Python packages with the GitHub topic eval. Sorted by relevance, with stars and monthly downloads.
lmfit
asteval

minimalistic evaluator of python expression using ast module

5.3M 218 49
yaroslaff
evalidate

Safe and fast evaluation of untrusted user-supplied python expressions

49K 40 4
swival
swival

A small, powerful, open-source CLI coding agent that works with open models.

11K 130 12
yiouli
pixie-qa

Agent skill for AI agent development

9K 6 0
kiwi0fruit
litereval

Wrapper around ast.literal_eval with additional {foo='bar', key=None} dict syntax. + Deep merge two dictionaries.

3K 1 1
abundant-ai
oddish

Run Harbor tasks in the cloud with scheduling, monitoring, and persistent state

3K 7 1
ai-twinkle
twinkle-eval

High-performance LLM evaluation framework with parallel API calls — up to 17× faster than sequential tools. Supports box, math, and logit-based evaluation.

1K 96 16
camgitt
proofagent

pytest for AI agents — test safety, accuracy, tool use, and cost. No YAML, no telemetry, just Python.

1K 0 0
gmitt98
fieldtest

LLM evaluation framework — define what correct, well-formed, and safe means before you measure

950 0 0
gggh2
coii-sdk

agents team

559 0 0
infinitode
codesafe

An open-source Python library for code encryption, decryption, and safe evaluation using Python's built-in AST module, complete with allowed functions, variables, built-in imports, timeouts, and blocked access to attributes.

496 2 1
ZackeryRSmith
cval

A layer of protection for python's eval

484 2 0
Freed-Wu
sphinxcontrib-eval

Evaluate shell command or python code in sphinx and myst

341 0 1
tensorstax
agenttrace

AgentTrace is a lightweight observability library to trace and evaluate agentic systems.

288 57 1
AIAnytime
rag-evaluator

A library for evaluating Retrieval-Augmented Generation (RAG) systems (The traditional ways).

153 43 19
10fra
promptmin

Prompt minimizer for LLM evals — shrink prompts to minimal failing inputs using delta debugging

141 0 0
ssbuild
aigc-evals

aigc_evals

134 10 0
lemonyte
safe-exec

Deobfuscate and inspect code passed into exec() and eval()

90 4 0
fullzer4
flashvm

Eval untrusted code in any language. No containers, no VMs, no setup. Just Linux.

63 17 0
ekcbw
no-subclasses

A library that removes the __subclasses__() list from all classes.一个清除所有类的__subclasses__()列表的库。

38 23 4
    • Data from PyPI, GitHub, ClickHouse, and BigQuery