PyPI Stats
  • Insights
  • PyPI
  • GitHub
  • Search
  • Compare
  • Advisories
  • Ecosystem
  • About
Home

Search Packages

Find Python packages by name, description, GitHub topic, or filter by metrics
Basaltlabs-app
gauntlet-cli

Behavioral reliability under pressure. Test how LLMs behave when things get hard.

10K 6 0
qualixar
agentassert-abc

Formal behavioral specification and runtime enforcement for autonomous AI agents. Agent Behavioral Contracts (ABC).

3K 3 0
stef41
modeldiffx

Model behavioral diffing - compare LLM outputs across versions, detect regressions.

1K 1 0
Swanand33
llm-behave

Behavioral testing for LLM applications. pytest plugin with semantic assertions, multi-turn conversation testing, and drift detection. No LLM judge needed.

586 1 0
chanikkyasaai
trajex

AI agent behavioral testing — learns what correct looks like, catches deviations automatically. Zero API keys needed.

366 0 0
SyncTek-LLC
specterqa

AI persona-based behavioral testing for web apps. No test scripts. YAML-configured. Vision-powered.

114 0 2
SyncTek-LLC
ghostqa

AI persona-based behavioral testing for web apps. No test scripts. YAML-configured. Vision-powered.

21 0 2
    • Data from PyPI, GitHub, ClickHouse, and BigQuery