48 dependents
| Description | Downloads/month |
|---|---|
| Transcript Analysis for AI Agents | 560K |
| Software Engineering Agents for Inspect AI | 105K |
| Collection of evals for Inspect AI | 96K |
| Docent SDK | 47K |
| Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backend... | 22K |
| An Inspect extension for cyber evaluations | 15K |
| A Kubernetes Sandbox Environment for Inspect | 13K |
| ControlArena is a collection of settings, model organisms and protocols - for ru... | 10K |
| Agent evaluation toolkit | 6K |
| Provider-agnostic, open-source evaluation infrastructure for language models | 5K |
| SDK for the Vals AI Platform | 4K |
| An alignment auditing agent capable of quickly exploring alignment hypotheses | 3K |
| Inspect AI interface to Harbor tasks | 3K |
| Human Time-to-Completion Evaluation CLI | 3K |
| Collection of sandboxes for Inspect AI | 3K |
| Inspect Flow is a workflow stack built on Inspect AI that enables research organ... | 2K |
| A Python SDK for Asteroid | 1K |
| Framework for generating behavioral evaluations of frontier AI models. | 1K |
| | 1K |
| Integration between Inspect and Weights & Biases | 1K |
| Run OpenReward environments through the Inspect eval platform | 936 |
| Factorio Learning Environment | 917 |
| MLflow integration for Inspect AI: experiment tracking, execution tracing, and S... | 776 |
| Tools for systematic large language model evaluations | 724 |
| A Control Arena setting for evaluating agents inserting backdoors while refactor... | 721 |
| Pentester CLI | 661 |
| Podman sandbox environment for Inspect AI | 610 |
| Bench-AF: Alignment Faking Benchmark | 598 |
| Entropy Labs' Sentinel is an agent control plane that enables efficient oversigh... | 514 |
| MLflow integration for Inspect AI | 497 |
| | 445 |
| Opt-in lint for Inspect AI tasks: warn when @verifiable_task uses a model-graded... | 402 |
| Chain-of-thought monitorability and faithfulness evaluation for reasoning-model ... | 387 |
| Generalized Cognitive Refinement Iteration - Hierarchical Multi-Agent System wit... | 386 |
| Pluralistic alignment evaluation benchmark for LLMs | 275 |
| An utterly useless package that imports everything for you. Now with top 1000 PyP... | 247 |
| Policy-enforced sandbox environment extension for Inspect AI | 228 |
| Multi-agent system for AI evaluations in AISI's inspect-ai framework | 215 |
| Utility to connect to and use Fulcrum Research to understand your agents | 187 |
| A lightweight library for running AI Control experiments | 161 |
| Inspect-AI–based agents and tools | 151 |
| Human evaluation for Control Arena settings | 149 |
| Tool for configuring and running control_arena experiments using .yaml files | 131 |
| Gage support for Inspect AI | 130 |
| Python package to interface with the AI infrastructure over at ohmytofu.ai | 115 |
| | 90 |
| High-level, zero-code interface for evaluating LLMs using Inspect-AI | 71 |
| Use Goodfire Ember API with UK-AISI Inspect AI | 65 |