Why AI Evaluation Regimes Are Bad
How the flagship project of the AI Safety community ended up helping AI corporations.

I care about preventing extinction risks from superintelligence. This de facto makes me part of the "AI Safety" community, a social cluster of people who care about these risks.

In the community, a few organisations are working on "Evaluations" (which I will shorten to Evals). The most notable examples are Apollo Research, METR, and the UK AISI.

Evals make for an influential cluster of safety work, wherein auditor
By PranavG