Research

GDM AI Control Roadmap

Zac Boring June 18, 2026 1 min read

GDM has published an AI Control Roadmap! From the executive summary:We present the GDM AI Control Roadmap (v0.1) – our plan for implementing and adopting internal guardrails designed to catch potential adversarial behaviour by AI agents, even as they become increasingly harder to oversee and contain.We focus on system-level mitigations that limit the harm a misaligned AI system could cause. Specifically, this report provides:• Threat modelling: Taking inspiration from cybersecurity, we adopt a c

By Mary Phuong

Read the full article at Alignment Forum →