Tracking AI existential risk. Auto-aggregated headlines. Human-curated analysis.
AGGREGATING 47 SOURCES · UPDATED LIVE
Research

GDM AI Control Roadmap

Zac Boring June 18, 2026 1 min read
Read original source →

GDM has published an AI Control Roadmap! From the executive summary:We present the GDM AI Control Roadmap (v0.1) – our plan for implementing and adopting internal guardrails designed to catch potential adversarial behaviour by AI agents, even as they become increasingly harder to oversee and contain.We focus on system-level mitigations that limit the harm a misaligned AI system could cause. Specifically, this report provides:• Threat modelling: Taking inspiration from cybersecurity, we adopt a c

By Mary Phuong

Read the full article at Alignment Forum →