Tracking AI existential risk. Auto-aggregated headlines. Human-curated analysis.
AGGREGATING 47 SOURCES · UPDATED LIVE
Zac Boring 2 days ago Industry
Something’s off with Midjourney’s pivot to body scanners
via The Verge AI [4] — Last week, Midjourney, an AI startup best known for its image generator, made an unusual pivot: medical imaging. The company announced a futuristic ultrasound scanner that would dunk users into a vat of water and, hopefully, produce "something as powerful…
Zac Boring 2 days ago Analysis
Monthly Roundup #43: June 2026
via Substack Zvi [999] — Your monthly hit of all the things that are fit to print without a better place to live.
Zac Boring 3 days ago Research
LLM-Driven Feature Discovery
via Alignment Forum [999] — We would often like to get a qualitative sense of a target model’s behaviors in important distributions (e.g. deployment, RL training, or evals). For example, we might want to discover novel behaviors, figure out what causes some target behavior to…
Zac Boring 3 days ago Analysis
The AI Industrial Explosion — Part 4: Cheap power
via LessWrong AI [4] — In Parts 1, 2, and 3 we estimated how fast a post-AGI economy could grow using existing or historically observed production techniques, grounded in US input-output data. That approach gave us confidence that the methods we assumed were physically…
Zac Boring 3 days ago Analysis
GLM-5.2 Is The New Best Open Model
via Substack Zvi [999] — GLM-5.2 arrived last week.
Zac Boring 3 days ago Analysis
A brief list of ways AI safety efforts could be net negative
via LessWrong AI [5] — Here’s Holden Karnofsky: I tend to think it’s worse than 51/49. I tend to think we’re always going to be prone to overestimate how robustly good our actions are. And the more we learn about all the galaxy-brained considerations that one should have had in…
Zac Boring 4 days ago Analysis
The Invisible Side of AI Governance
via LessWrong AI [3] — Tldr: Most strategic writing on AI governance on LessWrong describes the outsider game, which is most often visible: press, statements, open letters. Here I want to describe the other, invisible half: the insider work within ministerial cabinets and…
Zac Boring 5 days ago Research
[Linkpost] How Transparent Is DiffusionGemma (and why it matters)
via Alignment Forum [999] — Authors: Joshua Engels*, Callum McDougall*, Bilal Chughtai*, Janos Kramar, Senthoran Rajamanoharan, Cindy Wu, Arthur Conmy, Asic Q Chen, Jean Tarbouriech, Min Ma, Brendan O'Donoghue+, João Gabriel Lopes de Oliveira+, Rohin Shah+, Neel Nanda+ *Primary…
Zac Boring 5 days ago Analysis
AI Safety Ecosystem Research notes
via LessWrong AI [5] — These are some personal notes taken and later dressed up a bit to make into a post. Dunno how much value is here for people already familiar with the AI Safety Ecosystem. Over several weeks in the spring of 2026 I attempted to map out the entire AI Safety…
Zac Boring 5 days ago Analysis
Introduction: Gaussian Natural Latents
via LessWrong AI [4] — Short introductory post for my research direction: Gaussian Natural Latents. I explain the motivation and give a preview of the forthcoming results. The Natural Abstractions agenda, in my view, is a promising research program that asks important theoretical…
Zac Boring 6 days ago Analysis
Claude Fable 5 and Mythos 5: Capabilities
via Substack Zvi [999] — Only three days after the release of Claude Fable 5, Anthropic was forced by the United States Government to make it unavailable, when a jailbreak was brought to its attention, rather than the previous situation of ‘yes obviously experts can jailbreak…
Zac Boring 6 days ago Analysis
On “Model Organisms”
via LessWrong AI [5] — This post was written while working for Arcadia Impact's Alignment Team (and grew out of an internal talk I gave) but is my own opinion and not theirs. I am grateful for feedback from Daniel Tan and the rest of the team. This post was originally going to be…
Zac Boring 7 days ago Research
GDM AI Control Roadmap
via Alignment Forum [999] — GDM has published an AI Control Roadmap! From the executive summary: We present the GDM AI Control Roadmap (v0.1) – our plan for implementing and adopting internal guardrails designed to catch potential adversarial behaviour by AI agents, even as they…
Zac Boring 7 days ago Analysis
Your Model Organisms Might Be Fried
via LessWrong AI [7] — Context: We are the ‘model motivations’ team at Arcadia Alignment. We aim to build a science of ‘model intentions’, unifying insights from personas and other empirical evidence. In this post, we’ll outline the need for much better model organisms and how…
Zac Boring 7 days ago Analysis
Effective Altruism will be unbundled
via LessWrong AI [5] — From the end of high school to after my sophomore year of college, I considered myself an effective altruist. I was on the board of my college EA club, ran an EA intro fellowship, and went to EA retreats. I was vegetarian, regularly donated to GiveWell,…
Zac Boring 7 days ago Analysis
AI #173: AI Pauses
via Substack Zvi [999] — A lot of things are always happening.
Zac Boring 7 days ago Industry
Adobe’s redesigned AI studio remembers what your creations look like
via The Verge AI [4] — Adobe is introducing some new capabilities for its Firefly AI assistant, alongside a "reimagined" AI studio that lets you edit and generate new designs from a single interface. The new Firefly experience launching today in private beta is designed to give…
Zac Boring 8 days ago Research
AI Companies Could Become More Powerful Than Their Host Nations
via MIRI [999] — View the official memo here. AI companies are on track to produce capabilities that eclipse the power of the United States and other host nations. This is likely to happen in the next few years, and very likely to happen in the next decade. Leading AI labs…
Zac Boring 8 days ago Analysis
Several frontier models are substantially prefill aware
via LessWrong AI [3] — This blog post discusses work in a recently-published paper. However, this blogpost was primarily written by Parv Mahajan and Andy Wang, and several of the more speculative takes may not represent the all-things-considered view of the entire team. Link to…
Zac Boring 8 days ago Analysis
Alignment pretraining could backfire
via LessWrong AI [3] — There has been recent interest in generating synthetic documents to upsample examples of aligned AI during LLM pretraining. See, for instance, Geodesic's Alignment Pretraining paper or Anthropic's "Teaching Claude Why." I worry that this strategy can work…