pDoom (Page 5)

Latest Headlines Auto-Updated

17 days ago Research

Constructive Alignment: Governing Preference Dynamics in Human-AI Interaction

via ArXiv cs.AI [5] — Most approaches to AI alignment treat human preferences as fixed targets to be inferred and optimized. This assumption conflicts with extensive empirical evidence showing that preferences are layered, dynamic, and constructed through…

17 days ago Analysis

Conversations With Cade Metz on the Rationalists

via LessWrong AI [4] — (Previously, previously.) New York Times reporter Cade Metz has been writing a book about the people who believed in AGI before it was cool. That's a subject that I think I know some things about, so we had some on-the-record conversations in 2025, which…

17 days ago Analysis Essential

Claude Sonnet 5 Is Not Frontier But Has Its Uses

via Substack Zvi [999] — Fable 5 is back today, baby! Premium subscribers have one week to use it within their subscriptions. First hit’s free. Then you pay by the token.

18 days ago Analysis

Model access for third-parties — it's a big deal!

via LessWrong AI [5] — Over time, there might be an increasingly large gap between insider model access and outsider model access. By insiders, I mean employees at the frontier lab.[1] By "outsiders", I mean external safety researchers, third-party auditors, and other actors…

18 days ago Research

What Drives Interactive Improvement from Feedback?

via ArXiv cs.AI [4] — We study when natural-language feedback produces improvement beyond the gains obtainable from repeated attempts alone. In multi-turn language agent setting, higher final accuracy can reflect useful feedback, but it can also arise from resampling, format…

18 days ago Analysis

Structural Proxies

via LessWrong AI [4] — Lately I've been thinking a lot about what work would help with actually winning and getting to good worlds. In the spirit of that I decided to venture outside my normal wheelhouse and spend some time reflecting on what technical research could make me…

19 days ago Analysis Essential

The Once And Future Fable #5

via Substack Zvi [999] — We, or at least ‘more than 100 American institutions,’ got Mythos back this week.

19 days ago Research Essential

MIRI Newsletter #126

via MIRI [999] — Announcing: AI StopWatch. In our last update, we mentioned we had something new in the works: a dedicated channel for news and analysis about AI. An experiment from the writers and analysts at MIRI, AI StopWatch posts news and commentary…

19 days ago Analysis

In partial defence of p(doom)

via LessWrong AI [7] — p(doom) is a shorthand for some important bits and a way to notice a disagreement to double-crux about. If you work on AI capabilities at a frontier AI company, I might ask you for your p(doom). If it's less than 1%, I know that you're probably not familiar…

19 days ago Industry

Roundtables: Longevity’s Next Frontier: “Reprogramming” Your Body

via MIT Technology Review [4] — Billions of dollars are flooding into efforts to reverse aging as scientists explore ways to return cells to a younger state. But how far off are these experimental treatments? Will they really work? Watch a…

19 days ago Research Essential

Summary: TGT’s 2026 ICML Papers

via MIRI [999] — The International Conference on Machine Learning (ICML), held annually for over forty years, is among the most influential conferences in modern AI research. This year in Seoul, ICML is hosting its second workshop on Technical AI Governance Research (TAIGR), and…

19 days ago Industry

The Download: AI “coworkers” and stratospheric internet

via MIT Technology Review [4] — This is today’s edition of The Download, our weekday newsletter that provides a daily dose of what’s going on in the world of technology. AI agents are not your “coworkers”: Imagine coming in to work to learn that a new underling will report to…

19 days ago Analysis

Human-Guided Agentic Research: A Research Agenda

via LessWrong AI [4] — tl;dr: As recursive self-improvement accelerates, we need a top-level agenda to research how to effectively keep humans in the loop. We need to study how humans can best interpret and guide research performed by autonomous agents when those agents lack…

19 days ago Industry

Building tech in the world’s secret R&D hub

via MIT Technology Review [4] — Apple. Anthropic. Disney Research. Google. Meta. Microsoft. NVIDIA. OpenAI. Few places outside Silicon Valley can claim R&D hubs from all of these companies. Fewer still are concentrated in a city of just over 400,000 people—roughly half the size…

19 days ago Research

The Two Genie Game: Adoption and Welfare in Audit-Grounded AI Governance

via ArXiv cs.AI [6] — We ask under what conditions an agent with a harm-minimizing policy can displace an approval-seeking (RLHF) agent in a competitive market, and when that policy is sufficient to prevent community harm. We use evolutionary game theory (finite-population…

19 days ago Research

IMCBench: A benchmark for multimodal LLMs in Image-grounded Medical Conversations

via ArXiv cs.AI [6] — Recent advances in large language models and vision-language models have enabled reasoning over multimodal data, offering opportunities for clinical applications such as decision support and triaging. However, existing medical AI benchmarks are fragmented:…

20 days ago Analysis

AI will make biological extinction risks worse before it makes them better

via LessWrong AI [4] — An argument goes: If we don't build aligned artificial superintelligence, we risk driving ourselves extinct for some other reason. We should rush to build ASI quickly, in spite of the risks—the longer we wait, the more vulnerable we are to extinction from…

20 days ago Industry

AI agents are not your “coworkers”

via MIT Technology Review [4] — This story originally appeared in The Algorithm, our weekly newsletter on AI. Imagine coming in to work to learn that a new underling will report to you. The worker is not a person but an…

20 days ago Analysis

$1M AI x-risk grant round is live on grantmaking.ai - apply for funding, review applicants, or fund projects

via LessWrong AI [5] — TLDR: what is the grant round? grantmaking.ai is launching a $1M grant round, distributing $5k to $50k per successful application to people and projects working to reduce x-risk from AI. Applications will be reviewed by Gavin Leech, Ryan Kidd, and Marcus…

20 days ago Analysis Essential

P(doom) is a Dumb Meme

via LessWrong AI [10] — Look, I'm as much of a Rationalist with a special interest in AI x-risk as anyone. But oh my god do I hate talking about "P(doom)". When it first started showing up in the wake of ChatGPT, I assumed that it was floating around variously adjacent circles…

Recent Voices

We are creating something that will be more powerful than us. I don't know a good precedent for a less intelligent thing managing a more intelligent thing.

— Geoffrey Hinton, Nobel Prize Lecture, Dec 2024

If you're not worried about AI safety, you're not paying attention.

— Sen. Blumenthal, Senate AI Hearing, 2024

The probability of doom is high enough that we should be working very hard to reduce it.

— Yoshua Bengio, MILA Talk, 2024