pDoom (Page 11)

DOOM LEVEL -- %

Latest Headlines Auto-Updated

2 months ago Research

MediHive: A Decentralized Agent Collective for Medical Reasoning

via ArXiv cs.AI [6] — Large language models (LLMs) have revolutionized medical reasoning tasks, yet single-agent systems often falter on complex, interdisciplinary problems requiring robust handling of uncertainty and conflicting evidence. Multi-agent systems (MAS) leveraging…

2 months ago Analysis

The state of AI safety in four fake graphs

via LessWrong AI [5] — Here is a quick overview of my intuitions on where we are with AI safety in early 2026:So far, we continue to see exponential improvements in capabilities. This is most visible in the famous “METR graph”, but the trend is clear in many other metrics,…

2 months ago Analysis Essential

AI #161 Part 2: Every Debate on AI

via Substack Zvi [999] — AI discorce.

2 months ago Analysis

(Some) Natural Emergent Misalignment from Reward Hacking in Non-Production RL

via LessWrong AI [5] — Authors: Satvik Golechha*, Sid Black*, Joseph Bloom* Equal Contribution.This work was done as part of the Model Transparency team at the UK AI Security Institute (AISI).Executive SummaryIn Natural Emergent Misalignment from Reward Hacking in Production RL…

2 months ago Analysis

Nick Bostrom: How big is the cosmic endowment?

via LessWrong AI [4] — Superintelligence, pp. 122–3. 2014.Consider a technologically mature civilization capable of building sophisticated von Neumann probes of the kind discussed in the text. If these can travel at 50% of the speed of light, they can reach some stars before the…

2 months ago Analysis

What if superintelligence is just weak?

via LessWrong AI [4] — In response to “2023 Or, Why I am Not a Doomer” by Dean W. Ball.Dean Ball is a pretty big voice in AI policy – over 19k subscribers on his newsletter, and a former Senior Policy Advisor for AI at the Trump White House – so why does he disagree that AI…

2 months ago Research Essential

The AI Doc: Your Questions Answered

via MIRI [999] — So you’ve just seen The AI Doc, and you suddenly have questions, lots of them. The 104-minute documentary (currently in theaters) takes viewers on a fast-paced tour through the many dimensions of the AI problem, featuring interviews from a wide range of experts.…

2 months ago Analysis

AI's capability improvements haven't come from it getting less affordable

via LessWrong AI [3] — METR's frontier time horizons are doubling every few months, providing substantial evidence that AI will soon be able to automate many tasks or even jobs. But per-task inference costs have also risen sharply, and automation requires AI labor to be…

2 months ago Analysis

ControlAI 2025 Impact Report

via LessWrong AI [4] — This post highlights a few key excerpts from our full impact report. You can read the full report at https://controlai.com/impact-report-2025.ControlAI is a non-profit organization working to avert the extinction risks posed by superintelligence. We help…

2 months ago Analysis Essential

Anthropic vs. DoW #6: The Court Rules

via Substack Zvi [999] — Last night, Anthropic was given its preliminary injunction, with a stay of seven days.

2 months ago Research Essential

Trust as Monitoring: Evolutionary Dynamics of User Trust and AI Developer Behaviour

via ArXiv cs.AI [8] — AI safety is an increasingly urgent concern as the capabilities and adoption of AI systems grow. Existing evolutionary models of AI governance have primarily examined incentives for safe development and effective regulation, typically representing users'…

2 months ago Research

ARC-AGI-3: A New Challenge for Frontier Agentic Intelligence

via ArXiv cs.AI [4] — We introduce ARC-AGI-3, an interactive benchmark for studying agentic intelligence through novel, abstract, turn-based environments in which agents must explore, infer goals, build internal models of environment dynamics, and plan effective action sequences…

2 months ago Analysis

My hobby: running deranged surveys

via LessWrong AI [4] — In late 2024, I was on a long walk with some friends along the coast of the San Francisco Bay when the question arose of just how much of a bubble we live in. It’s well known that the Bay Area is a bubble, and that normal people don’t spend that much time…

2 months ago Analysis Essential

Sen. Sanders (I-VT) and Rep. Ocasio-Cortez (D-NY) propose AI Data Center Moratorium Act

via LessWrong AI [15] — The text of the bill can be found here. It begins by citing the warnings of AI company CEOs and deep learning pioneers Geoffrey Hinton and Yoshua Bengio, the 2023 FLI open letter calling for a 6-month pause, and the 2025 FLI statement on…

2 months ago Research Essential

Test your best methods on our hard CoT interp tasks

via Alignment Forum [999] — Authors: Daria Ivanova, Riya Tyagi, Arthur Conmy, Neel NandaDaria and Riya are co-first authors. This work was done during Neel Nanda’s MATS 9.0. Claude helped write code and suggest edits for this post.TL;DR One of our best safety techniques right…

2 months ago Analysis Essential

AI #161 Part 1: 80,000 Interviews

via Substack Zvi [999] — The major technical advances this week were in agentic coding, as covered yesterday.

2 months ago Research Essential

A Toy Environment For Exploring Reasoning About Reward

via Alignment Forum [999] — tldr: We share a toy environment that we found useful for understanding how reasoning changed over the course of capabilities-focused RL. Over the course of capabilities-focused RL, the model biases more strongly towards reward hints over direct…

2 months ago Analysis

$1 billion is not enough; OpenAI Foundation must start spending tens of billions each year

via LessWrong AI [6] — OpenAI is now a public benefit corporation, with a charter that demands they use AGI for the benefit of all, and do so safely. To justify this structure to the Attorneys General of Delaware and California, they split off the nonprofit OpenAI Foundation,…

2 months ago Analysis Essential

Claude Code, Cowork and Codex #6: Claude Code Auto Mode and Full Cowork Computer Use

via Substack Zvi [999] — Whatever else you think about Anthropic’s agentic coding department, they ship.

2 months ago Industry

Agentic commerce runs on truth and context

via MIT Technology Review [4] — Imagine telling a digital agent, “Use my points and book a family trip to Italy. Keep it within budget, pick hotels we’ve liked before, and handle the details.” Instead of returning a list of links, the agent assembles an itinerary and executes…

Live Doom Meter

-- %

0% — We're fine 100% — GG

P(Doom) Scoreboard

0%25%50%75%100%

Loading estimates...

Recent Voices

We are creating something that will be more powerful than us. I don't know a good precedent for a less intelligent thing managing a more intelligent thing.

— Geoffrey Hinton, Nobel Prize Lecture, Dec 2024

If you're not worried about AI safety, you're not paying attention.

— Sen. Blumenthal, Senate AI Hearing, 2024

The probability of doom is high enough that we should be working very hard to reduce it.

— Yoshua Bengio, MILA Talk, 2024