Analysis - pDoom (Page 5)

Zac Boring 2 months ago Analysis

By Strong Default, ASI Will End Liberal Democracy

via LessWrong AI [4] — The existence of liberal democracy—with rule of law, constraints on government power, and enfranchised citizens—relies on a balance of power where individual bad actors can't do too much damage. Artificial superintelligence (ASI), even if it's aligned,…

Zac Boring 2 months ago Analysis

Housing Roundup #13: More Dakka

via Substack Zvi [999] — Build more housing where people want to live.

Zac Boring 2 months ago Analysis

Ten different ways of thinking about Gradual Disempowerment

via LessWrong AI [7] — About a year ago, we wrote a paper that coined the term “Gradual Disempowerment.”It proved to be a great success, which is terrific. A friend and colleague told me that it was the most discussed paper at DeepMind last year (selection bias, grain of salt,…

Zac Boring 2 months ago Analysis

Steering Might Stop Working Soon

via LessWrong AI [5] — Steering LLMs with single-vector methods might break down soon, and by soon I mean soon enough that if you're working on steering, you should start planning for it failing now.This is particularly important for things like steering as a mitigation against…

Zac Boring 2 months ago Analysis

Sadly, The Whispering Earring

via LessWrong AI [4] — The Whispering Earring (which you should read first) explores one of the most dystopic-utopic scenarios. Imagine you could achieve all you've ever wanted by just giving up your agency. While theoretically this seems rather undesirable, in practice you get…

Zac Boring 2 months ago Analysis

Anthropic Responsible Scaling Policy v3: Dive Into The Details

via Substack Zvi [999] — Wednesday’s post talked about the implications of Anthropic changing from v2.2 to v3.0 of its RSP, including that this broke promises that many people relied upon when making important decisions.

Zac Boring 2 months ago Analysis

Systematically dismantle the AI compute supply chain.

via LessWrong AI [9] — This is not an April fool’s joke, I’m participating in Inkhaven, which means I need to write a blog post every day.I recently watched The AI Doc. It’s the first big documentary featuring AI safety. It’s playing in theatres across America. It’s got a bunch…

Zac Boring 2 months ago Analysis

AI #162: Visions of Mythos

via Substack Zvi [999] — Anthropic had some problem with leaks this week.

Zac Boring 2 months ago Analysis

Anthropic's Pause is the Most Expensive Alarm in Corporate History

via LessWrong AI [6] — Imagine Apple halting iPhone production because studies linked smartphones to teen suicide rates. Imagine Pfizer proactively pulling Lipitor because of internal studies showing increased cardiac risk, and not because of looming settlements or FDA…

Zac Boring 2 months ago Analysis

Introducing LIMBO: Maintaining Optimal P(DOOM) (and a call for funding)

via LessWrong AI [12] — We are excited to publicly introduce the Laboratory for Importance-sampled Measure and Bayesian Observation (LIMBO), a small research group working at the intersection of cosmological theory, probability, and existential risk. We believe that the…

Zac Boring 2 months ago Analysis

Anthropic Responsible Scaling Policy v3: A Matter of Trust

via Substack Zvi [999] — Anthropic has revised its Responsible Scaling Policy to v3.

Zac Boring 2 months ago Analysis

Product Alignment is not Superintelligence Alignment (and we need the latter to survive)

via LessWrong AI [9] — tl;dr: progress on making Claude friendly[1] is not the same as progress on making it safe to build godlike superintelligence. solving the former does not imply we get a good future.[2] please track the difference.The term 'Alignment' was coined[3] to…

Zac Boring 2 months ago Analysis

Co-Found Lens Academy With Me. (We have early users and funding)

via LessWrong AI [9] — tl;dr. Lens Academy is creating scalable superingelligence x-risk education with several USPs. Current team: Luc (full time founder, technical generalist) and several part time contributors. We have users and funding. Looking for a cofounder who's either a…

Zac Boring 2 months ago Analysis

Slack in Cells, Slack in Brains

via LessWrong AI [4] — [A veridically metaphorical explanation of why you shouldn't naïvely cram your life with local optimizations (even for noble or altruistic reasons).]TL;DR: You need Slack to be an effective agent. Slack is fragile, and it is tempting to myopically…

Zac Boring 2 months ago Analysis

Movie Review: The AI Doc

via Substack Zvi [999] — The AI Doc: Or How I Became an Apocaloptimist is a brilliant piece of work.

Zac Boring 2 months ago Analysis

The state of AI safety in four fake graphs

via LessWrong AI [5] — Here is a quick overview of my intuitions on where we are with AI safety in early 2026:So far, we continue to see exponential improvements in capabilities. This is most visible in the famous “METR graph”, but the trend is clear in many other metrics,…

Zac Boring 2 months ago Analysis

AI #161 Part 2: Every Debate on AI

via Substack Zvi [999] — AI discorce.

Zac Boring 2 months ago Analysis

(Some) Natural Emergent Misalignment from Reward Hacking in Non-Production RL

via LessWrong AI [5] — Authors: Satvik Golechha*, Sid Black*, Joseph Bloom* Equal Contribution.This work was done as part of the Model Transparency team at the UK AI Security Institute (AISI).Executive SummaryIn Natural Emergent Misalignment from Reward Hacking in Production RL…

Zac Boring 2 months ago Analysis

Nick Bostrom: How big is the cosmic endowment?

via LessWrong AI [4] — Superintelligence, pp. 122–3. 2014.Consider a technologically mature civilization capable of building sophisticated von Neumann probes of the kind discussed in the text. If these can travel at 50% of the speed of light, they can reach some stars before the…

Zac Boring 2 months ago Analysis

What if superintelligence is just weak?

via LessWrong AI [4] — In response to “2023 Or, Why I am Not a Doomer” by Dean W. Ball.Dean Ball is a pretty big voice in AI policy – over 19k subscribers on his newsletter, and a former Senior Policy Advisor for AI at the Trump White House – so why does he disagree that AI…