Analysis - pDoom

Zac Boring 20 hours ago Analysis

Reasons to believe current AI models are conscious

via LessWrong AI [5] — There are a number of reasons to believe current AI models are conscious. I mean “conscious” is the sense of “is there something it is like to be an AI model?” and “does the AI model have phenomenal experience?”. As to what “AI models” refers to, the short…

Zac Boring a day ago Analysis

AI #177 Part 2: Wish You Were Here

via Substack Zvi [999] — As usual, part 2 of the weekly deals with speculative, regulatory, political and alignment questions.

Zac Boring 2 days ago Analysis

Help us launch AI safety university groups by referring potential founders

via LessWrong AI [5] — TL;DRUniversity groups are among the most reliable producers of AI safety talent, yet dozens of top schools that could sustain a group don't have one. We're launching the AI Safety Seeding Initiative (aisafetyseeding.org) in partnership with Kairos to…

Zac Boring 2 days ago Analysis

The State of AI Consciousness Research

via LessWrong AI [7] — I am agnostic on whether any current system is conscious; the claim is only that the question is researchable.This piece surveys the empirical research on AI consciousness. The premise of that research, and of the survey, is that the question does not have…

Zac Boring 2 days ago Analysis

AI #177 Part 1: Tip of the Iceberg

via Substack Zvi [999] — This week saw the releases of, among other things:

Zac Boring 3 days ago Analysis

Monthly Roundup #44: July 2026

via Substack Zvi [999] — It’s a quiet week so let’s do the monthly right on schedule.

Zac Boring 4 days ago Analysis

An analysis of AI-generated content at the Mechanistic Interpretability Workshop

via LessWrong AI [7] — IntroductionOver the past few years, AI tools have become useful for conducting technical AI research. In the early ChatGPT era (~2023–2024), chat assistants were maybe useful as sounding boards for research ideas, or as editors for polishing a paper…

Zac Boring 4 days ago Analysis

Twitter Thoughts For You

via Substack Zvi [999] — I previously have written back in March 2022 about how I use Twitter, and back in April 2023 about Twitter and its then-new algorithms, which have changed again.

Zac Boring 4 days ago Analysis

The Flood, by Anton Leicht

via LessWrong AI [5] — Anton covers AI policy angles in a singular fashion; every article he writes is worth reading, and this one is unusually topical to Manifund. As someone who went on the ground for Bores, I'd like to see us learn & update from past failures; and how could I…

Zac Boring 5 days ago Analysis

Better Call Sol The Workhorse

via Substack Zvi [999] — OpenAI’s GPT-5.6-Sol is finally here, along with the cheaper Terra and Luna.

Zac Boring 6 days ago Analysis

The US Government may find it difficult to seize control during takeoff

via LessWrong AI [4] — I'm not trying to advance any claims about whether this is good or bad, or what to do about it (if anything).I sometimes see concern about loss of most future value as a result of e.g. the US government[1] taking control of the future by seizing control of…

Zac Boring 7 days ago Analysis

The Human Substitution Test as a Sanity Check for AI Evaluations

via LessWrong AI [4] — TL;DR: We suggest a sanity check for proposed evaluation or AI oversight schemes: Imagine the AI was replaced by a competent, strategic human — someone who knows they might get evaluated and has their own agenda. Would the evaluation still work?When we…

Zac Boring 7 days ago Analysis

Introduction for and Reactions to Plan A

via Substack Zvi [999] — Introducing Plan A

Zac Boring 7 days ago Analysis

Freeing Thucydides

via LessWrong AI [4] — Prompted by discussion with Buck Shlegeris and others at the Forethought retreat. The idea that AI could bring an end to Thucydides traps is Buck’s. Speculative. I think it's plausible that we will not see sustained competition between actors for control…

Zac Boring 8 days ago Analysis

Plan A's problem with dry tinder

via LessWrong AI [4] — A group is worried about an approaching fire spreading rapidly through their city. They manage to halt the fire outside the city gates. Meanwhile they build massive physical structures to help them study and guide the fire safely. But these structures are…

Zac Boring 8 days ago Analysis

The easiest pathway to control is through executive power

via LessWrong AI [13] — When people in the AI safety community outline loss-of-control scenarios, they often spend a lot of time on relatively elaborate mechanisms — scheming AIs developing nanotech, labs leveraging superintelligence into hard power like drone armies, or…

Zac Boring 8 days ago Analysis

AI #176 Part 2: Plan B

via Substack Zvi [999] — This is part 2 of the weekly, broadly covering speculation, rhetoric and policy, along with alignment research.

Zac Boring 9 days ago Analysis

AI Safety Policy Needs to train Legal Practitioners

via LessWrong AI [5] — I completed my law degree at a working-class London university. In my first year, I was 18 years old, and I was often the youngest person in the room: almost everyone else was a paralegal, clerk or caseworker with years of live files behind them, studying…

Zac Boring 9 days ago Analysis

Debate with Self-Play Best-of-N Optimization

via LessWrong AI [4] — Context: This is the first research output from Arcadia Alignment’s scalable oversight team, carried out in collaboration with external researchers and mentors. We aim to do rigorous empirical work on debate - bridging the gap from theory to the alignment…

Zac Boring 9 days ago Analysis

AI 2040: Plan A

via LessWrong AI [4] — For the past year, we at the AI Futures Project have been sinking most of our time into our next big scenario. Now it’s done! It’s called AI 2040: Plan A.It’s called Plan A because it’s a recommendation, not a prediction. It’s what we think should happen,…