Zac Boring - pDoom (Page 8)

Zac Boring a month ago Analysis

Out-of-Context Reasoning (OOCR) in LLMs: A Short Primer and Reading List

via LessWrong AI [6] — Out-of-context reasoning (OOCR) is a concept relevant to LLM generalization and AI alignment. Also available as a PDF. What is out-of-context reasoning for LLMs? It's when an LLM reaches a conclusion that…

Zac Boring a month ago Analysis

Learned Chain-of-Thought Obfuscation Generalises to Unseen Tasks

via LessWrong AI [3] — TL;DR: Training against a CoT or summary-only monitor can lead to obfuscation of dangerous reasoning in unseen tasks. This strengthens the “don’t train against a monitor” claims. Figure 1. (A) Two prior results: penalising the CoT or final response produces…

Zac Boring a month ago Analysis

Gemini 3.5 Flash Looks Good For How Fast It Is

via Substack Zvi [999] — Google once again has a model worth at least some consideration.

Zac Boring a month ago Research

The Erdős Proof and AI Capabilities

via MIRI [999] — View the official memo here. An internal model at OpenAI has autonomously disproved a central conjecture in discrete geometry, a mathematical field with applications in cryptography, wireless device communication, and medical imaging. The proof relates to a…

Zac Boring a month ago Industry

Why would you disrespect your favorite artist with an AI remix?

via The Verge AI [4] — AI covers and remixes of songs are already a blight on the internet. Spotify, YouTube, TikTok, and Instagram are awash in flat reggae versions of "Smells Like Teen Spirit," dinky country renditions of The Weeknd, and monotonous Motown reimaginings of…

Zac Boring a month ago Industry

OpenAI named a Leader in enterprise coding agents by Gartner

via OpenAI Blog [6] — OpenAI is named a leader in the 2026 Gartner Magic Quadrant for Enterprise AI Coding Agents, with Codex recognized for innovation and enterprise-scale deployment.

Zac Boring a month ago Analysis

Do AI Risks Require Extraordinary Government Intervention?

via AI Snake Oil [7] — Let’s not skip the hard work of AI governance

Zac Boring a month ago Analysis

AI #169: New Knowledge

via Substack Zvi [999] — Even in a relatively quiet period, AI is out there creating new knowledge.

Zac Boring a month ago Research

The Case for Evaluating Model Behaviors

via Alignment Forum [999] — Most evaluations of AI systems focus on their capabilities: how good they are at coding tasks, how effectively they can answer complex scientific questions, and so on. From a safety perspective, capability evaluations have a place: by understanding how…

Zac Boring a month ago Industry

You can now remix other people’s YouTube Shorts with AI

via The Verge AI [4] — Google announced a new YouTube Shorts Remix feature that lets users restyle clips or even insert themselves into other people's videos using Gemini Omni. Now, at the bottom of a YouTube Short, when you click the remix icon, you'll see an option to…

Zac Boring a month ago Industry

Demis Hassabis said this might be the ‘foothills of the singularity.’ What?

via The Verge AI [6] — Welcome to a "profound moment for humanity," according to Google DeepMind CEO Demis Hassabis, who closed out Google I/O's keynote presentation on Tuesday, saying: Google's cutting-edge research and products will help unlock AGI's incredible potential for…

Zac Boring a month ago Analysis

Childhood And Education #19: Letting Kids Be Kids #2

via Substack Zvi [999] — I cannot emphasize enough the need to let kids be kids.

Zac Boring a month ago Industry

Google Search is getting its biggest changes ever

via The Verge AI [4] — Google Search is entering the next phase of its AI evolution. During Google I/O 2026, the company showed off a reimagined search box that makes it easier to flow between AI Overviews, the AI-generated summaries that appear at the top of search results, and…

Zac Boring a month ago Research

AgentWall: A Runtime Safety Layer for Local AI Agents

via ArXiv cs.AI [8] — The safety of autonomous AI agents is increasingly recognized as a critical open problem. As agents transition from passive text generators to active actors capable of executing shell commands, modifying files, calling APIs, and browsing the web, the…

Zac Boring a month ago Analysis

Thoughts on interviewing candidates for AI safety fellowships

via LessWrong AI [5] — Around July last year I decided I was going to go all in on technical AI safety research. To do that I’d need to get into an AI safety fellowship, quit my job, and sell everything that was in my flat in South Africa (hopefully in that order). I applied to…

Zac Boring a month ago Analysis

Classifier Context Rot: Monitor Performance Degrades with Context Length

via LessWrong AI [3] — Monitoring coding agents for dangerous behavior using language models requires classifying transcripts that often exceed 500K tokens, but prior agent monitoring benchmarks rarely contain transcripts longer than 100K tokens. We show that when used as…

Zac Boring a month ago Analysis

Dating Roundup #12: Sex and Violence

via Substack Zvi [999] — No more burying the sex stuff under an avalanche of other stuff so no one notices.

Zac Boring a month ago Research

Fast-tracking genetic leads to reverse cellular aging

via DeepMind Blog [4] — Biologists use Co-Scientist to find novel factors that successfully rejuvenate human cells.

Zac Boring a month ago Research

Verifiable Agentic Infrastructure: Proof-Derived Authorization for Sovereign AI Systems

via ArXiv cs.AI [3] — Modern cloud and enterprise systems rely on identity-centric authorization, assuming that callers possessing valid credentials are safe to execute commands. The emergence of autonomous AI agents invalidates this assumption: agents can generate syntactically…

Zac Boring a month ago Research

SDOF: Taming the Alignment Tax in Multi-Agent Orchestration with State-Constrained Dispatch

via ArXiv cs.AI [3] — Multi-agent orchestration frameworks such as LangChain, LangGraph, and CrewAI route tasks through graph-based pipelines but do not enforce the stage constraints that govern real business processes. We present SDOF, a framework that treats multi-agent…