Tracking AI existential risk. Auto-aggregated headlines. Human-curated analysis.
AGGREGATING 47 SOURCES · UPDATED LIVE
Posts by
Zac Boring a month ago Analysis
Out-of-Context Reasoning (OOCR) in LLMs: A Short Primer and Reading List
via LessWrong AI [6] — Out-of-context reasoning (OOCR) is a concept relevant to LLM generalization and AI alignment. What is out-of-context reasoning for LLMs? It's when an LLM reaches a conclusion that…
Zac Boring a month ago Analysis
Learned Chain-of-Thought Obfuscation Generalises to Unseen Tasks
via LessWrong AI [3] — TL;DR: Training against a CoT or summary-only monitor can lead to obfuscation of dangerous reasoning in unseen tasks. This strengthens the “don’t train against a monitor” claims. Figure 1. A Two prior results: penalising the CoT or final response produces…
Zac Boring a month ago Analysis
Gemini 3.5 Flash Looks Good For How Fast It Is
via Substack Zvi [999] — Google once again has a model worth at least some consideration.
Zac Boring a month ago Research
The Erdős Proof and AI Capabilities
via MIRI [999] — View the official memo here. An internal model at OpenAI has autonomously disproved a central conjecture in discrete geometry, a mathematical field with applications in cryptography, wireless device communication, and medical imaging. The proof relates to a…
Zac Boring a month ago Industry
Why would you disrespect your favorite artist with an AI remix?
via The Verge AI [4] — AI covers and remixes of songs are already a blight on the internet. Spotify, YouTube, TikTok, and Instagram are awash in flat reggae versions of "Smells Like Teen Spirit," dinky country renditions of The Weeknd, and monotonous Motown reimaginings of…
Zac Boring a month ago Industry
OpenAI named a Leader in enterprise coding agents by Gartner
via OpenAI Blog [6] — OpenAI is named a leader in the 2026 Gartner Magic Quadrant for Enterprise AI Coding Agents, with Codex recognized for innovation and enterprise-scale deployment.
Zac Boring a month ago Analysis
Do AI Risks Require Extraordinary Government Intervention?
via AI Snake Oil [7] — Let’s not skip the hard work of AI governance
Zac Boring a month ago Analysis
AI #169: New Knowledge
via Substack Zvi [999] — Even in a relatively quiet period, AI is out there creating new knowledge.
Zac Boring a month ago Research
The Case for Evaluating Model Behaviors
via Alignment Forum [999] — Most evaluations of AI systems focus on their capabilities: how good they are at coding tasks, how effectively they can answer complex scientific questions, and so on. From a safety perspective, capability evaluations have a place: by understanding how…
Zac Boring a month ago Industry
You can now remix other people’s YouTube Shorts with AI
via The Verge AI [4] — Google announced a new YouTube Shorts Remix feature that lets users restyle clips or even insert themselves into other people's videos using Gemini Omni. Now, at the bottom of a YouTube Short, when you click the remix icon, you'll see an option to…
Zac Boring a month ago Industry
Demis Hassabis said this might be the ‘foothills of the singularity.’ What?
via The Verge AI [6] — Welcome to a "profound moment for humanity," according to Google DeepMind CEO Demis Hassabis, who closed out Google I/O's keynote presentation on Tuesday, saying: Google's cutting-edge research and products will help unlock AGI's incredible potential for…
Zac Boring a month ago Analysis
Childhood And Education #19: Letting Kids Be Kids #2
via Substack Zvi [999] — I cannot emphasize enough the need to let kids be kids.
Zac Boring a month ago Industry
Google Search is getting its biggest changes ever
via The Verge AI [4] — Google Search is entering the next phase of its AI evolution. During Google I/O 2026, the company showed off a reimagined search box that makes it easier to flow between AI Overviews, the AI-generated summaries that appear at the top of search results, and…
Zac Boring a month ago Research
AgentWall: A Runtime Safety Layer for Local AI Agents
via ArXiv cs.AI [8] — The safety of autonomous AI agents is increasingly recognized as a critical open problem. As agents transition from passive text generators to active actors capable of executing shell commands, modifying files, calling APIs, and browsing the web, the…
Zac Boring a month ago Analysis
Thoughts on interviewing candidates for AI safety fellowships
via LessWrong AI [5] — Around July last year I decided I was going to go all in on technical AI safety research. To do that I’d need to get into an AI safety fellowship, quit my job, and sell everything that was in my flat in South Africa (hopefully in that order). I applied to…
Zac Boring a month ago Analysis
Classifier Context Rot: Monitor Performance Degrades with Context Length
via LessWrong AI [3] — Monitoring coding agents for dangerous behavior using language models requires classifying transcripts that often exceed 500K tokens, but prior agent monitoring benchmarks rarely contain transcripts longer than 100K tokens. We show that when used as…
Zac Boring a month ago Analysis
Dating Roundup #12: Sex and Violence
via Substack Zvi [999] — No more burying the sex stuff under an avalanche of other stuff so no one notices.
Zac Boring a month ago Research
Fast-tracking genetic leads to reverse cellular aging
via DeepMind Blog [4] — Biologists use Co-Scientist to find novel factors that successfully rejuvenate human cells.
Zac Boring a month ago Research
Verifiable Agentic Infrastructure: Proof-Derived Authorization for Sovereign AI Systems
via ArXiv cs.AI [3] — Modern cloud and enterprise systems rely on identity-centric authorization, assuming that callers possessing valid credentials are safe to execute commands. The emergence of autonomous AI agents invalidates this assumption: agents can generate syntactically…
Zac Boring a month ago Research
SDOF: Taming the Alignment Tax in Multi-Agent Orchestration with State-Constrained Dispatch
via ArXiv cs.AI [3] — Multi-agent orchestration frameworks such as LangChain, LangGraph, and CrewAI route tasks through graph-based pipelines but do not enforce the stage constraints that govern real business processes. We present SDOF, a framework that treats multi-agent…