Tracking AI existential risk. Auto-aggregated headlines. Human-curated analysis.
AGGREGATING 47 SOURCES · UPDATED LIVE
Research
Zac Boring 15 days ago Research
ArchAgent: Agentic AI-driven Computer Architecture Discovery
via ArXiv cs.AI [4] — Agile hardware design flows are a critically needed force multiplier to meet the exploding demand for compute. Recently, agentic generative AI systems have demonstrated significant advances in algorithm design, improving code efficiency, and enabling d
Zac Boring 15 days ago Research
Agent Behavioral Contracts: Formal Specification and Runtime Enforcement for Reliable Autonomous AI Agents
via ArXiv cs.AI [3] — Traditional software relies on contracts -- APIs, type systems, assertions -- to specify and enforce correct behavior. AI agents, by contrast, operate on prompts and natural language instructions with no formal behavioral specification. This gap is the
Zac Boring 15 days ago Research
Why Did My Model Do That? Model Incrimination for Diagnosing LLM Misbehavior
via Alignment Forum [5] — Authors: Aditya Singh*, Gerson Kroiz*, Senthooran Rajamanoharan, Neel NandaAditya and Gerson are co-first authors. This work was conducted during MATS 9.0 and was advised by Senthooran Rajamanoharan and Neel Nanda.MotivationImagine that a frontier lab’s coding agent has been caught putting a bug in
Zac Boring 16 days ago Research
ARLArena: A Unified Framework for Stable Agentic Reinforcement Learning
via ArXiv cs.AI [4] — Agentic reinforcement learning (ARL) has rapidly gained attention as a promising paradigm for training agents to solve complex, multi-step interactive tasks. Despite encouraging early results, ARL remains highly unstable, often leading to training coll
Zac Boring 18 days ago Research
Task-Aware Exploration via a Predictive Bisimulation Metric
via ArXiv cs.AI — Accelerating exploration in visual reinforcement learning under sparse rewards remains challenging due to the substantial task-irrelevant variations. Despite advances in intrinsic exploration, many methods either assume access to low-dimensional states
Zac Boring 19 days ago Research
Alignment in Time: Peak-Aware Orchestration for Long-Horizon Agentic Systems
via ArXiv cs.AI [5] — Traditional AI alignment primarily focuses on individual model outputs; however, autonomous agents in long-horizon workflows require sustained reliability across entire interaction trajectories. We introduce APEMO (Affect-aware Peak-End Modulation for
Zac Boring 19 days ago Research
El Agente Gr\'afico: Structured Execution Graphs for Scientific Agents
via ArXiv cs.AI [6] — Large language models (LLMs) are increasingly used to automate scientific workflows, yet their integration with heterogeneous computational tools remains ad hoc and fragile. Current agentic approaches often rely on unstructured text to manage context a
Zac Boring 19 days ago Research
Ontology-Guided Neuro-Symbolic Inference: Grounding Language Models with Mathematical Domain Knowledge
via ArXiv cs.AI — Language models exhibit fundamental limitations -- hallucination, brittleness, and lack of formal grounding -- that are particularly problematic in high-stakes specialist fields requiring verifiable reasoning. I investigate whether formal domain ontolo
Zac Boring 21 days ago Research
How will we do SFT on models with opaque reasoning?
via Alignment Forum [1] — Current LLMs externalize lots of their reasoning in human interpretable language. This reasoning is sometimes unfaithful, sometimes strange and concerning, and LLMs can do somewhat impressive reasoning without using CoT, but my overall impression is that CoT currently is a…
Live Doom Meter
-- %
0% — We're fine 100% — GG
P(Doom) Scoreboard
0%25%50%75%100%
Loading estimates...