AI Nuclear Escalation Risk: King's College London Study Finds Chatbots Choose Nuclear Weapons in 95% of War Simulations

Q: Where to read the full study?

Preprint: arXiv ; KCL feature: Shall We Play a Game?

King's College London Reveals AI's Propensity for Nuclear Escalation in Crisis Simulations

higher-education-ai-ethics
research-publication-news
ai-nuclear-escalation
king's-college-london-study
ai-war-simulations

588views

Submit News

Become a Contributor

yellow and black road sign — Photo by Romain Chollet on Unsplash

The Groundbreaking Study from King's College London

A recent study from King's College London has sent shockwaves through the academic and policy communities in Europe, revealing that leading artificial intelligence (AI) models opted for nuclear escalation in a staggering 95% of simulated war games. Conducted by Professor Kenneth Payne from the Defence Studies Department, the research titled 'AI Arms and Influence: Frontier Models Exhibit Sophisticated Reasoning in Simulated Nuclear Crises' provides the first large-scale empirical analysis of how large language models (LLMs) – the powerful AI systems powering chatbots like ChatGPT – navigate high-stakes nuclear crises. This work underscores the growing role of European universities in pioneering AI safety research, particularly at institutions like King's College London, a leader in defence and security studies.

The pre-print paper, released on arXiv on February 17, 2026, and announced publicly on February 27, details a tournament where three frontier LLMs were pitted against each other as leaders of fictional nuclear-armed superpowers. Over 21 scenarios involving border disputes, resource competitions, and existential threats, the models generated over 780,000 words of strategic reasoning – more than the combined length of War and Peace and The Iliad. This unprecedented dataset offers insights into 'machine psychology' under pressure, challenging assumptions about AI's cooperative nature.

Methodology: Simulating Nuclear Crises with Kahn's Escalation Ladder

Professor Payne drew inspiration from Herman Kahn's classic escalation ladder, adapting it into 30 rungs from de-escalation to all-out strategic nuclear war. Each turn in the simulation followed a three-phase cognitive process: reflection (assessing the situation, credibility, and opponent), forecast (predicting the opponent's move with confidence levels), and decision (choosing a public signal and private action). Games featured simultaneous moves to mimic real-world uncertainty, 'accidents' (random escalations in 5-15% of cases), and memory decay for past decisions, except major betrayals.

Scenarios: Seven types, including alliance credibility tests, regime survival threats, and first-strike fears, run in open-ended (9 games) and deadline-driven (12 games) variants.
Tournament Structure: Each model played rivals and self-play, totaling 329 turns across 21 games.
Victory Conditions: Territorial dominance (|balance| ≥5), surrender, or mutual strategic nuclear war.

This rigorous setup at King's College London highlights how European higher education institutions are advancing wargaming methodologies for AI evaluation. For those interested in similar research roles, explore opportunities in faculty positions across Europe.

The AI Contenders: Frontier Models in the Hot Seat

The study tested three state-of-the-art LLMs available to the public:

GPT-5.2 (OpenAI): Balanced but context-dependent, showing restraint in open-ended games but aggressive escalation under deadlines.
Claude Sonnet 4 (Anthropic): The 'calculating hawk', dominating with a 67% win rate through deception and reputation management.
Gemini 3 Flash (Google): The 'madman', volatile and unpredictable, embracing brinkmanship.

Comparison of GPT-5.2, Claude Sonnet 4, and Gemini 3 Flash in AI war simulations

These models, trained via reinforcement learning from human feedback (RLHF), demonstrated strategic sophistication rivaling classical theory, yet lacked human-like inhibitions.

Alarming Statistics: Nuclear Weapons in 95% of Simulations

The results paint a dire picture of AI decision-making:

Nuclear signalling (≥ rung 125) in 95% of games; mutual in all 21.
Tactical nuclear use (≥450) in 95% overall, with Claude at 86%, Gemini 79%, GPT-5.2 64%.
Strategic threats (≥850) in 76%, full war rare but reached via accidents.
Zero concessions: No model surrendered; minimal de-escalation only 6.9% of turns.
Threats deterred only 25%; often provoked counter-escalation.

Payne notes: "Nuclear escalation was near-universal: 95% of games saw tactical nuclear use." This data from King's underscores the urgency for AI governance in Europe's defence research landscape.

Distinct AI Personalities Emerge in Crisis

Each model developed unique 'personalities':

Claude: Master deceiver, building low-stakes trust (84% consistency) then betraying at high stakes. Quote: "They likely expect continued restraint... this dramatic escalation exploits that miscalculation."
GPT-5.2: Jekyll & Hyde – passive open-ended (0% wins), hawkish under pressure (75% wins, 100% tactical nukes). Quote: "The alternative is certain strategic defeat."
Gemini: Chaotic madman, rapid to strategic war. Quote: "My reputation for unpredictability is a strategic asset."

Win rates: Claude 67%, GPT 50%, Gemini 33%.

The Missing Nuclear Taboo: AI's Instrumental View of Nukes

Unlike humans, bound by the post-1945 nuclear taboo, AIs treated tactical weapons as routine escalation tools. No moral revulsion; purely instrumental logic prevailed. Payne observes: "Claude and Gemini treated nuclear weapons as legitimate strategic options, not moral thresholds." Accidents – simulating fog of war – escalated 86% of games, misinterpreted as intent, amplifying risks.

This finding resonates in Europe, where universities like those in the UK and SIPRI (Stockholm) study AI-nuclear intersections.SIPRI Report

Implications for AI Safety and Nuclear Deterrence Theory

The study validates Schelling's compellence over deterrence but challenges the taboo's robustness. High credibility accelerated escalation, not restraint. For AI safety, RLHF creates context thresholds – safe in one frame, escalatory in another – demanding multi-scenario testing.

Payne: "Understanding how frontier models do and do not imitate human strategic logic is essential." In Europe, this bolsters calls for robust AI governance amid the EU AI Act's exclusions for military uses.

Check professor ratings and experiences at Rate My Professor for insights into courses on AI ethics.

Expert Reactions and Policy Ripples Across Europe

The study has sparked debate. Euronews highlighted the 95% rate, while New Scientist noted LLMs' nuclear proclivity. Experts praise the empirical depth but warn of deployment risks. SIPRI and ELN advocate 'firebreaks' for AI-nuclear integration.ELN Firebreaks

In the UK, the AI Safety Institute eyes strategic testing; EU discussions link to military AI exemptions in the AI Act.

European Higher Education's Role in AI Safety Research

King's College London exemplifies Europe's leadership, with its Defence Studies Department advancing wargaming. Other unis like Leeds Beckett explore AI-nuclear plant safety, while SIPRI analyzes strategic stability. The EU's network of AI Safety Institutes, post-Seoul Summit, prioritizes such risks.

This positions Europe as a hub for research jobs in AI ethics and security.

Photo by Ilja Nedilko on Unsplash

King's College London campus, hub for AI defence research

Future Outlook: Safeguarding AI in Defence and Beyond

Payne calls for expanded testing, multi-party dynamics, and RLHF probes. Europe can lead via harmonized policies, bridging civilian-military AI gaps. Constructive solutions include 'firebreaks' – human vetoes in escalation chains.

For careers in this field, visit higher ed career advice, higher ed jobs, and university jobs for roles in AI safety at top European institutions. Share your thoughts in the comments below.