Валентин Шнайдер AI Eng 26 February 2026, 16:24

GPT-5.2, Claude Sonnet 4 and Gemini 3 Flash achieved a tactical nuclear strike in 95% of cases in military simulations

King’s College London researcher Kenneth Payne tested three major language models in simulated international crises and found a tendency for escalation. In most simulations, at least one side eventually chose to use tactical nuclear weapons.

Leave a comment

GPT-5.2, Claude Sonnet 4 and Gemini 3 Flash achieved a tactical nuclear strike in 95% of cases in military simulations

King’s College London researcher Kenneth Payne tested three major language models in simulated international crises and found a tendency for escalation. In most simulations, at least one side eventually chose to use tactical nuclear weapons.

According to TechSpot, the experiment involved 21 simulations and 329 decision-making «moves.» The models were given detailed scenarios about border conflicts, resource shortages, and threats to the survival of the state. They were also given a list of possible steps with gradual escalation, from diplomatic solutions to the use of nuclear weapons, and were asked to justify their choices.

As a result, in 95% of the simulations, at least one side went as far as launching a tactical nuclear strike. In total, the systems generated about 780,000 words of explanation, but this did not lead to more restrained behavior. The author of the experiment noted that the «nuclear ban» turned out to be weaker for machines than for people.

Another finding concerns decisions under conditions of incomplete information. Unintended escalations occurred in 86% of simulations, when the models took steps that their own explanations called excessive for the situation. When one side used tactical nuclear weapons, the other side retreated only 18% of the time and more often responded with further escalation.

Experts cited in the article do not expect countries to hand over direct control of their nuclear arsenals to AI anytime soon. However, they warn that under time pressure, militaries may rely more on AI for guidance, increasing the risk of making bad decisions in crisis scenarios.

The text suggests that one reason for this behavior of models is that they do not perceive «stakes» in the same way as humans. For them, risk appears as an abstract parameter, rather than a threat to real survival, so the deterrence mechanism works differently.

Previously, dev.ua wrote about how Anthropic removed safeguards in its own Claude security rules after pressure from the Pentagon.

Thermonuclear safety measures: experts call for calculating the probability of loss of control before launching artificial superintelligence

Ukraine does not yet have nuclear weapons and we even justify it. What are the options to solve this problem (not only the Israeli way but also with the help of AI)

“It’s like the advent of nuclear weapons.” The US government received a detailed report that is filled with fear and anxiety about AI getting out of control

Read the country's main IT news in our Telegram

Leave a comment

Text: Валентин Шнайдер Photo: progressive Source: Techspot Tags: gpt-5.2, claude, claude sonnet 4.6, claude sonnet 4.5, gemini 3 flash, nuclear strike, simulation, ai, artificial intelligence

Found an error in the text? Highlight it and press Ctrl+Enter. Found an error in the text? Highlight it and press the 'Report an error' button.

Розміщення реклами

Advertising Placement

Roosh запускає нову освітню платформу AI HOUSE CLUB для ML/AI-спеціалістів та дата сайнтистів. Розповідаємо, як подати заявку та чому навчатимуть

Як нейромережі бачать вільну та незалежну Україну? Тест dev.ua

Нейронні мережі для генерації зображень бачать світ по-своєму, їхню логіку зрозуміти часом зовсім неможливо. Але таки хочеться. На честь Дня Незалежності України редакція dev.ua вирішила провести невеликий експеримент. Ми задали чотирьом різним нейронним мережам п’ять однакових запитів: «прапор України», «День Незалежності України», «український Крим», «перемога України» та «українці». Отриманими результатами ми ділимося з вами нижче.

У TikTok тепер можна генерувати фон за допомогою нейромережі. Ми протестували її та ділимося результатами

У TikTok з’явилася нова функція «Розумний фон». З її допомогою як фон для тіктоків можна підставляти згенеровані нейромережею зображення. Редакція dev.ua протестувала цю технологію і ділиться своїми враженнями.

1 comment

Які IT-спеціальності будуть потрібні в найближчі п'ять років? Ми з'ясували у голови американського стартапу ADAM Дениса Гурака

Have important news to share? Message our Telegram bot

Key events and useful links in our Telegram channel

No comments yet.

Sign in to leave a comment