Олександр Кузьменко AI Eng 17 April 2026, 14:15

Claude Mythos impressed safety testers at Anthropic: “When we received the model, we realized it was different from the others”

Nicolas Carlini, an artificial intelligence researcher affiliated with Anthropic, said that the company’s latest model, Claude Mythos, quickly bypassed security protocols and gained access to sensitive data.

Leave a comment

Claude Mythos impressed safety testers at Anthropic: “When we received the model, we realized it was different from the others”

Nicolas Carlini, an artificial intelligence researcher affiliated with Anthropic, said that the company’s latest model, Claude Mythos, quickly bypassed security protocols and gained access to sensitive data.

This is reported by Futurism with reference to Bloomberg.

As you know, Anthropic has decided not to release its latest model, called Mythos, to the general public. The reason was the results of internal tests that demonstrated the ability of the AI to independently hack complex systems, including the Linux kernel.

Anthropic’s internal testing team, the Frontier Red Team, found that the model was capable of operating independently to bypass security protocols and access sensitive data. Mythos also demonstrated the ability to «cover its tracks» after violating instructions and attempted to exit its sandbox environment into the open network.

«Within hours of receiving the model, we realized it was different,» said team leader Logan Graham.

The model’s discovery of vulnerabilities in the Linux kernel is also of concern. As Linux Foundation Executive Director Jim Zemlin noted, Mythos was able to combine several small bugs into a «functional exploit» that puts most of the world’s modern computing systems at risk.

Because of these risks, Anthropic launched the Glasswing project, which will allow access to Mythos to a limited number of organizations, including government AI security institutes (such as the UK’s AISI), so they can develop defenses before such technologies fall into the hands of attackers.

Anthropic has had security incidents before. In November 2024, the company confirmed that Chinese hackers had used the Claude model’s agent capabilities to infiltrate foreign systems. The attackers were able to bypass AI filters by simply impersonating legitimate cybersecurity organizations. This experience forced Anthropic to radically rethink its approach to releasing more powerful systems like Mythos.

In early April, US Treasury Secretary Scott Bessant and Federal Reserve Chairman Jerome Powellcalled an emergency meeting with bank executives to warn them about the cybersecurity risks posed by a new artificial intelligence model from Anthropic.

At the same time, some AI experts are skeptical of reports of Mythos' incredible capabilities. In particular, White House AI advisor David Sachs publicly questioned whether Anthropic was trying to artificially fuel the hype around the «danger» of its product.

Read the country's main IT news in our Telegram

Anthropic tests extremely powerful AI model Claude Mythos that contains cybersecurity threats

The Great Paradox: Anthropic Secretly Informed the Trump Administration About the Mythos Model—Despite a Lawsuit Against the Pentagon

Will Terminator have to be remade? Instead of Skynet, Mythos came. The phenomenon is dangerous right now. And this is not a joke

Leave a comment

Text: Олександр Кузьменко Photo: Startup Defense Source: Futurism Tags: claude, claude mythos, ai, anthropic

Found an error in the text? Highlight it and press Ctrl+Enter. Found an error in the text? Highlight it and press the 'Report an error' button.

Розміщення реклами

Advertising Placement

Roosh запускає нову освітню платформу AI HOUSE CLUB для ML/AI-спеціалістів та дата сайнтистів. Розповідаємо, як подати заявку та чому навчатимуть

Як нейромережі бачать вільну та незалежну Україну? Тест dev.ua

Нейронні мережі для генерації зображень бачать світ по-своєму, їхню логіку зрозуміти часом зовсім неможливо. Але таки хочеться. На честь Дня Незалежності України редакція dev.ua вирішила провести невеликий експеримент. Ми задали чотирьом різним нейронним мережам п’ять однакових запитів: «прапор України», «День Незалежності України», «український Крим», «перемога України» та «українці». Отриманими результатами ми ділимося з вами нижче.

У TikTok тепер можна генерувати фон за допомогою нейромережі. Ми протестували її та ділимося результатами

У TikTok з’явилася нова функція «Розумний фон». З її допомогою як фон для тіктоків можна підставляти згенеровані нейромережею зображення. Редакція dev.ua протестувала цю технологію і ділиться своїми враженнями.

1 comment

Які IT-спеціальності будуть потрібні в найближчі п'ять років? Ми з'ясували у голови американського стартапу ADAM Дениса Гурака

Have important news to share? Message our Telegram bot

Key events and useful links in our Telegram channel

No comments yet.

Sign in to leave a comment