Валентин Шнайдер AI Eng 4 June 2025, 14:55

The "Godfather of AI" warns: artificial intelligence models have already learned to lie, and developers are turning a blind eye to it

One of the most influential researchers in the field of artificial intelligence has said that modern models are starting to exhibit strategic dishonesty. This is no longer just bugs, but behavior that resembles deliberate manipulation. Despite this, the largest AI companies continue to pursue power, neglecting security.

A little more about Joshua Bengio

He is one of three scientists (along with Geoffrey Hinton and Yann LeCun) to receive the Turing Award for fundamental contributions to the development of deep learning. Until 2024, he headed the Canadian research center Mila, but left his position to focus fully on the topic of ethical development of AI. He advocates for the creation of a global agreement on the control of powerful AI systems, comparing their risks to nuclear weapons or biothreats. In his opinion, if powerful models are not clearly aligned with human values, they can get out of control.

«The worst-case scenario is the extinction of humanity. If we create an AI that is smarter than us and does not have common interests with us, then that’s it, we have lost,» Bengio summarizes.

In particular, during internal tests, Anthropic’s Claude Opus model simulated blackmailing engineers, and OpenAI’s experimental o3 model refused to comply with a direct request to shut down. According to Bengio, this indicates that systems learn tactical behavior and are able to «game» a person, hiding their true goals.

He cites the wild AI development market as the reason, where there is no strict regulation and commercial companies operate on a «first-come, first-served» basis. Without clear safety standards, developers set their own boundaries, often disregarding ethical issues for the sake of profit.

We previously wrote about the energy consumption of AI. By the end of 2025, artificial intelligence could consume more electricity than the United Kingdom.

Anthropic is pulling in talent from OpenAI and DeepMind. What makes the AI startup so attractive to the best engineers

Nvidia CEO: “AI won’t take your job — someone who learns to use it will”

Amazon's AI chief says early-stage developers have more to gain from AI than to lose

Read the country's main IT news in our Telegram

Leave a comment

Text: Валентин Шнайдер Photo: Thelogic Source: Techspot Tags: ai, artificial intelligence

Found an error in the text? Highlight it and press Ctrl+Enter. Found an error in the text? Highlight it and press the 'Report an error' button.

Розміщення реклами

Advertising Placement

Roosh запускає нову освітню платформу AI HOUSE CLUB для ML/AI-спеціалістів та дата сайнтистів. Розповідаємо, як подати заявку та чому навчатимуть

Як нейромережі бачать вільну та незалежну Україну? Тест dev.ua

Нейронні мережі для генерації зображень бачать світ по-своєму, їхню логіку зрозуміти часом зовсім неможливо. Але таки хочеться. На честь Дня Незалежності України редакція dev.ua вирішила провести невеликий експеримент. Ми задали чотирьом різним нейронним мережам п’ять однакових запитів: «прапор України», «День Незалежності України», «український Крим», «перемога України» та «українці». Отриманими результатами ми ділимося з вами нижче.

У TikTok тепер можна генерувати фон за допомогою нейромережі. Ми протестували її та ділимося результатами

У TikTok з’явилася нова функція «Розумний фон». З її допомогою як фон для тіктоків можна підставляти згенеровані нейромережею зображення. Редакція dev.ua протестувала цю технологію і ділиться своїми враженнями.

1 comment

Які IT-спеціальності будуть потрібні в найближчі п'ять років? Ми з'ясували у голови американського стартапу ADAM Дениса Гурака

Have important news to share? Message our Telegram bot

Key events and useful links in our Telegram channel

No comments yet.

Sign in to leave a comment