Марія Бровінська · AI · Eng
24 January 2025, 10:08
Dreams come true: OpenAI has unveiled an AI agent called Operator that can do work for people. What can it do, what makes it special, and who can already use it?
Operator can perform work for you independently: you give it a task, and it carries the task out.
Operator is based on a new model that the developers call a «computer-using agent» (CUA).
CUA combines the vision capabilities of GPT-4o with advanced reasoning through reinforcement learning. It is trained to operate a computer in the same way that humans do — looking at a screen, using a mouse, and using a keyboard.
According to OpenAI, the model still has limitations and will continue to evolve based on feedback. The company plans to add CUA to the developer API soon.
Operator is one of our first agents, which are AIs capable of doing work for you independently—you give it a task and it will execute it. https://t.co/nbH7OMmkmO
«This is the first time our models can perform actions online, so we conducted extensive internal testing and engaged external experts to ensure that Operator is safe to use,» OpenAI noted.
Operator «sees» interfaces through screenshots, presses buttons, enters text, and can correct its own errors. If a task is too complex, the agent hands control over to the user. Before important actions, such as entering passwords, Operator always asks for confirmation. It also blocks malicious requests and prohibited content.
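The loop described above (look at the screen, pick an action, ask before anything sensitive, hand control back when stuck) can be illustrated with a minimal Python sketch. This is not OpenAI's implementation or API: `Action`, `run_agent`, `scripted_policy`, and the placeholder screenshot string are all hypothetical names made up for illustration, and the «model» here is just a scripted list of actions.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class Action:
    """Hypothetical action record; not part of any real OpenAI API."""
    kind: str              # "click", "type", "handoff", or "done"
    target: str = ""       # UI element the action applies to
    text: str = ""         # text to enter for "type" actions
    sensitive: bool = False  # True for steps such as entering a password


def run_agent(
    policy: Callable[[str, List[str]], Action],
    confirm: Callable[[Action], bool],
    max_steps: int = 10,
) -> List[str]:
    """Observe, decide, (optionally) confirm, act; return a log of steps."""
    log: List[str] = []
    for step in range(max_steps):
        # Stand-in for a real screen capture (assumption for this sketch).
        screenshot = f"screenshot-{step}"
        action = policy(screenshot, log)
        if action.kind == "done":
            log.append("done")
            break
        if action.kind == "handoff":
            # The task is too complex: give control back to the user.
            log.append("handoff to user")
            break
        if action.sensitive and not confirm(action):
            # The user refused a sensitive step, so the agent stops.
            log.append(f"declined {action.kind} {action.target}")
            break
        log.append(f"{action.kind} {action.target}")
    return log


def scripted_policy(actions: List[Action]) -> Callable[[str, List[str]], Action]:
    """Toy policy that replays a fixed plan instead of calling a model."""
    remaining = iter(actions)

    def policy(screenshot: str, history: List[str]) -> Action:
        return next(remaining, Action("done"))

    return policy


if __name__ == "__main__":
    plan = [
        Action("click", "search box"),
        Action("type", "search box", "pizza"),
        Action("type", "password field", "secret", sensitive=True),
        Action("click", "order button"),
    ]
    # The user declines the password step, so the run stops there.
    for line in run_agent(scripted_policy(plan), confirm=lambda a: False):
        print(line)
```

In this sketch a declined confirmation ends the run rather than skipping the step, which is one possible design choice; how the real agent behaves after a refusal is not described in the source.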
The agent already works with popular services like DoorDash, Instacart, OpenTable, and Uber. It can order food or make restaurant reservations.
OpenAI plans to eventually integrate the agent into ChatGPT and make it available to a wider range of users, including Plus, Team, and Enterprise subscribers.
As a reminder, a ChatGPT user recently noticed updates in the chatbot's client code indicating that the Operator AI agent would be available in a pre-release version for Pro subscribers. According to rumors at the time, Operator would be able to perform a number of tasks in the browser on behalf of the user.
The Information reported that OpenAI could launch Operator as early as this week.
Last November, it became known that OpenAI was preparing to launch a new artificial intelligence agent codenamed Operator as early as 2025. OpenAI’s AI agent can use a computer to perform actions on behalf of a person, such as writing code or booking tickets.
Senior Engineering Manager and IT blogger Dima Maleev commented on what OpenAI's launch of the Operator AI agent could mean.
«I like this technological evolution. But I feel sad for the people who are developing solutions for bot detection and protection against scraping,» Maleev said.