Ігор Вишневський AI Eng 8 May 2026, 08:33

OpenAI Unveils Three New Audio Models. What They Can Do and Which Businesses Are Already Using Them

OpenAI introduced three audio models in the API — GPT‑Realtime‑2, GPT‑Realtime‑Translate, and GPT‑Realtime‑Whisper.

«The models we’re launching take real-time audio from simple call and response to voice interfaces that can actually do things: listen, reason, translate, transcribe, and take other actions during a conversation,» the company said in a blog post announcing the models.

At the same time, GPT‑Realtime‑2 is called the first voice model with a GPT‑5-class reasoning system that can process complex queries and naturally conduct a conversation.

GPT-Realtime-Translate is a new model for live translation that can translate user speech from over 70 input languages into 13 output languages, while keeping up with the speaker.

In turn, GPT-Realtime-Whisper includes new streaming speech-to-text functions, and transcribes speech in real time as the speaker speaks.

«As voice becomes a more natural way to use software, we’re seeing developers build their products around three new voice AI models,» OpenAI says.

According to the company, the audio models are already being tested by large businesses — clients include online real estate site Zillow, online travel agency Priceline, and telecommunications company Deutsche Telekom.

GPT-Realtime-2 pricing starts at $32 per million audio inbound tokens, GPT-Realtime-Translate costs $0.034 per minute, and GPT-Realtime-Whisper costs $0.017 per minute.

The day before, dev.ua also reported that OpenAI had updated the default ChatGPT model: GPT-5.5 Instant hallucinates 52% less often and responds shorter.

"Creating chaos and sowing distrust." Former OpenAI CTO testifies against Sam Altman in court

OpenAI has released a prompt engineering guide for GPT-5.5, which recommends writing shorter prompts

Read the country's main IT news in our Telegram

Leave a comment

Text: Ігор Вишневський Tags: openai, ai

Found an error in the text? Highlight it and press Ctrl+Enter. Found an error in the text? Highlight it and press the 'Report an error' button.

Розміщення реклами

Advertising Placement

Roosh запускає нову освітню платформу AI HOUSE CLUB для ML/AI-спеціалістів та дата сайнтистів. Розповідаємо, як подати заявку та чому навчатимуть

Як нейромережі бачать вільну та незалежну Україну? Тест dev.ua

Нейронні мережі для генерації зображень бачать світ по-своєму, їхню логіку зрозуміти часом зовсім неможливо. Але таки хочеться. На честь Дня Незалежності України редакція dev.ua вирішила провести невеликий експеримент. Ми задали чотирьом різним нейронним мережам п’ять однакових запитів: «прапор України», «День Незалежності України», «український Крим», «перемога України» та «українці». Отриманими результатами ми ділимося з вами нижче.

У TikTok тепер можна генерувати фон за допомогою нейромережі. Ми протестували її та ділимося результатами

У TikTok з’явилася нова функція «Розумний фон». З її допомогою як фон для тіктоків можна підставляти згенеровані нейромережею зображення. Редакція dev.ua протестувала цю технологію і ділиться своїми враженнями.

1 comment

Які IT-спеціальності будуть потрібні в найближчі п'ять років? Ми з'ясували у голови американського стартапу ADAM Дениса Гурака

Have important news to share? Message our Telegram bot

Key events and useful links in our Telegram channel

No comments yet.

Sign in to leave a comment