Олег Онопрієнко, AI Eng
21 March 2025, 14:09
OpenAI improves AI for voice recognition and generation. Model supports Ukrainian language
OpenAI has released updates to two of its key models, Whisper (an AI for transcription) and Voice Engine (a voice synthesis technology). The new versions offer even better speech recognition accuracy and more realistic voice reproduction, bringing AI closer to the level of natural communication.
OpenAI’s updates make voice technology more accessible and accurate. This could significantly improve automatic transcription services, voice assistants, and systems for narrating videos or audiobooks, TechCrunch reports.
Whisper, OpenAI’s speech recognition model, is now faster and more accurate. It copes better with complex accents, background noise, and even corrupted audio recordings. This makes the technology more effective for automatically transcribing interviews, conferences, and other conversational formats.
OpenAI has also improved Voice Engine, its voice generation model. It can now mimic a human voice more accurately from a short sample. This opens up new possibilities for voice assistants, text-to-speech, and personalized voice content.
OpenAI has also launched a model called o1-pro, which is meant to generate “consistently better answers” but is the company’s most expensive AI model.