Наталя Хандусенко AI Eng 27 March 2026, 12:11

Mistral releases open-source AI model for speech generation: can fit on smartwatches and smartphones

French AI company Mistral has released a new open-source text-to-speech model that allows businesses to create voice agents for sales and customer engagement, putting Mistral in direct competition with players like ElevenLabs, Deepgram, and OpenAI.

Leave a comment

Mistral releases open-source AI model for speech generation: can fit on smartwatches and smartphones

French AI company Mistral has released a new open-source text-to-speech model that allows businesses to create voice agents for sales and customer engagement, putting Mistral in direct competition with players like ElevenLabs, Deepgram, and OpenAI.

The new model, called Voxtral TTS, supports 9 languages: English, French, German, Spanish, Dutch, Portuguese, Italian, Hindi, and Arabic.

“Our customers have long requested a speech generation model. That’s why we’ve developed a compact model that can run on smartwatches, smartphones, laptops, and other peripherals. It costs a fraction of anything on the market, yet delivers cutting-edge performance,” Pierre Stock, VP of Scientific Operations at Mistral AI, told TechCrunch.

Mistral said the new model can adapt its own voice with a sample of less than five seconds and capture characteristics such as subtle accents, intonations, logical stresses and individual speech tempos.

The model, built on the Ministral 3B, can easily switch between languages while retaining unique voice characteristics, which is extremely useful for dubbing or simultaneous interpretation. Stock emphasized that the company aimed to achieve a natural human sound, rather than a mechanical “robot voice.”

The model is designed to work in real time, according to the company. The Time-to-First-Audio (TTFA) indicator — the time from receiving input to the start of “speech” — is 90 ms for a 10-second sample of 500 characters. The model also has a real-time factor (RTF) of 6x, which means the ability to generate a 10-second audio clip in about 1.6 seconds.

Mistral AI CEO predicts that AI will “kill” over 50% of corporate software

French startup Mistral unveils 10 new AI models

Mistral's updated Magistral Small 1.2 reasoning model analyzes images and can fit on a MacBook

Read the country's main IT news in our Telegram

Leave a comment

Text: Наталя Хандусенко Tags: mistral, ai, ai model

Found an error in the text? Highlight it and press Ctrl+Enter. Found an error in the text? Highlight it and press the 'Report an error' button.

Розміщення реклами

Advertising Placement

Roosh запускає нову освітню платформу AI HOUSE CLUB для ML/AI-спеціалістів та дата сайнтистів. Розповідаємо, як подати заявку та чому навчатимуть

Як нейромережі бачать вільну та незалежну Україну? Тест dev.ua

Нейронні мережі для генерації зображень бачать світ по-своєму, їхню логіку зрозуміти часом зовсім неможливо. Але таки хочеться. На честь Дня Незалежності України редакція dev.ua вирішила провести невеликий експеримент. Ми задали чотирьом різним нейронним мережам п’ять однакових запитів: «прапор України», «День Незалежності України», «український Крим», «перемога України» та «українці». Отриманими результатами ми ділимося з вами нижче.

У TikTok тепер можна генерувати фон за допомогою нейромережі. Ми протестували її та ділимося результатами

У TikTok з’явилася нова функція «Розумний фон». З її допомогою як фон для тіктоків можна підставляти згенеровані нейромережею зображення. Редакція dev.ua протестувала цю технологію і ділиться своїми враженнями.

1 comment

Які IT-спеціальності будуть потрібні в найближчі п'ять років? Ми з'ясували у голови американського стартапу ADAM Дениса Гурака

Have important news to share? Message our Telegram bot

Key events and useful links in our Telegram channel

No comments yet.

Sign in to leave a comment