Gemini can transcribe audio and video into text, including the free version. An expert gave advice on how to use it
The Gemini AI service allows you to upload audio and video for transcription into text format.
The Gemini AI service allows you to upload audio and video for transcription into text format.
The Gemini AI service allows you to upload audio and video for transcription into text format.
AI expert Oleksiy Minakov reported this on his Facebook page.
«In the paid version of Gemini for $20/month, you can transcribe up to three hours of audio! And up to 1 hour of video. In the free version, you can transcribe up to 10 minutes of audio and up to 5 minutes of video,» he noted.
Minakov also noted the speed with which Gemini performs this task.
«The most important thing I liked was the speed of decoding. Instant! I’m not kidding — 2 hours of audio was decoded in a few seconds. Plus, you can add up to 10 files in one request!» added the AI expert.
As dev.ua reported, Google recently significantly updated its AI-based video generator Veo 3 and its accelerated version Veo 3 Fast. The company reduced the prices for content creation and also added the ability to generate vertical videos in a 9:16 aspect ratio and videos in 1080p HD resolution.
Additionally, Google has improved image generation in Gemini thanks to the nano-banana AI model.


