Google announces Gemini 3.5 Live Translate for instant voice-to-voice translation

Ars Technica Models

Summary

Google announces Gemini 3.5 Live Translate, a speech-to-speech model that provides instant voice translation in over 70 languages, rolling out across Google ecosystem.

<p>Google has been chasing real-time translation for years, which it says has been one of its "pioneering machine learning experiments." We've seen numerous demos on stage at Google events in the past, but you needed Google phones, earbuds, or some other specific setup. Last year, Google brought real-time translation to more users in the Translate app, and now it's expanding availability more. With the release of Gemini 3.5 Live Translate, you'll have access to instant translation in more places and with lower latency than ever before.</p> <p>The new AI model is part of the version 3.5 family that <a href="https://arstechnica.com/google/2026/05/google-announces-agent-optimized-gemini-3-5-flash-and-a-do-anything-model-called-omni/">launched at I/O</a>. Before today, Google had only rolled out the Flash version, but we're expecting a Pro model to drop in the coming weeks. Gemini 3.5 Live Translate is a speech-to-speech model tuned to automatically detect and translate in more than 70 languages.</p> <p><a href="https://blog.google/innovation-and-ai/models-and-research/gemini-models/gemini-live-3-5-translate/">Google says</a> Gemini 3.5 Live Translate is fast enough to keep up with a normal conversation, following just a few seconds behind the speaker while also matching intonation, pacing, and pitch. In short, the voice sounds more like you than a generic robot. The demos, which are all being recorded under controlled conditions, do sound impressive. You won't have to wait long to verify the model's abilities for yourself, though.</p><p><a href="https://arstechnica.com/ai/2026/06/google-announces-gemini-3-5-live-translate-for-instant-voice-to-voice-translation/">Read full article</a></p> <p><a href="https://arstechnica.com/ai/2026/06/google-announces-gemini-3-5-live-translate-for-instant-voice-to-voice-translation/#comments">Comments</a></p>
Original Article
View Cached Full Text

Cached at: 06/10/26, 12:19 AM

# Google announces Gemini 3.5 Live Translate for instant voice-to-voice translation Source: [https://arstechnica.com/ai/2026/06/google-announces-gemini-3-5-live-translate-for-instant-voice-to-voice-translation/](https://arstechnica.com/ai/2026/06/google-announces-gemini-3-5-live-translate-for-instant-voice-to-voice-translation/) Google has been chasing real\-time translation for years, which it says has been one of its “pioneering machine learning experiments\.” We’ve seen numerous demos on stage at Google events in the past, but you needed Google phones, earbuds, or some other specific setup\. Last year, Google brought real\-time translation to more users in the Translate app, and now it’s expanding availability more\. With the release of Gemini 3\.5 Live Translate, you’ll have access to instant translation in more places and with lower latency than ever before\. The new AI model is part of the version 3\.5 family that[launched at I/O](https://arstechnica.com/google/2026/05/google-announces-agent-optimized-gemini-3-5-flash-and-a-do-anything-model-called-omni/)\. Before today, Google had only rolled out the Flash version, but we’re expecting a Pro model to drop in the coming weeks\. Gemini 3\.5 Live Translate is a speech\-to\-speech model tuned to automatically detect and translate in more than 70 languages\. [Google says](https://blog.google/innovation-and-ai/models-and-research/gemini-models/gemini-live-3-5-translate/)Gemini 3\.5 Live Translate is fast enough to keep up with a normal conversation, following just a few seconds behind the speaker while also matching intonation, pacing, and pitch\. In short, the voice sounds more like you than a generic robot\. The demos, which are all being recorded under controlled conditions, do sound impressive\. You won’t have to wait long to verify the model’s abilities for yourself, though\. Speech translation in Google Meet with Gemini 3\.5 Live Translate\. Gemini 3\.5 Live Translate is rolling out across several parts of the Google ecosystem\. Developers can begin building with a public preview in the Gemini Live API or AI Studio\. The model processes speech continuously and handles all the multilingual inputs automatically, saving developers from manually configuring settings\. It also filters out background noise in busy environments\.

Similar Articles

Fluid, natural voice translation with Gemini 3.5 Live Translate

Google DeepMind Blog

Google releases Gemini 3.5 Live Translate, an audio model for near real-time speech-to-speech translation in over 70 languages, preserving speaker intonation and pacing. It is rolling out across Google products including the Gemini Live API, Google Meet, and Google Translate.

Gemini 3.5 Live Translate

Product Hunt

Gemini 3.5 Live Translate is a new audio model for real-time speech-to-speech translation.

Improved Gemini audio models for powerful voice experiences

Google DeepMind Blog

Google has updated Gemini 2.5 Flash Native Audio to improve live voice agent capabilities, including sharper function calling, better instruction following, and smoother conversation context retrieval. The update also introduces live speech translation in the Google Translate app beta, preserving intonation across 70+ languages.