@_philschmid: Build a realtime translation app with the new Gemini Live Translate, Next.js, LiveKit and Cloud Run. What it covers: 1.…
Summary
A tutorial on building a realtime translation app using Gemini Live Translate, Next.js, LiveKit, and Cloud Run, covering audio streaming, translation, and deployment.
View Cached Full Text
Cached at: 06/18/26, 06:10 PM
Build a realtime translation app with the new Gemini Live Translate, Next.js, LiveKit and Cloud Run. What it covers:
- Stream host audio via WebRTC to a LiveKit Room
- Pipe PCM frames to Gemini Live for on-the-fly translation
- Publish translated audio back as separate language tracks
- Optimize latency with 100ms frame chunking (50Hz → 10Hz)
- Deploy to Cloud Run with Secret Manager and auto-scaling
Links below
Blog: https://dev.to/googleai/build-a-realtime-translation-app-with-gemini-live-api-livekit-google-cloud-run-5474…
Code: https://github.com/google-gemini/gemini-live-api-examples/tree/main/gemini-live-translate-livekit…
Similar Articles
@_philschmid: Docs: http://ai.google.dev/gemini-api/docs/live-api/live-translate… GitHub: http://github.com/google-gemini/gemini-live…
Google launches Gemini Live API for real-time translation, with documentation, GitHub examples, and a blog post.
@livekit: We built a live multilingual, multi-person video call with Gemini 3.5 Live Translate on LiveKit. Everyone picks their l…
LiveKit built a live multilingual video call using Gemini 3.5 Live Translate, allowing participants to speak in their own language and hear translations in real time. The open source code is available on GitHub.
Gemini 3.5 Live Translate
Gemini 3.5 Live Translate is a new audio model for real-time speech-to-speech translation.
Build a Realtime Speech Translation (28 minute read)
OpenAI releases gpt-realtime-translate, a low-latency speech-to-speech model optimized for live interpretation, accompanied by a developer cookbook for building multilingual browser, phone, and video applications.
Fluid, natural voice translation with Gemini 3.5 Live Translate
Google releases Gemini 3.5 Live Translate, an audio model for near real-time speech-to-speech translation in over 70 languages, preserving speaker intonation and pacing. It is rolling out across Google products including the Gemini Live API, Google Meet, and Google Translate.