@GoogleDeepMind: More natural sounding speech Support for 70+ languages like Hindi, Japanese, and German SynthID watermarking on all out…
Summary
Google DeepMind upgraded its speech synthesis model to sound more natural across 70+ languages and now applies SynthID watermarking to all outputs.
Cached at: 04/21/26, 11:28 AM
More natural sounding speech
Support for 70+ languages like Hindi, Japanese, and German
SynthID watermarking on all outputs
Similar Articles
@GoogleDeepMind: SynthID, our imperceptible watermark for AI-generated content, is expanding to more partners. We’re also adding new way…
Google DeepMind is expanding SynthID, an imperceptible watermark for AI-generated content, to more partners and adding new detection methods via Gemini App and Google Search.
@GoogleDeepMind: 3.5 Live Translate can convert speech into over 70 languages and processes it as it’s streamed - while keeping tone, pa…
Google DeepMind announces Live Translate, a feature that converts speech into over 70 languages in real-time while preserving tone, pace, and pitch for more natural conversations.
Google's SynthID AI watermarking tech is being adopted by OpenAI, Nvidia, and more
Google's SynthID AI watermarking technology is being adopted by OpenAI, Nvidia, and other companies, expanding its use beyond Google's own AI models.
SynthID Detector — a new portal to help identify AI-generated content
Google announced SynthID Detector, a verification portal that identifies AI-generated content across images, audio, video, and text by detecting imperceptible SynthID watermarks embedded in media created with Google's AI tools. The platform is rolling out to early testers with plans for broader availability to journalists, media professionals, and researchers.
Advancing voice intelligence with new models in the API
OpenAI has announced three new voice models in its API: GPT-Realtime-2 with advanced reasoning, GPT-Realtime-Translate for live multilingual translation, and GPT-Realtime-Whisper for streaming transcription, aiming to enable more natural and action-oriented voice applications.