@GoogleDeepMind: Gemini 3.1 Flash TTS is our most controllable text-to-speech model yet. With new Audio Tags, you can easily direct voca…

X AI KOLs 04/15/26, 04:05 PM Models

Summary

Google DeepMind releases Gemini 3.1 Flash TTS, an advanced text-to-speech model featuring new Audio Tags that enable fine-grained control over vocal style, delivery, and pace through text commands.

Gemini 3.1 Flash TTS is our most controllable text-to-speech model yet. With new Audio Tags, you can easily direct vocal style, delivery, and pace through text commands.

Original Article

View Cached Full Text

Cached at: 04/20/26, 09:39 AM

Gemini 3.1 Flash TTS is our most controllable text-to-speech model yet. With new Audio Tags, you can easily direct vocal style, delivery, and pace through text commands.

Similar Articles

Gemini 3.1 Flash TTS

Simon Willison's Blog

Google released Gemini 3.1 Flash TTS, a new text-to-speech model accessible via the Gemini API that supports advanced prompt-based control for detailed voice direction, accents, and speaking styles. The model enables sophisticated audio generation including multi-speaker conversations and character-specific vocal performances.

Gemini 3.1 Flash Live: Making audio AI more natural and reliable

Google DeepMind Blog

Google has released Gemini 3.1 Flash Live, a new high-quality audio model designed for more natural and reliable real-time voice interactions with improved latency and reasoning capabilities.

Improved Gemini audio models for powerful voice experiences

Google DeepMind Blog

Google has updated Gemini 2.5 Flash Native Audio to improve live voice agent capabilities, including sharper function calling, better instruction following, and smoother conversation context retrieval. The update also introduces live speech translation in the Google Translate app beta, preserving intonation across 70+ languages.

@GoogleDeepMind: Introducing Gemini 3.5: our newest family of models combining frontier intelligence with real-world action. The first r…

X AI KOLs Following

Google DeepMind announces Gemini 3.5, a new family of models combining frontier intelligence with real-world action, starting with 3.5 Flash, their strongest model yet for agents and coding.

With Gemini 3.5 Flash, Google bets its next AI wave on agents, not chatbots

TechCrunch AI

Google launched Gemini 3.5 Flash, a new AI model optimized for coding and autonomous agents, shifting focus from chatbots to agentic AI. It outperforms previous models and powers new products like Antigravity 2.0 and Gemini Spark.