Gemini Omni | I/O 2026 Keynote
Summary
Google releases Gemini Omni at I/O 2026, a new model capable of generating any output from any input, combining world knowledge with generative media to enable conversational video editing and creative morphing, first launching with Gemini Omni Flash.
View Cached Full Text
Cached at: 05/22/26, 07:03 PM
Similar Articles
Google’s Gemini Omni turns images, audio, and text into video — and that’s just the start
Google announces Gemini Omni, a family of multimodal models that can generate video from images, audio, and text, reasoning across inputs to produce consistent, high-quality outputs. The first model, Gemini Omni Flash, rolls out at Google I/O to the Gemini app, YouTube Shorts, and Flow.
9 demos of Gemini Omni and Gemini 3.5 in action
Google showcases 9 demos of its new Gemini Omni (video generation and editing via conversation) and Gemini 3.5 Flash (agentic model for complex tasks) models, demonstrated at Google I/O 2026.
Introducing Gemini Omni: Create Anything from Anything
Google introduces Gemini Omni, a new multimodal AI model capable of processing and generating content across text, images, audio, and video from any input type.
Gemini | I/O 2026 Keynote
Google announced at I/O 2026 a complete redesign of the Gemini app (neural representation), the multimodal creation model Gemini Omni, and proactive agent features such as Daily Brief and Gemini Spark, while also launching a voice-driven multi-document processing capability for macOS.
Gemini Omni
Gemini Omni is a new AI product that enables creation from any input, starting with video, as showcased on Product Hunt.