Gemini Omni

Hacker News Top Models

Summary

Gemini Omni is a new AI model from Google DeepMind that combines reasoning with creative capabilities, enabling multimodal understanding, video editing, and content generation, with built-in safety measures and digital watermarking.


Cached at: 05/19/26, 07:14 PM

# Gemini Omni

Source: [https://deepmind.google/models/gemini-omni/](https://deepmind.google/models/gemini-omni/)

Gemini Omni is where Gemini’s ability to reason meets the ability to create. It delivers a leap in world understanding, multimodality, and editing.

---

Prompt: Make it look like the weird shape of my hand hole super zooms and magnifies the ground it's looking at in sharper quality.

Prompt: When the finger in <video> touches the animal toy play the sound the animal makes

Prompt: The lights of the apartments start turning on in sync with the music.

Prompt: Transport the violinist to the image environment

Prompt: Make the violin invisible

Prompt: Change the camera angle to be over the violinist’s shoulder.

Prompt: Change spaceship to <object>

---

Prompt: A marble rolling fast on a chain reaction style track, continuous smooth shot

Prompt: claymation explainer of protein folding, everything is made out of clay, no hands, stop motion, accurate

Prompt: A skeuomorphism stop motion explainer about how the brain hippocampus works with a compelling voiceover. Don’t add seahorses. No voice cuts at the end. Don’t add text.

Prompt: The video shows items of the alphabet. An unusual item starting with each letter is shown sitting on a table (like a Capybara for C, disco globe for D and Lava Lamp for L). All 26 letters must be represented by 26 items with matching lower thirds displaying the letter. Only one item and lower third at a time. Each lower third must look like a black marker written on a slip of paper in the bottom left. Rapid fire, roughly 9 frames per item at 24FPS. Last frame is a slip of paper "THE END". The whole video is accompanied by calm smooth music.

Prompt: word by word, one word on a the screen at a time: did, you, know, that, this, model, can, do, pretty, good, text!? each word appears with a different animated style, perfect pacing to a rhythm, sizzle reel

---

### Creating your prompts

Use our prompt guide to create realistic, coherent, and creative output.

- Training/development evaluations including automated and human evaluations carried out continuously throughout and after the model’s training, to monitor its progress and performance
- Human red teaming conducted by specialist teams who sit outside of the model development team, across the policies and desiderata, deliberately trying to spot weaknesses and ensure the model adheres to safety policies and desired outcomes
- Automated red teaming to dynamically evaluate Gemini Omni Flash for safety and security considerations at scale, complementing human red teaming and static evaluations
- Ethics and safety reviews conducted ahead of the model’s release

Content created or edited with Omni in the Gemini app, Google Flow or YouTube includes our imperceptible [SynthID](https://deepmind.google/blog/identifying-ai-generated-images-with-synthid/) digital watermark and [C2PA Content Credentials](https://contentcredentials.org/?utm_source=deepmind.google&utm_medium=referral&utm_campaign=gdm&utm_content=). You can easily verify content through the Gemini app and coming soon to Chrome and Search.

You can find out more about how we're expanding our content transparency and verification tools to help you understand how content was created and edited across the web [in our blog post](https://blog.google/innovation-and-ai/products/identifying-ai-generated-media-online?utm_source=deepmind.google&utm_medium=referral&utm_campaign=gdm&utm_content=).

---

### Gemini

Supercharge your creativity and productivity

### Google Flow

An AI creative studio built with and for creatives

### YouTube Shorts

A shorter way to discover, watch, and create on YouTube

---
