@GoogleDeepMind: Instead of writing complex code, the team interacted with Spot using plain English. We built a bridge between Gemini Ro…
Summary
Google DeepMind has integrated Gemini with Boston Dynamics' Spot robot, enabling natural language control without complex coding. Users can now instruct Spot using plain English to perform complex tasks like navigation, photography, and object manipulation.
View Cached Full Text
Cached at: 04/20/26, 09:39 AM
Instead of writing complex code, the team interacted with Spot using plain English. We built a bridge between Gemini Robotics ER and Spot’s system, giving the AI a basic set of tools to move freely, take photos, and grab things - enabling it to carry out more complex tasks.
Similar Articles
@GoogleDeepMind: We teamed up with @BostonDynamics to power their robot Spot with Gemini Robotics embodied reasoning models. This means …
Google DeepMind partnered with Boston Dynamics to integrate Gemini Robotics embodied reasoning models into their Spot robot, enabling improved environmental understanding, object identification, and command following for tasks like tidying rooms.
Gemini Robotics brings AI into the physical world
Google DeepMind introduces Gemini Robotics, a Gemini 2.0-based vision-language-action model designed to control physical robots with improved generality, interactivity, and dexterity. The company also launches Gemini Robotics-ER for spatial reasoning and partners with Apptronik to develop humanoid robots.
Gemini Robotics 1.5 brings AI agents into the physical world
Google DeepMind introduces Gemini Robotics 1.5 and Gemini Robotics-ER 1.5, advancing physical AI agents that can perceive, plan, think, and act to complete complex multi-step tasks. Gemini Robotics-ER 1.5 is now available to developers via the Gemini API.
@GoogleDeepMind: We’re reimagining a 50-year-old interface - the mouse pointer - with AI. These experimental demos show how people can i…
Google DeepMind is experimenting with reimagining the mouse pointer interface using Gemini AI, allowing users to control screens through motion, speech, and natural shorthand.
@GoogleDeepMind: Deep Research and Deep Research Max are our latest autonomous research agents powered by Gemini 3.1 Pro. They can safel…
Google DeepMind launched Deep Research and Deep Research Max, autonomous agents using Gemini 3.1 Pro to browse web and custom data for professional, fully-cited reports.