@googlegemma: Real-time social robotics, from the cloud to your local device. Watch Ian from our DevX team use Gemini Live for a seam…
Summary
The Google Gemma team demonstrates real-time social robotics on the Reachy Mini robot: first a cloud-based voice chat through Gemini Live, then the robot running locally on Gemma 4.
Cached at: 06/12/26, 05:01 PM
Real-time social robotics, from the cloud to your local device.
Watch Ian from our DevX team use Gemini Live for a seamless voice chat with Reachy Mini.
Then, stick around until the end to see the robot running locally on Gemma 4! https://t.co/NrpKuYIE5b
Similar Articles
Gemini Robotics On-Device brings AI to local robotic devices
Google DeepMind introduces Gemini Robotics On-Device, an efficient VLA model optimized to run locally on robotic devices, enabling low-latency operation and offline capability while maintaining strong dexterous manipulation and task generalization. The model can be fine-tuned with as few as 50-100 demonstrations and comes with an SDK for developers.
Gemini Robotics brings AI into the physical world
Google DeepMind introduces Gemini Robotics, a Gemini 2.0-based vision-language-action model designed to control physical robots with improved generality, interactivity, and dexterity. The company also launches Gemini Robotics-ER for spatial reasoning and partners with Apptronik to develop humanoid robots.
Gemma 4 running fully offline on WebGPU with Transformers.js, controlling Reachy Mini over WebSerial
Demonstrates running Gemma 4 offline in the browser using WebGPU and Transformers.js to control a Reachy Mini robot via WebSerial.
Gemini Robotics-ER 1.6: Powering real-world robotics tasks through enhanced embodied reasoning
Google DeepMind introduces Gemini Robotics-ER 1.6, a specialized AI model enhancing embodied reasoning for robotics through improved spatial awareness, task planning, and instrument reading capabilities.
Gemma 4 VLA Demo on Jetson Orin Nano Super
NVIDIA and Hugging Face publish a hands-on demo showing Gemma 4 running as a vision-language-action model entirely on the Jetson Orin Nano Super, using local STT/TTS and webcam input.