Google DeepMind introduced Gemini Robotics On-Device, a robotics model that runs locally on the robot, with an emphasis on adaptability and accessibility.
The model is designed to bring multimodal reasoning and understanding into the physical world; it is optimized for two-armed robots and requires minimal computational resources.
DeepMind also released the Gemini Robotics SDK, which lets developers evaluate the model and adapt it to their own tasks and environments. The release emphasizes low-latency inference and robustness.
Gemini Robotics On-Device generalizes strongly across visual, semantic, and behavioral dimensions, enabling robots to handle complex tasks such as following natural-language instructions and performing dexterous manipulation.