Solving Rubik’s Cube with a robot hand
Summary
OpenAI developed a robot hand capable of solving a Rubik's Cube using a novel technique called Automatic Domain Randomization (ADR), which progressively increases simulation difficulty to enable effective transfer of learned behaviors from simulation to the real world.
View Cached Full Text
Cached at: 04/20/26, 02:55 PM
Similar Articles
Learning dexterity
OpenAI announces Dactyl, a system that learns robotic hand dexterity through simulation and reinforcement learning, using LSTMs to generalize across different physical environments and the Rapid PPO implementation to train policies that transfer to real-world manipulation tasks.
Domain randomization and generative models for robotic grasping
Researchers explore a data generation pipeline using domain randomization and procedurally generated objects to train a deep neural network for robotic grasp planning. The proposed autoregressive model achieves >90% success on unseen objects in simulation and 80% in the real world, despite being trained only on random simulated objects.
Sim-to-real transfer of robotic control with dynamics randomization
OpenAI researchers demonstrate a method to bridge the reality gap in robotic control by training policies with randomized simulator dynamics, enabling robots trained purely in simulation to successfully transfer to real-world tasks like object manipulation without physical training.
OpenAI Robotics Symposium 2019
OpenAI hosted its first Robotics Symposium on April 27, 2019, bringing together robotics and machine learning experts to discuss learning robots and demonstrate their humanoid robot hand solving manipulation tasks using vision and reinforcement learning.
Generalizing from simulation
OpenAI describes challenges with conventional RL on robotics tasks and introduces Hindsight Experience Replay (HER), a new RL algorithm that enables agents to learn from binary rewards by reframing failures as intended outcomes, combined with domain randomization for sim-to-real transfer.