Sim-to-real transfer of robotic control with dynamics randomization

OpenAI Blog Papers

Summary

OpenAI researchers demonstrate a method to bridge the reality gap in robotic control by training policies with randomized simulator dynamics, enabling robots trained purely in simulation to successfully transfer to real-world tasks like object manipulation without physical training.

No content available
Original Article Export to Word Export to PDF
View Cached Full Text

Cached at: 04/20/26, 02:45 PM

# Sim-to-real transfer of robotic control with dynamics randomization Source: [https://openai.com/index/sim-to-real-transfer-of-robotic-control-with-dynamics-randomization/](https://openai.com/index/sim-to-real-transfer-of-robotic-control-with-dynamics-randomization/) ## Abstract Simulations are attractive environments for training agents as they provide an abundant source of data and alleviate certain safety concerns during the training process\. But the behaviours developed by agents in simulation are often specific to the characteristics of the simulator\. Due to modeling error, strategies that are successful in simulation may not transfer to their real world counterparts\. In this paper, we demonstrate a simple method to bridge this "reality gap"\. By randomizing the dynamics of the simulator during training, we are able to develop policies that are capable of adapting to very different dynamics, including ones that differ significantly from the dynamics on which the policies were trained\. This adaptivity enables the policies to generalize to the dynamics of the real world without any training on the physical system\. Our approach is demonstrated on an object pushing task using a robotic arm\. Despite being trained exclusively in simulation, our policies are able to maintain a similar level of performance when deployed on a real robot, reliably moving an object to a desired location from random initial configurations\. We explore the impact of various design decisions and show that the resulting policies are robust to significant calibration error\.

Similar Articles

Generalizing from simulation

OpenAI Blog

OpenAI describes challenges with conventional RL on robotics tasks and introduces Hindsight Experience Replay (HER), a new RL algorithm that enables agents to learn from binary rewards by reframing failures as intended outcomes, combined with domain randomization for sim-to-real transfer.

Asymmetric actor critic for image-based robot learning

OpenAI Blog

OpenAI proposes an asymmetric actor-critic method for robot learning that leverages full state observability in simulators to train policies that operate on partial observations (RGBD images), enabling effective sim-to-real transfer without real-world training data.

Spam detection in the physical world

OpenAI Blog

OpenAI demonstrates that domain randomization—randomly varying colors, textures, lighting, and camera settings in simulated training data—enables deep learning models to effectively transfer from simulation to real-world robotic spam detection tasks without retraining from scratch.

Domain randomization and generative models for robotic grasping

OpenAI Blog

Researchers explore a data generation pipeline using domain randomization and procedurally generated objects to train a deep neural network for robotic grasp planning. The proposed autoregressive model achieves >90% success on unseen objects in simulation and 80% in the real world, despite being trained only on random simulated objects.