Domain randomization and generative models for robotic grasping

OpenAI Blog Papers

Summary

Researchers explore a data generation pipeline using domain randomization and procedurally generated objects to train a deep neural network for robotic grasp planning. The proposed autoregressive model achieves >90% success on unseen objects in simulation and 80% in the real world, despite being trained only on random simulated objects.

No content available
Original Article Export to Word Export to PDF
View Cached Full Text

Cached at: 04/20/26, 02:56 PM

# Domain randomization and generative models for robotic grasping Source: [https://openai.com/index/domain-randomization-and-generative-models-for-robotic-grasping/](https://openai.com/index/domain-randomization-and-generative-models-for-robotic-grasping/) ## Abstract Deep learning\-based robotic grasping has made significant progress thanks to algorithmic improvements and increased data availability\. However, state\-of\-the\-art models are often trained on as few as hundreds or thousands of unique object instances, and as a result generalization can be a challenge\. In this work, we explore a novel data generation pipeline for training a deep neural network to perform grasp planning that applies the idea of domain randomization to object synthesis\. We generate millions of unique, unrealistic procedurally generated objects, and train a deep neural network to perform grasp planning on these objects\.Since the distribution of successful grasps for a given object can be highly multimodal, we propose an autoregressive grasp planning model that maps sensor inputs of a scene to a probability distribution over possible grasps\. This model allows us to sample grasps efficiently at test time \(or avoid sampling entirely\)\. We evaluate our model architecture and data generation pipeline in simulation and the real world\. We find we can achieve a \>90% success rate on previously unseen realistic objects at test time in simulation despite having only been trained on random objects\. We also demonstrate an 80% success rate on real\-world grasp attempts despite having only been trained on random simulated objects\.

Similar Articles

Spam detection in the physical world

OpenAI Blog

OpenAI demonstrates that domain randomization—randomly varying colors, textures, lighting, and camera settings in simulated training data—enables deep learning models to effectively transfer from simulation to real-world robotic spam detection tasks without retraining from scratch.

Sim-to-real transfer of robotic control with dynamics randomization

OpenAI Blog

OpenAI researchers demonstrate a method to bridge the reality gap in robotic control by training policies with randomized simulator dynamics, enabling robots trained purely in simulation to successfully transfer to real-world tasks like object manipulation without physical training.

Solving Rubik’s Cube with a robot hand

OpenAI Blog

OpenAI developed a robot hand capable of solving a Rubik's Cube using a novel technique called Automatic Domain Randomization (ADR), which progressively increases simulation difficulty to enable effective transfer of learned behaviors from simulation to the real world.

RAD-2: Scaling Reinforcement Learning in a Generator-Discriminator Framework

Hugging Face Daily Papers

RAD-2 presents a unified generator-discriminator framework for autonomous driving that combines diffusion-based trajectory generation with RL-optimized reranking, achieving 56% collision rate reduction compared to diffusion-based planners. The approach introduces techniques like Temporally Consistent Group Relative Policy Optimization and BEV-Warp simulation environment for efficient large-scale training.

Generative models

OpenAI Blog

OpenAI publishes an overview of generative models as an approach to developing machine understanding of the world, explaining how these models work by learning to generate data similar to their training sets and their potential applications across various domains.