RoboEvolve: Co-Evolving Planner-Simulator for Robotic Manipulation with Limited Data

Hugging Face Daily Papers 05/13/26, 12:00 AM Papers

Summary

RoboEvolve is a framework that co-evolves a VLM planner and VGM simulator for robotic manipulation, achieving data efficiency with only 500 unlabeled seed images and robust continual learning.

The scalability of robotic manipulation is fundamentally bottlenecked by the scarcity of task-aligned physical interaction data. While vision-language models (VLMs) and video generation models (VGMs) hold promise for autonomous data synthesis, they suffer from semantic-spatial misalignment and physical hallucinations, respectively. To bridge this gap, we introduce RoboEvolve, a novel framework that couples a VLM planner and a VGM simulator into a mutually reinforcing co-evolutionary loop. Operating purely on unlabeled seed images, RoboEvolve leverages a cognitive-inspired dual-phase mechanism: (i) daytime exploration fosters physically grounded behavioral discovery through a semantic-controlled multi-granular reward, and (ii) nighttime consolidation mines "near-miss" failures to stabilize policy optimization. Guided by an autonomous progressive curriculum, the system naturally scales from simple atomic actions to complex tasks. Extensive experiments demonstrate that RoboEvolve (I) achieves superior effectiveness, elevating base planners by 30 absolute points and amplifying simulator success by 48% on average; (II) exhibits extreme data efficiency, surpassing fully supervised baselines with merely 500 unlabeled seeds--a 50x reduction; and (III) demonstrates robust continual learning without catastrophic forgetting.

Original Article

View Cached Full Text

Cached at: 05/14/26, 04:16 AM

Paper page - RoboEvolve: Co-Evolving Planner-Simulator for Robotic Manipulation with Limited Data

Source: https://huggingface.co/papers/2605.13775

Abstract

RoboEvolve combines vision-language and video generation models in a co-evolutionary framework to enable scalable robotic manipulation with improved data efficiency and continuous learning capabilities.

The scalability of robotic manipulation is fundamentally bottlenecked by the scarcity of task-aligned physical interaction data. Whilevision-language models(VLMs) andvideo generation models(VGMs) hold promise for autonomous data synthesis, they suffer from semantic-spatial misalignment and physical hallucinations, respectively. To bridge this gap, we introduce RoboEvolve, a novel framework that couples a VLM planner and a VGM simulator into a mutually reinforcingco-evolutionary loop. Operating purely on unlabeled seed images, RoboEvolve leverages a cognitive-inspired dual-phase mechanism: (i) daytime exploration fosters physically grounded behavioral discovery through asemantic-controlled multi-granular reward, and (ii) nighttime consolidation mines “near-miss” failures to stabilize policy optimization. Guided by anautonomous progressive curriculum, the system naturally scales from simple atomic actions to complex tasks. Extensive experiments demonstrate that RoboEvolve (I) achieves superior effectiveness, elevating base planners by 30 absolute points and amplifying simulator success by 48% on average; (II) exhibits extreme data efficiency, surpassing fully supervised baselines with merely 500 unlabeled seeds--a 50x reduction; and (III) demonstrates robustcontinual learningwithoutcatastrophic forgetting.

View arXiv page View PDF Add to collection

Get this paper in your agent:

hf papers read 2605\.13775

Don’t have the latest CLI?curl \-LsSf https://hf\.co/cli/install\.sh \| bash

Models citing this paper0

No model linking this paper

Cite arxiv.org/abs/2605.13775 in a model README.md to link it from this page.

Datasets citing this paper0

No dataset linking this paper

Cite arxiv.org/abs/2605.13775 in a dataset README.md to link it from this page.

Spaces citing this paper0

No Space linking this paper

Cite arxiv.org/abs/2605.13775 in a Space README.md to link it from this page.

Collections including this paper0

No Collection including this paper

Add this paper to acollectionto link it from this page.

RoboEvolve: Co-Evolving Planner-Simulator for Robotic Manipulation with Limited Data

Paper page - RoboEvolve: Co-Evolving Planner-Simulator for Robotic Manipulation with Limited Data

Abstract

Models citing this paper0

Datasets citing this paper0

Spaces citing this paper0

Collections including this paper0

Similar Articles

MLEvolve: A Self-Evolving Framework for Automated Machine Learning Algorithm Discovery

EvoTrainer: Co-Evolving LLM Policies and Training Harnesses for Autonomous Agentic Reinforcement Learning

CoEvolve: Training LLM Agents via Agent-Data Mutual Evolution

EvoMap/evolver

A better method for planning complex visual tasks

Submit Feedback

Similar Articles

MLEvolve: A Self-Evolving Framework for Automated Machine Learning Algorithm Discovery

EvoTrainer: Co-Evolving LLM Policies and Training Harnesses for Autonomous Agentic Reinforcement Learning

CoEvolve: Training LLM Agents via Agent-Data Mutual Evolution

A better method for planning complex visual tasks