progress-awareness

#progress-awareness

Retrospective Progress-Aware Self-Refinement for LLM Agent Training

arXiv cs.CL ↗ · 6d ago Cached

This paper introduces RePro, a framework that trains LLM agents to self-generate progress signals through a forward-then-reflect rollout paradigm, achieving up to 12% absolute success rate gains on WebShop, ALFWorld, and Sokoban benchmarks.

0 favorites 0 likes

progress-awareness

Retrospective Progress-Aware Self-Refinement for LLM Agent Training

Submit Feedback