Tag
This paper introduces PYTHALAB-MERA, an external controller for frozen local LLMs that uses validation-grounded memory and retrieval to improve coding agent performance. It demonstrates superior success rates in strict validation tasks compared to self-refinement baselines by leveraging execution feedback and temporal difference learning.