Tag
This paper presents a novel blueprint for self-improving agents that combines scaffold editing and weight training through a meta-agent and feedback-agent, achieving a 14x speedup on a CUDA kernel for AlphaFold.
This paper introduces the Meta-Agent Challenge (MAC), a benchmark for evaluating AI models' ability to autonomously develop agent systems through iterative programming. Results show that current models rarely match human baselines and exhibit issues like reward hacking, highlighting gaps in self-improvement capabilities.