agent-adaptation

#agent-adaptation

EvoTest: Evolutionary Test-Time Learning for Self-Improving Agentic Systems

arXiv cs.CL ↗ · 2026-04-20 Cached

EvoTest introduces J-TTL, a benchmark for measuring agent test-time learning capabilities, and proposes an evolutionary framework where an Actor Agent plays games while an Evolver Agent iteratively improves the system's prompts, memory, and hyperparameters without fine-tuning. The method demonstrates superior performance compared to reflection and memory-based baselines on complex text-based games.

0 favorites 0 likes

agent-adaptation

EvoTest: Evolutionary Test-Time Learning for Self-Improving Agentic Systems

Submit Feedback