decision-control

Tag

Cards List
#decision-control

AgentAtlas: Beyond Outcome Leaderboards for LLM Agents

arXiv cs.AI · 2026-05-22 Cached

This paper introduces AgentAtlas, a framework that goes beyond outcome-only leaderboards for LLM agents by proposing a six-state control-decision taxonomy and a nine-category trajectory-failure taxonomy to evaluate agent behavior more comprehensively.

0 favorites 0 likes
← Back to home

Submit Feedback