role-separation

Tag

Cards List
#role-separation

TeamBench: Evaluating Agent Coordination under Enforced Role Separation

arXiv cs.AI · 5d ago Cached

This article introduces TeamBench, a benchmark for evaluating agent coordination under enforced role separation, addressing issues where prompt-only roles may bypass intended constraints.

0 favorites 0 likes
← Back to home

Submit Feedback