cost-aware-evaluation

#cost-aware-evaluation

Multi-Agent Routing as Set-Valued Prediction: A WildChat Benchmark and Cost-Aware Evaluation

arXiv cs.LG ↗ · yesterday Cached

This paper formulates multi-agent routing as set-valued prediction, introduces a WildChat-derived benchmark with 3,000 prompts over a 12-agent catalog, and evaluates methods including supervised classifiers and cost-aware routing to study accuracy-cost trade-offs.

0 favorites 0 likes

cost-aware-evaluation

Multi-Agent Routing as Set-Valued Prediction: A WildChat Benchmark and Cost-Aware Evaluation

Submit Feedback