cods-2025

Tag

Cards List
#cods-2025

Results and Retrospective Analysis of the CODS 2025 AssetOpsBench Challenge

arXiv cs.AI · 6d ago Cached

This paper presents a retrospective analysis of the CODS 2025 AssetOpsBench challenge, evaluating multi-agent AI systems on industrial tasks. It highlights discrepancies between public and hidden leaderboards and offers diagnostics for future agentic benchmarks.

0 favorites 0 likes
← Back to home

Submit Feedback