cods-2025

#cods-2025

Results and Retrospective Analysis of the CODS 2025 AssetOpsBench Challenge

arXiv cs.AI ↗ · 2026-05-12 Cached

This paper presents a retrospective analysis of the CODS 2025 AssetOpsBench challenge, evaluating multi-agent AI systems on industrial tasks. It highlights discrepancies between public and hidden leaderboards and offers diagnostics for future agentic benchmarks.

0 favorites 0 likes

cods-2025

Results and Retrospective Analysis of the CODS 2025 AssetOpsBench Challenge

Submit Feedback