Tag
This paper introduces Ishigaki-IDS-Bench, a benchmark for evaluating LLMs' ability to generate Information Delivery Specification (IDS) XML from BIM information requirements. Evaluation of 10 LLMs shows best models achieve 65.6% macro F1 for content agreement but only 27.7% pass the Content audit, indicating struggles with standard and vocabulary constraints.