Tag
This paper introduces IntentGrasp, a comprehensive benchmark for evaluating large language models' intent understanding capabilities, revealing poor performance across 20 tested models. It proposes Intentional Fine-Tuning (IFT) as a solution, which significantly improves model performance and demonstrates strong cross-domain generalizability.