Tag
Introduces CAREBench, a benchmark grounded in appraisal theory to evaluate LLMs' emotion understanding through cognitive appraisal reasoning, revealing that current models struggle with reasoning and positive emotion recognition despite matching humans on some downstream tasks.