consumer-simulation

Tag

Cards List
#consumer-simulation

Can LLMs Think Like Consumers? Benchmarking Crowd-Level Reaction Reconstruction with ConsumerSimBench

arXiv cs.CL · 2026-05-19 Cached

Introduces ConsumerSimBench, a benchmark for evaluating LLMs' ability to reconstruct crowd-level consumer reactions from real Chinese social media topics. Tests show frontier models cover only 47.8% of real reaction criteria, highlighting a gap between technical benchmark performance and social intuition.

0 favorites 0 likes
← Back to home

Submit Feedback