consumer-simulation

#consumer-simulation

Can LLMs Think Like Consumers? Benchmarking Crowd-Level Reaction Reconstruction with ConsumerSimBench

arXiv cs.CL ↗ · 2026-05-19 Cached

Introduces ConsumerSimBench, a benchmark for evaluating LLMs' ability to reconstruct crowd-level consumer reactions from real Chinese social media topics. Tests show frontier models cover only 47.8% of real reaction criteria, highlighting a gap between technical benchmark performance and social intuition.

0 favorites 0 likes

consumer-simulation

Can LLMs Think Like Consumers? Benchmarking Crowd-Level Reaction Reconstruction with ConsumerSimBench

Submit Feedback