user-simulation

#user-simulation

RealUserSim: Bridging the Reality Gap in Agent Benchmarking via Grounded User Simulation

arXiv cs.AI ↗ · 2026-05-22 Cached

The paper introduces RealUserSim, a framework that grounds LLM-based user simulation in real human behavioral data from 14,000+ authentic conversations to bridge the reality gap in agent benchmarking. It shows that grounded simulation raises behavioral match rates from 24.2% to 45.3% and reveals failure mechanisms invisible to cooperative simulators.

0 favorites 0 likes

#user-simulation

SalesSim: Benchmarking and Aligning Multimodal Language Models as Retail User Simulators

arXiv cs.CL ↗ · 2026-05-12 Cached

This paper introduces SalesSim, a framework and benchmark for evaluating Multimodal LLMs as retail user simulators, identifying gaps in persona alignment and proposing a new reinforcement learning method called UserGRPO.

0 favorites 0 likes

user-simulation

RealUserSim: Bridging the Reality Gap in Agent Benchmarking via Grounded User Simulation

SalesSim: Benchmarking and Aligning Multimodal Language Models as Retail User Simulators

Submit Feedback