Tag
RoboStressBench proposes a benchmark for evaluating vision-language model robustness to physical visual stresses (material, viewpoint, lighting, geometry) in embodied scenes, identifying stress-specific failure modes.