perspective-bias

Why Far Looks Up: Probing Spatial Representation in Vision-Language Models

Hugging Face Daily Papers · 2026-05-28

This paper investigates spatial representation in vision-language models and reveals a consistent bias in which models conflate vertical image position with distance. It introduces SpatialTunnel, a synthetic benchmark designed to expose this shortcut, and finds that better-disentangled spatial representations improve robustness.
