numerical-understanding


SPACENUM: Revisiting Spatial Numerical Understanding in VLMs

arXiv cs.AI · 2026-05-25

This paper presents SpaceNum, a unified framework for evaluating how vision-language models (VLMs) understand numerical values in spatial contexts. It finds that current models largely fail to ground numbers spatially and often perform close to random guessing.


