real-world-images

WildTableBench: Benchmarking Multimodal Foundation Models on Table Understanding In the Wild

Hugging Face Daily Papers · 2026-05-01

WildTableBench is the first question-answering benchmark for real-world table images. It reveals that existing multimodal foundation models struggle with structural perception and numerical reasoning: only one model exceeds 50% accuracy.
