Tag
HakushoBench is a Japanese chart and table VQA benchmark built from governmental white papers to evaluate vision-language models' understanding of complex visual data, challenging open-weight models with a 58.6% accuracy and a 34.9-point gap to proprietary models.