cognitive-development

Tag

Cards List
#cognitive-development

LEVANTE-bench: Multi-Scale Comparison of VLMs to Children Using Cognitive Tasks (or, "Is Your VLM Smarter Than a 5th Grader?")

arXiv cs.LG · 2026-06-05 Cached

This paper introduces LEVANTE-bench, a benchmark that systematically evaluates vision-language models on six cognitive tasks and compares their performance to children aged 5-12, finding that current VLMs align only partially with children's cognitive abilities.

0 favorites 0 likes
← Back to home

Submit Feedback