fluid-intelligence

Tag

Cards List
#fluid-intelligence

Mind's Eye: A Benchmark of Visual Abstraction, Transformation and Composition for Multimodal LLMs

Hugging Face Daily Papers · 2026-04-17 Cached

Researchers introduce Mind’s Eye, a benchmark of eight visual-cognitive tasks that reveals top multimodal LLMs score under 50% while humans reach 80%, exposing major gaps in visual abstraction, relation mapping and mental transformation.

0 favorites 0 likes
← Back to home

Submit Feedback