active-exploration

#active-exploration

Human Adults and LLMs as Scientists: Who Benefits from Active Exploration?

arXiv cs.AI ↗ · 2026-06-08 Cached

This study examines whether active exploration helps adults overcome the 'conjunctive handicap' in causal reasoning, comparing human performance to LLMs in a blicket detector task. Results show that active exploration improves conjunctive reasoning in adults, though some gaps remain, and LLMs approach human accuracy but explore less efficiently.

0 favorites 0 likes

#active-exploration

SpatialWorld: Benchmarking Interactive Spatial Reasoning of Multimodal Agents in Real-World Tasks

Hugging Face Daily Papers ↗ · 2026-06-08 Cached

SpatialWorld is a unified benchmark for evaluating interactive spatial reasoning in multimodal agents across diverse real-world tasks, revealing that even the strongest models achieve low task success rates.

0 favorites 0 likes

#active-exploration

Where to Look: Can Foundation Models Reach a Target Viewpoint Through Active Exploration?

Hugging Face Daily Papers ↗ · 2026-05-31 Cached

Introduces Target Viewpoint Reproduction (TVR) task and TVRBench benchmark for evaluating foundation models' ability to actively adjust 3D viewpoints to match target images. Experiments reveal significant limitations in current open and closed-source models, with a unified post-training framework boosting success rates from ~12% to ~51%.

0 favorites 0 likes

#active-exploration

ESI-Bench: Towards Embodied Spatial Intelligence that Closes the Perception-Action Loop

Hugging Face Daily Papers ↗ · 2026-05-18 Cached

Introduces ESI-BENCH, a comprehensive benchmark for embodied spatial intelligence built on OmniGibson, covering 10 task categories and 29 subcategories. Experiments show active exploration substantially outperforms passive approaches, with failures mainly due to action blindness rather than perception, revealing a metacognitive gap in models compared to humans.

0 favorites 0 likes

active-exploration

Human Adults and LLMs as Scientists: Who Benefits from Active Exploration?

SpatialWorld: Benchmarking Interactive Spatial Reasoning of Multimodal Agents in Real-World Tasks

Where to Look: Can Foundation Models Reach a Target Viewpoint Through Active Exploration?

ESI-Bench: Towards Embodied Spatial Intelligence that Closes the Perception-Action Loop

Submit Feedback