mllm-agents

Tag

Cards List
#mllm-agents

MineExplorer: Evaluating Open-World Exploration of MLLM Agents in Minecraft

Hugging Face Daily Papers · 2026-05-29 Cached

The MineExplorer benchmark evaluates multimodal large language model agents' open-world exploration abilities in Minecraft using atomic and multi-hop tasks designed through multi-agent synthesis. Experiments show that open-world exploration remains challenging, with strong models degrading sharply over longer trajectories.

0 favorites 0 likes
← Back to home

Submit Feedback