mllm-agents

#mllm-agents

MineExplorer: Evaluating Open-World Exploration of MLLM Agents in Minecraft

Hugging Face Daily Papers ↗ · 2026-05-29 Cached

The MineExplorer benchmark evaluates multimodal large language model agents' open-world exploration abilities in Minecraft using atomic and multi-hop tasks designed through multi-agent synthesis. Experiments show that open-world exploration remains challenging, with strong models degrading sharply over longer trajectories.

0 favorites 0 likes

mllm-agents

MineExplorer: Evaluating Open-World Exploration of MLLM Agents in Minecraft

Submit Feedback