MAP: A Map-then-Act Paradigm for Long-Horizon Interactive Agent Reasoning
Summary
The paper proposes the Map-then-Act Paradigm (MAP), a plug-and-play framework that moves environmental understanding ahead of execution in interactive LLM agents, achieving consistent gains across benchmarks and enabling frontier models to surpass near-zero baseline performance in 22 of 25 game environments.
Source: https://huggingface.co/papers/2605.13037
Abstract
Interactive LLM agents suffer from delayed environmental perception and epistemic bottlenecks due to reactive understanding during execution, which the proposed Map-then-Act Paradigm (MAP) addresses by acquiring environmental knowledge beforehand through global exploration, task-specific mapping, and knowledge-augmented execution.
Current interactive LLM agents rely on goal-conditioned stepwise planning, where environmental understanding is acquired reactively during execution rather than established beforehand. This temporal inversion leads to Delayed Environmental Perception: agents must infer environmental constraints through trial-and-error, resulting in an Epistemic Bottleneck that traps them in inefficient failure cycles. Inspired by human affordance perception and cognitive map theory, we propose the Map-then-Act Paradigm (MAP), a plug-and-play framework that shifts environment understanding before execution. MAP consists of three stages: (1) Global Exploration, acquiring environment-general priors; (2) Task-Specific Mapping, constructing a structured cognitive map; and (3) Knowledge-Augmented Execution, solving tasks grounded on the map. Experiments show consistent gains across benchmarks and LLMs. On ARC-AGI-3, MAP enables frontier models to surpass near-zero baseline performance in 22 of 25 game environments. We further introduce MAP-2K, a dataset of map-then-act trajectories, and show that training on it outperforms expert execution traces, suggesting that understanding environments is more fundamental than imitation.
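The three stages described in the abstract can be sketched as a minimal control loop. This is a hypothetical illustration, not the paper's implementation: the function names mirror the stage names, but their bodies are stand-ins for LLM-driven exploration, map construction, and planning.

```python
# Hypothetical sketch of the Map-then-Act loop (stage names from the paper;
# all function bodies are illustrative stand-ins, not the authors' code).

def global_exploration(env_actions):
    """Stage 1: probe each available action once to gather
    environment-general priors before any task execution."""
    return {a: f"observed effect of {a}" for a in env_actions}

def task_specific_mapping(priors, task):
    """Stage 2: distill the priors into a structured cognitive map
    relevant to the given task."""
    return {"task": task, "affordances": sorted(priors)}

def knowledge_augmented_execution(cognitive_map):
    """Stage 3: plan actions grounded on the map rather than
    discovering constraints by trial-and-error."""
    return [f"use {a}" for a in cognitive_map["affordances"]]

def map_then_act(env_actions, task):
    priors = global_exploration(env_actions)
    cmap = task_specific_mapping(priors, task)
    return knowledge_augmented_execution(cmap)

plan = map_then_act(["push", "pull"], "open door")
```

The key ordering constraint is that stages 1 and 2 finish before any task-directed action is taken, which is what distinguishes map-then-act from reactive stepwise planning.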
Get this paper in your agent:
hf papers read 2605.13037
Don't have the latest CLI? curl -LsSf https://hf.co/cli/install.sh | bash
Similar Articles
TMAS: Scaling Test-Time Compute via Multi-Agent Synergy
TMAS introduces a multi-agent framework that enhances large language model reasoning by scaling test-time compute through structured collaboration and hierarchical memory systems. The approach uses specialized agents, cross-trajectory information flow, and hybrid reward reinforcement learning to improve iterative scaling and stability on challenging reasoning benchmarks.
AIPO: Learning to Reason from Active Interaction
This paper introduces AIPO, a reinforcement learning framework that enhances LLM reasoning by allowing the model to actively consult collaborative agents during exploration to overcome capability boundaries.
Agentick: A Unified Benchmark for General Sequential Decision-Making Agents
This paper introduces Agentick, a unified benchmark for evaluating general sequential decision-making agents across RL, LLM, and VLM paradigms. It provides 37 procedurally generated tasks and reveals that no single approach currently dominates, highlighting significant room for improvement in agent autonomy.
Learning Agentic Policy from Action Guidance
The paper proposes ActGuide-RL, a method for training agentic policies in LLMs by using human action data as guidance to overcome exploration barriers in reinforcement learning without extensive supervised fine-tuning.
Tools as Continuous Flow for Evolving Agentic Reasoning
This paper introduces FlowAgent, a novel framework that reconceptualizes tool chaining as continuous trajectory generation using conditional flow matching to improve robustness in long-horizon agentic reasoning.