mdp

#mdp

The Sim-to-Real Gap of Foundation Model Agents: A Unified MDP Perspective

arXiv cs.AI ↗ · 2026-06-08 Cached

This paper formalizes the sim-to-real gap for foundation model agents as a Markov Decision Process problem, proposing a unified research agenda to adapt classical solutions like domain randomization for improving agent robustness and reliability in real-world deployment.

0 favorites 0 likes

#mdp

A Goal-Set Characterization of Task Composition in the Boolean Task Algebra

arXiv cs.LG ↗ · 2026-06-04 Cached

This paper revisits the Boolean Task Algebra (BTA) for zero-shot task composition in reinforcement learning, proving that in deterministic MDPs all optimal extended Q-functions collapse to just two components (universal and empty tasks), making the originally proposed logarithmic base task set redundant. The authors introduce a goal-set-based composition method that reduces learning costs and composition time while preserving policy performance across multiple experimental domains.

0 favorites 0 likes

mdp

The Sim-to-Real Gap of Foundation Model Agents: A Unified MDP Perspective

A Goal-Set Characterization of Task Composition in the Boolean Task Algebra

Submit Feedback