theory-of-mind

#theory-of-mind

MindZero: Learning Online Mental Reasoning With Zero Annotations

arXiv cs.AI ↗ · 2d ago Cached

MindZero introduces a self-supervised reinforcement learning framework that trains multimodal large language models for efficient and robust online mental reasoning without requiring mental state annotations, outperforming model-based methods in accuracy and efficiency.

0 favorites 0 likes

#theory-of-mind

Differentiable Belief-based Opponent Shaping

arXiv cs.AI ↗ · 6d ago Cached

This paper introduces Differentiable Belief-based Opponent Shaping (D-BOS), a first-order method that treats observer beliefs as the shaped state and differentiates through belief update dynamics, allowing optimal strategies to emerge naturally from the environment's reward structure in hidden-role multi-agent settings.

0 favorites 0 likes

#theory-of-mind

OmniToM: Benchmarking Theory of Mind in LLMs via Explicit Belief Modeling

arXiv cs.AI ↗ · 2026-05-27 Cached

OmniToM introduces a benchmark that evaluates large language models' theory of mind by requiring explicit belief structure extraction and labeling, revealing a bottleneck in tracking actor-specific beliefs despite strong performance on endpoint QA tasks.

0 favorites 0 likes

#theory-of-mind

Agent-ToM: Learning to Monitor Autonomous LLM Agents via Theory-of-Mind Reasoning

arXiv cs.LG ↗ · 2026-05-26 Cached

Proposes Agent-ToM, a learning-to-monitor framework using Theory-of-Mind reasoning to detect covert malicious behavior in autonomous LLM agents by inferring beliefs and intents, outperforming baseline monitors.

0 favorites 0 likes

#theory-of-mind

OSCToM: RL-Guided Adversarial Generation for High-Order Theory of Mind

arXiv cs.AI ↗ · 2026-05-22 Cached

This paper presents OSCToM, an RL-guided method for generating adversarial data to test nested belief conflicts in LLMs, improving Theory of Mind reasoning on benchmarks like FANToM.

0 favorites 0 likes

#theory-of-mind

Does Theory of Mind Improvement Really Benefit Human-AI Interactions? Empirical Findings from Interactive Evaluations

arXiv cs.AI ↗ · 2026-05-18 Cached

This paper proposes a new interactive evaluation paradigm for Theory of Mind in LLMs, finding that improvements on static benchmarks do not translate to better performance in dynamic human-AI interactions, highlighting the need for interaction-based assessments.

0 favorites 0 likes

#theory-of-mind

Theory of Mind in Action: The Instruction Inference Task in Dynamic Human-Agent Collaboration

arXiv cs.CL ↗ · 2026-04-20 Cached

This paper introduces the Instruction Inference task to evaluate Theory of Mind capabilities in LLM-based agents during human-agent collaboration with incomplete or ambiguous instructions. The authors present Tomcat, an LLM agent tested on GPT-4o, DeepSeek-R1, and Gemma-3-27B, demonstrating performance comparable to human participants in inferring unspoken intentions.

0 favorites 0 likes

#theory-of-mind

Learning to model other minds

OpenAI Blog ↗ · 2017-09-14 Cached

OpenAI and University of Oxford researchers present LOLA (Learning with Opponent-Learning Awareness), a reinforcement learning method that enables agents to model and account for the learning of other agents, discovering cooperative strategies in multi-agent games like the iterated prisoner's dilemma and coin game.

0 favorites 0 likes

theory-of-mind

Submit Feedback