belief-modeling

#belief-modeling

OmniToM: Benchmarking Theory of Mind in LLMs via Explicit Belief Modeling

arXiv cs.AI ↗ · 2026-05-27 Cached

OmniToM introduces a benchmark that evaluates large language models' theory of mind by requiring explicit belief structure extraction and labeling, revealing a bottleneck in tracking actor-specific beliefs despite strong performance on endpoint QA tasks.

0 favorites 0 likes

belief-modeling

OmniToM: Benchmarking Theory of Mind in LLMs via Explicit Belief Modeling

Submit Feedback