@HuggingPapers: When should LLMs update, preserve, or ignore information? Contextual Belief Management is what long-horizon reasoning w…
Summary
Introduces BeliefTrack, a method for contextual belief management in LLMs, reducing reasoning failures by over 70%.
View Cached Full Text
Cached at: 05/31/26, 04:58 AM
When should LLMs update, preserve, or ignore information?
Contextual Belief Management is what long-horizon reasoning was missing. We introduce BeliefTrack—and show that optimizing belief states cuts reasoning failures by over 70%. https://t.co/7gwuNLNd1t
Similar Articles
When Should Models Change Their Minds? Contextual Belief Management in Large Language Models
This paper introduces Contextual Belief Management (CBM) for LLMs to handle long-term information, proposes the BeliefTrack benchmark for evaluation, and demonstrates that reinforcement learning and representation-level steering significantly reduce belief management failures.
Parallel LLM Reasoning for Bias-Resilient, Robust Conceptual Abstraction
This paper proposes a framework for parallel chunk-level processing of long documents with LLMs to reduce cumulative bias and improve evidence traceability, achieving significant reductions in omission errors and unsupported claims.
Belief Engine: Configurable and Inspectable Stance Dynamics in Multi-Agent LLM Deliberation
The paper introduces the Belief Engine, an auditable belief-update layer for LLM agents that makes stance changes in multi-agent deliberation configurable and inspectable by treating belief as an evidential state with explicit update rules.
LLMs Know When They Know, but Do Not Act on It: A Metacognitive Harness for Test-time Scaling
This paper proposes a metacognitive harness that separates monitoring from reasoning in LLMs, using pre-solve feeling-of-knowing and post-solve judgment-of-learning signals to control when to trust, retry, or aggregate answers, improving accuracy on text, code, and multimodal benchmarks without parameter updates.
When Correct Beliefs Collapse: Epistemic Resilience of LLMs under Clinical Pressure
This paper investigates how large language models maintain correct beliefs under adversarial pressure in clinical settings, proposing R-FT fine-tuning to improve epistemic resilience while balancing corrigibility, and demonstrating significant robustness gains on medical benchmarks.