bayesian-reasoning

#bayesian-reasoning

BayesBench: Evaluating LLM Belief Trajectories Under Multi-Turn Evidence Accumulation

arXiv cs.AI ↗ · 4d ago Cached

BayesBench evaluates how closely large language models' belief updates match Bayesian reasoning in multi-turn evidence accumulation tasks, finding that while scaling improves latent inference, models struggle to use that understanding for downstream predictions.

0 favorites 0 likes

#bayesian-reasoning

BALAR : A Bayesian Agentic Loop for Active Reasoning

arXiv cs.AI ↗ · 2026-05-08 Cached

This paper introduces BALAR, a training-free Bayesian agentic loop algorithm that enables large language models to actively reason and ask clarifying questions in multi-turn interactions. It demonstrates significant performance improvements over baselines on detective, puzzle, and clinical diagnosis benchmarks.

0 favorites 0 likes

bayesian-reasoning

BayesBench: Evaluating LLM Belief Trajectories Under Multi-Turn Evidence Accumulation

BALAR : A Bayesian Agentic Loop for Active Reasoning

Submit Feedback