worst-case

#worst-case

Infra-Bayesian Reinforcement Learning Agents Outperform Classical RL For Worst-Case Robustness

arXiv cs.LG ↗ · 2026-05-25 Cached

This paper presents the first implementation of an infra-Bayesian reinforcement learning agent, demonstrating that it outperforms classical RL in worst-case regret and handles Newcomb's problem optimally, offering a step toward robustness under model misspecification.

0 favorites 0 likes

worst-case

Infra-Bayesian Reinforcement Learning Agents Outperform Classical RL For Worst-Case Robustness

Submit Feedback