worst-case


Infra-Bayesian Reinforcement Learning Agents Outperform Classical RL For Worst-Case Robustness

arXiv cs.LG · 2026-05-25

This paper presents the first implementation of an infra-Bayesian reinforcement learning agent, demonstrating that it outperforms classical RL in worst-case regret and handles Newcomb's problem optimally, offering a step toward robustness under model misspecification.
