robust-mdps

Tag

Cards List
#robust-mdps

Robust Shielding for Safe Reinforcement Learning

arXiv cs.AI · 2026-06-02 Cached

Introduces a novel shielding framework for robust Markov decision processes (RMDPs) that formally guarantees safety under uncertain transition dynamics, proving soundness and optimality. The approach combines with PAC guarantees for learned models, enabling safe reinforcement learning in unknown environments.

0 favorites 0 likes
← Back to home

Submit Feedback