constrained-mdps

Seeing Before Colliding: Anticipatory Safe RL with Frozen Vision-Language Models

arXiv cs.LG · 17h ago

This paper presents VLM-Safe-RL, a framework that integrates frozen vision-language models into constrained MDP Lagrangian updates to provide anticipatory cost signals for safe reinforcement learning in high-speed visual control tasks. The method outperforms standard constraint-aware baselines on Safety-Gymnasium FormulaOne L2 and generalizes to held-out environments.
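For readers unfamiliar with the Lagrangian updates mentioned in the summary, here is a minimal sketch of the generic constrained-MDP dual update. It is not the paper's implementation. The function names, the learning rate, and the cost limit are all hypothetical, and the cost values are made-up numbers standing in for whatever cost signal is used (in the paper, one informed by a frozen vision-language model).

```python
# Generic Lagrangian relaxation for a constrained MDP (illustrative only).
# Objective: maximize reward subject to expected cost <= cost_limit.
# All names and numbers here are assumptions, not taken from the paper.

def update_lagrange_multiplier(lam, episode_cost, cost_limit, lr=0.05):
    """Projected dual ascent: lam grows when cost exceeds the limit,
    shrinks otherwise, and is clipped so it never goes below zero."""
    return max(0.0, lam + lr * (episode_cost - cost_limit))


def penalized_reward(reward, cost, lam):
    """Reward the policy actually optimizes: task reward minus weighted cost."""
    return reward - lam * cost


lam = 0.0
for episode_cost in [30.0, 28.0, 20.0, 10.0]:  # hypothetical per-episode costs
    lam = update_lagrange_multiplier(lam, episode_cost, cost_limit=25.0)
    print(f"cost={episode_cost:5.1f}  lambda={lam:.3f}")
```

The multiplier rises while episodes violate the limit and decays once they satisfy it, so the penalty on unsafe behavior adapts during training rather than being fixed in advance.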
