model-based

#model-based

Generative OOD-regularized Model-based Policy Optimization

arXiv cs.LG ↗ · 2026-05-26 Cached

Introduces GORMPO, a density-regularized offline RL algorithm that uses generative density modeling to restrict policy updates to high-density areas, achieving 17% improvement on a real-world medical dataset and outperforming state-of-the-art baselines.

0 favorites 0 likes

model-based

Generative OOD-regularized Model-based Policy Optimization

Submit Feedback