model-based

#model-based

Generative OOD-regularized Model-based Policy Optimization

arXiv cs.LG · 2026-05-26

Introduces GORMPO, a density-regularized offline RL algorithm that uses generative density modeling to restrict policy updates to high-density regions. It achieves a 17% improvement on a real-world medical dataset and outperforms state-of-the-art baselines.
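
The summary does not state the actual objective, so the following is only a generic sketch of density-based regularization, not the paper's formulation. It assumes a diagonal Gaussian as a stand-in for the generative density model and subtracts a penalty from the reward when a state-action vector falls below a log-density threshold. All names (`fit_diag_gaussian`, `penalized_reward`, `threshold`, `lam`) are hypothetical.

```python
import math


def fit_diag_gaussian(samples):
    """Fit an independent Gaussian per dimension to a list of equal-length vectors."""
    n = len(samples)
    dim = len(samples[0])
    means = [sum(s[d] for s in samples) / n for d in range(dim)]
    # A small floor on the variance avoids division by zero for constant features.
    variances = [
        max(sum((s[d] - means[d]) ** 2 for s in samples) / n, 1e-6)
        for d in range(dim)
    ]
    return means, variances


def log_density(x, means, variances):
    """Log-density of x under the fitted diagonal Gaussian."""
    return sum(
        -0.5 * math.log(2 * math.pi * v) - (xi - m) ** 2 / (2 * v)
        for xi, m, v in zip(x, means, variances)
    )


def penalized_reward(reward, x, means, variances, threshold, lam=1.0):
    """Subtract lam * (threshold - log p(x)) when x lies below the density threshold.

    Assumption: a hinge penalty on the log-density; the paper may use a different form.
    """
    shortfall = max(0.0, threshold - log_density(x, means, variances))
    return reward - lam * shortfall


# Toy offline dataset of (state, action) vectors; values are illustrative only.
dataset = [[0.0, 0.0], [1.0, 1.0], [0.0, 1.0], [1.0, 0.0]]
means, variances = fit_diag_gaussian(dataset)

in_dist = penalized_reward(1.0, [0.5, 0.5], means, variances, threshold=-3.0)
out_of_dist = penalized_reward(1.0, [5.0, 5.0], means, variances, threshold=-3.0)
```

In this toy setup the point near the data keeps its reward unchanged, while the far-away point is penalized heavily, which is the qualitative behavior a density regularizer is meant to produce.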
