gating-policy

#gating-policy

Finding the Time to Think: Learning Planning Budgets in Real-Time RL

arXiv cs.LG ↗ · 2d ago Cached

This paper introduces variable-delay real-time RL, where agents decide how long to deliberate in environments that progress during decision-making, and proposes a lightweight gating policy to select state-dependent planning budgets, outperforming fixed-budget and heuristic baselines in several real-time games.

0 favorites 0 likes

gating-policy

Finding the Time to Think: Learning Planning Budgets in Real-Time RL

Submit Feedback