gating-policy

Tag

Cards List
#gating-policy

Finding the Time to Think: Learning Planning Budgets in Real-Time RL

arXiv cs.LG · 2d ago Cached

This paper introduces variable-delay real-time RL, where agents decide how long to deliberate in environments that progress during decision-making, and proposes a lightweight gating policy to select state-dependent planning budgets, outperforming fixed-budget and heuristic baselines in several real-time games.

0 favorites 0 likes
← Back to home

Submit Feedback