multi-timescale

#multi-timescale

Performance-Driven Environment Abstraction with Multi-Timescale Learning

arXiv cs.LG ↗ · 2026-06-17 Cached

This paper proposes a performance-driven state abstraction method for reinforcement learning that directly optimizes decision quality, using a multi-timescale framework to jointly adapt the policy and a tree-structured abstraction. The algorithm refines or aggregates state space based on Q-value discrepancies, achieving better sample efficiency and faster replanning than baselines.

0 favorites 0 likes

#multi-timescale

Representation over Routing: Overcoming Surrogate Hacking in Multi-Timescale PPO

Hugging Face Daily Papers ↗ · 2026-05-21 Cached

This paper identifies surrogate hacking and temporal uncertainty as failure modes in multi-timescale RL, and proposes a Target Decoupling architecture that removes routing from the actor, using the critic for auxiliary representation learning. The method eliminates policy collapse on the LunarLander-v2 benchmark and stably surpasses the 'Environment Solved' threshold without hyperparameter hacking.

0 favorites 0 likes

multi-timescale

Performance-Driven Environment Abstraction with Multi-Timescale Learning

Representation over Routing: Overcoming Surrogate Hacking in Multi-Timescale PPO

Submit Feedback