performance-variation

#performance-variation

Performance Variation in Deep Reinforcement Learning

arXiv cs.LG ↗ · 2026-06-08 Cached

This paper identifies limitations of conventional uncertainty estimates for deep reinforcement learning and proposes percentile-based statistics and visualization to better assess run-to-run performance variation. Case studies demonstrate the method on PPO, SAC, TD-MPC, DQN, and Rainbow algorithms.

0 favorites 0 likes

performance-variation

Performance Variation in Deep Reinforcement Learning

Submit Feedback