convergence-analysis

#convergence-analysis

A Switching System Theory of Q-Learning with Linear Function Approximation

arXiv cs.LG ↗ · 15h ago Cached

This paper presents a switching-system theory for Q-learning with linear function approximation, using joint spectral radius to analyze convergence stability under deterministic, i.i.d., and Markovian observations.

0 favorites 0 likes

#convergence-analysis

On the Divergence of Differential Temporal Difference Learning without Local Clocks

arXiv cs.LG ↗ · 2d ago Cached

This paper addresses an open problem in reinforcement learning by providing a counterexample showing that differential temporal difference learning can diverge when using a global clock, despite converging with a local clock, in average-reward settings.

0 favorites 0 likes

convergence-analysis

A Switching System Theory of Q-Learning with Linear Function Approximation

On the Divergence of Differential Temporal Difference Learning without Local Clocks

Submit Feedback