convergence-analysis

Tag

Cards List
#convergence-analysis

Behavior-Induced Mirror-Prox Temporal-Difference Learning for Faster Off-Policy Prediction

arXiv cs.AI · 2026-05-29 Cached

This paper proposes STHTD-MP, a behavior-induced Mirror-Prox temporal-difference method for faster off-policy prediction in reinforcement learning. It replaces the covariance metric with the behavior-policy Bellman matrix and provides convergence analysis and experimental comparisons.

0 favorites 0 likes
#convergence-analysis

Sign-Separated Finite-Time Error Analysis of Q-Learning

arXiv cs.AI · 2026-05-18 Cached

This paper develops a sign-separated finite-time error analysis for constant step-size Q-learning, decomposing the error into negative and positive parts and providing bounds that reveal an asymmetry related to overestimation.

0 favorites 0 likes
#convergence-analysis

A Switching System Theory of Q-Learning with Linear Function Approximation

arXiv cs.LG · 2026-05-13 Cached

This paper presents a switching-system theory for Q-learning with linear function approximation, using joint spectral radius to analyze convergence stability under deterministic, i.i.d., and Markovian observations.

0 favorites 0 likes
#convergence-analysis

On the Divergence of Differential Temporal Difference Learning without Local Clocks

arXiv cs.LG · 2026-05-11 Cached

This paper addresses an open problem in reinforcement learning by providing a counterexample showing that differential temporal difference learning can diverge when using a global clock, despite converging with a local clock, in average-reward settings.

0 favorites 0 likes
← Back to home

Submit Feedback