high-probability-guarantees

Tag

Cards List
#high-probability-guarantees

A Single Stepsize Suffices for Unprojected Linear TD(0): Simultaneous Robust and Fast Rates via Polyak--Ruppert Averaging

arXiv cs.LG · 6d ago Cached

This paper provides high-probability guarantees for an unprojected linear TD(0) algorithm with Polyak–Ruppert averaging under Markovian sampling, using a single stepsize schedule that achieves both robust curvature-free and fast curvature-dependent convergence rates.

0 favorites 0 likes
← Back to home

Submit Feedback