polyak-ruppert-averaging

A Single Stepsize Suffices for Unprojected Linear TD(0): Simultaneous Robust and Fast Rates via Polyak–Ruppert Averaging

arXiv cs.LG · 6d ago

This paper provides high-probability guarantees for an unprojected linear TD(0) algorithm with Polyak–Ruppert averaging under Markovian sampling, using a single stepsize schedule that achieves both robust curvature-free and fast curvature-dependent convergence rates.
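
For readers unfamiliar with the setup, the following is a minimal sketch of unprojected linear TD(0) with Polyak–Ruppert averaging on a single Markovian trajectory. It is an illustration only, not the paper's algorithm or analysis: the function name `td0_polyak_ruppert`, the parameter `alpha0`, and the stepsize `alpha0 / sqrt(t)` are assumptions chosen for the example, and the paper's actual stepsize schedule is not given in this summary.

```python
import random


def td0_polyak_ruppert(P, r, phi, gamma, num_steps, alpha0=0.5, seed=0):
    """Unprojected linear TD(0) with a running average of the iterates.

    P     : transition matrix as a list of rows (Markov reward process)
    r     : reward received on leaving each state
    phi   : feature vector for each state
    gamma : discount factor
    The stepsize alpha0 / sqrt(t) is an assumed schedule for illustration.
    """
    rng = random.Random(seed)
    states = range(len(P))
    d = len(phi[0])
    theta = [0.0] * d      # current TD(0) iterate
    theta_bar = [0.0] * d  # Polyak-Ruppert average of the iterates
    s = 0
    for t in range(1, num_steps + 1):
        # Markovian sampling: the next state depends on the current one.
        s_next = rng.choices(states, weights=P[s])[0]
        v_s = sum(w * x for w, x in zip(theta, phi[s]))
        v_next = sum(w * x for w, x in zip(theta, phi[s_next]))
        td_error = r[s] + gamma * v_next - v_s
        alpha = alpha0 / t ** 0.5
        # No projection step: the iterate is updated and left as is.
        theta = [w + alpha * td_error * x for w, x in zip(theta, phi[s])]
        # Incremental mean of theta_1, ..., theta_t.
        theta_bar = [b + (w - b) / t for b, w in zip(theta_bar, theta)]
        s = s_next
    return theta, theta_bar


# Hypothetical two-state example with one-hot features, so the TD fixed
# point coincides with the true value function V = (I - gamma P)^{-1} r.
P = [[0.5, 0.5], [0.5, 0.5]]
r = [1.0, 0.0]
phi = [[1.0, 0.0], [0.0, 1.0]]
last_iterate, averaged = td0_polyak_ruppert(P, r, phi, gamma=0.5, num_steps=20000)
```

In this toy chain the exact value function is (1.5, 0.5), so both the last iterate and the averaged iterate can be compared against it. The averaged iterate is the quantity that Polyak–Ruppert guarantees usually concern.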
