variance-reduction

Tag

Cards List
#variance-reduction

Variance reduction for policy gradient with action-dependent factorized baselines

OpenAI Blog · 2018-03-20 Cached

OpenAI researchers derive a bias-free action-dependent baseline for variance reduction in policy gradient methods, demonstrating improved learning efficiency on high-dimensional control tasks, multi-agent, and partially observed environments.

0 favorites 0 likes
← Back to home

Submit Feedback