off-policy-learning

Tag

Cards List
#off-policy-learning

Ingredients for robotics research

OpenAI Blog · 2018-02-26 Cached

OpenAI presents Hindsight Experience Replay (HER), a reinforcement learning technique that enables robots to learn from failed attempts by retroactively treating achieved alternative outcomes as successful goals, allowing learning even with sparse reward signals.

0 favorites 0 likes
← Back to home

Submit Feedback