reward-functions

#reward-functions

A debugger for RL reward functions that detects reward hacking during training [P]

Reddit r/MachineLearning ↗ · 3d ago

A debugger that detects reward hacking in reinforcement learning reward functions during training, aiding developers in identifying and fixing issues.

0 favorites 0 likes

#reward-functions

Large-scale study of curiosity-driven learning

OpenAI Blog ↗ · 2018-08-13 Cached

OpenAI presents a large-scale empirical study of curiosity-driven reinforcement learning without extrinsic rewards across 54 benchmark environments, showing strong performance and investigating the role of feature spaces in prediction-based reward signals.

0 favorites 0 likes

reward-functions

A debugger for RL reward functions that detects reward hacking during training [P]

Large-scale study of curiosity-driven learning

Submit Feedback