monte-carlo-dropout

Tag

Cards List
#monte-carlo-dropout

Reward Models Can Be Too Sensitive (22 minute read)

TLDR AI · 4d ago Cached

This paper argues that reward models in RL are often oversensitive, assigning different scores to equally good responses, and proposes a training-free discretization algorithm using Monte Carlo dropout to reduce oversensitivity, improving policy quality.

0 favorites 0 likes
#monte-carlo-dropout

Discretizing Reward Models

Hugging Face Daily Papers · 2026-06-19 Cached

This paper identifies oversensitivity in continuous reward models for reinforcement learning, where equally good responses receive different scores, and proposes a discretization technique using Monte Carlo dropout to reduce this oversensitivity while maintaining discriminative ability, leading to better policies and less reward hacking.

0 favorites 0 likes
← Back to home

Submit Feedback