preference-coupling

Tag

Cards List
#preference-coupling

Calibrating the Evaluator: Does Probability Calibration Mitigate Preference Coupling in LLM Agent Feedback Loops?

arXiv cs.LG · 2d ago Cached

This paper presents the first study of probability calibration as a mitigation for evaluator preference coupling in LLM agent feedback loops, showing that calibrated evaluator judgments reduce coupling coefficients by 20-49% and divergence by 45-67%.

0 favorites 0 likes
← Back to home

Submit Feedback