self-evaluation

Tag

Cards List
#self-evaluation

Self-Evaluation Is Already There: Eliciting Latent Judge Calibration in Base LLMs with Minimal Data

Hugging Face Daily Papers · 2026-06-03 Cached

This paper introduces Self-Evaluation Elicitation (SEE), which uses calibration-coupled reinforcement learning and masked distillation to elicit latent judge calibration in base LLMs with minimal data, improving calibration across benchmarks while preserving answer quality.

0 favorites 0 likes
← Back to home

Submit Feedback