rubric-generation

Tag

Cards List
#rubric-generation

Generating and Refining Dynamic Evaluation Rubrics for LLM-as-a-Judge

arXiv cs.CL · 2026-06-01 Cached

This paper proposes a training-free method to automatically generate fine-grained evaluation rubrics for LLM-as-a-judge without human annotation, and further introduces an iterative fine-tuning strategy for a rubric generator that outperforms larger proprietary models.

0 favorites 0 likes
#rubric-generation

SCOPE: Self-Play via Co-Evolving Policies for Open-Ended Tasks

Hugging Face Daily Papers · 2026-05-29 Cached

SCOPE is a self-play framework for open-ended tasks that co-evolves a Challenger and Solver policy, achieving up to +10.4 points on benchmarks without external supervision.

0 favorites 0 likes
← Back to home

Submit Feedback