human-ai-agreement

#human-ai-agreement

AI4SE and SE4AI Exploration: A Decade Looking Back and Forward

arXiv cs.AI · 2026-06-20

This paper reviews the progress in AI for Systems Engineering (AI4SE) and Systems Engineering for AI (SE4AI) over the past decade, identifies five critical research gaps, and provides a human-AI agreement dataset and web explorer for relevance judgments.

#human-ai-agreement

Quantifying the Statistical Effect of Rubric Modifications on Human-Autorater Agreement

arXiv cs.CL · 2026-05-08

This study analyzes how modifications to evaluation rubrics, such as shifting from holistic to analytic criteria, affect the agreement between human raters and AI autoraters. The findings suggest that providing examples and reducing bias improve agreement, while higher complexity tends to decrease it.
