preference-based

Offline Preference-Based Trajectory Evaluation

arXiv cs.LG · 2026-06-17

This paper proposes offline preference-based trajectory evaluation for agentic systems, which compares trajectories via temporal preferences rather than binary success metrics. It shows that this approach reduces ties from roughly 75% to 35%, improving discriminative power and data efficiency across diverse benchmarks.
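To illustrate why a preference-based comparison can produce fewer ties than a binary success metric, here is a minimal Python sketch on toy data. It is not the paper's method: the trajectory representation `(succeeded, steps_taken)`, the tie-breaking rule "fewer steps is preferred when outcomes match", and the names `binary_compare`, `temporal_compare`, and `tie_rate` are all assumptions made for illustration only.

```python
from itertools import combinations

# Hypothetical toy data: each trajectory is (succeeded, steps_taken).
trajectories = [
    (True, 4),
    (True, 7),
    (False, 5),
    (False, 5),
    (True, 4),
]

def binary_compare(a, b):
    """Return 1 if a is preferred, -1 if b is preferred, 0 on a tie.

    Only the final success flag is used, so any two trajectories
    with the same outcome are tied.
    """
    return (a[0] > b[0]) - (a[0] < b[0])

def temporal_compare(a, b):
    """Assumed preference rule (not taken from the paper).

    Success is compared first; if outcomes match, the trajectory
    that used fewer steps is preferred.
    """
    outcome = binary_compare(a, b)
    if outcome != 0:
        return outcome
    return (a[1] < b[1]) - (a[1] > b[1])

def tie_rate(items, compare):
    """Fraction of unordered pairs that the comparator cannot separate."""
    pairs = list(combinations(items, 2))
    ties = sum(1 for a, b in pairs if compare(a, b) == 0)
    return ties / len(pairs)

binary_ties = tie_rate(trajectories, binary_compare)
temporal_ties = tie_rate(trajectories, temporal_compare)
print(f"binary tie rate:   {binary_ties:.2f}")
print(f"temporal tie rate: {temporal_ties:.2f}")
```

On this toy set the finer-grained comparator leaves fewer pairs tied than the binary one. The numbers are specific to the made-up data and say nothing about the paper's reported 75% and 35% figures.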
