action-trajectories

Tag

Cards List
#action-trajectories

PreAct-Bench: Benchmarking Predictive Monitoring in LLMs

arXiv cs.LG · 2026-06-10 Cached

PreAct-Bench is a benchmark of 1,000 paired ethical and unethical action trajectories across five domains, designed to evaluate the ability of LLMs to predict harmful outcomes from partial trajectories (predictive monitoring). Results show that while humans perform well, current LLMs struggle, highlighting the need for future-oriented risk reasoning.

0 favorites 0 likes
← Back to home

Submit Feedback