action-trajectories

#action-trajectories

PreAct-Bench: Benchmarking Predictive Monitoring in LLMs

arXiv cs.LG ↗ · 2026-06-10 Cached

PreAct-Bench is a benchmark of 1,000 paired ethical and unethical action trajectories across five domains, designed to evaluate the ability of LLMs to predict harmful outcomes from partial trajectories (predictive monitoring). Results show that while humans perform well, current LLMs struggle, highlighting the need for future-oriented risk reasoning.

0 favorites 0 likes

action-trajectories

PreAct-Bench: Benchmarking Predictive Monitoring in LLMs

Submit Feedback