surprisal

#surprisal

STARE: Surprisal-Guided Token-Level Advantage Reweighting for Policy Entropy Stability

Hugging Face Daily Papers · 2026-06-17

STARE addresses policy entropy collapse in GRPO-based reinforcement learning for large language models by combining surprisal-guided token-level advantage reweighting with target-entropy regulation. The paper reports accuracy gains of 4%-8% on AIME benchmarks.

#surprisal

Trajectory Dynamics in Language Model Hidden States Predict Human Processing Costs Beyond Surprisal

arXiv cs.CL · 2026-06-05

Introduces trajectory extrapolation error, a measure derived from transformer LM hidden states that predicts human reading times independently of surprisal, revealing a dissociable component of incremental processing cost.

#surprisal

Why are language models less surprised than humans? Testing the Parse Multiplicity Mismatch Hypothesis

arXiv cs.CL · 2026-05-18

This paper tests the Parse Multiplicity Mismatch Hypothesis, which proposes that language models underpredict human processing difficulty in garden path sentences because they can consider more simultaneous parses than humans do. Using RNNGs with beam search, the authors find that reducing the number of active parses increases predicted garden path effects, but not enough to fully capture the human data.
