failure-detection

#failure-detection

I asked how you all handle agent memory. Here's the pattern in the replies, and the one thing nobody's actually solved.

Reddit r/AI_Agents ↗ · 22h ago

A community discussion on agent memory reveals that while various patches exist for what to write down (e.g., plain files, layered memory, post-mortems), the unsolved problem is what to keep—detecting failures is tractable, but deciding which lessons persist still needs human judgment.

0 favorites 0 likes

#failure-detection

How Language Models Fail: Token-Level Signatures of Committed and Persistent Reasoning Failures

arXiv cs.CL ↗ · 2d ago Cached

This paper characterizes two distinct processes by which language models fail in reasoning—committed failure and persistent uncertainty—using token-level uncertainty signals, and demonstrates implications for self-consistency and failure detection strategies.

0 favorites 0 likes

#failure-detection

AEGIS: A Backup Reflex for Physical AI

arXiv cs.AI ↗ · 2d ago Cached

AEGIS uses activation-probe early warning to switch to a stronger policy before failures compound in long-horizon robot manipulation, recovering twice as many failures as budget-matched escalation.

0 favorites 0 likes

#failure-detection

Hide-and-Seek in Trajectories: Discovering Failure Signals for VLA Runtime Monitoring

Hugging Face Daily Papers ↗ · 2026-05-29 Cached

Hide-and-Seek is a framework that detects robot execution failures in VLA models by localizing failure-indicative actions through contrastive learning without step-level annotations, achieving state-of-the-art multi-task failure detection.

0 favorites 0 likes

#failure-detection

Lost in the Folds: When Cross-Validation Is Not a Deep Ensemble for Uncertainty Estimation

Hugging Face Daily Papers ↗ · 2026-05-18 Cached

This paper compares cross-validation ensembles to deep ensembles for uncertainty estimation in medical image segmentation. Deep ensembles outperform cross-validation ensembles in calibration and failure detection, while cross-validation ensembles better approximate inter-rater variability.

0 favorites 0 likes

failure-detection

I asked how you all handle agent memory. Here's the pattern in the replies, and the one thing nobody's actually solved.

How Language Models Fail: Token-Level Signatures of Committed and Persistent Reasoning Failures

AEGIS: A Backup Reflex for Physical AI

Hide-and-Seek in Trajectories: Discovering Failure Signals for VLA Runtime Monitoring

Lost in the Folds: When Cross-Validation Is Not a Deep Ensemble for Uncertainty Estimation

Submit Feedback