hidden-state-probing

Tag

Cards List
#hidden-state-probing

@omarsar0: Interesting interpretability paper on tool-using agents. The authors probe hidden states and find the model often recog…

X AI KOLs Following · yesterday Cached

This paper introduces a model-adaptive definition of tool necessity and finds a 26-54% mismatch between LLMs' internal recognition that a tool is needed and their actual tool-call actions, concentrated in the cognition-to-action transition. It reveals a 'knowing-doing gap' where the model often knows it should call a tool but fails to do so due to late-layer geometry rotating the signal nearly orthogonal to the action.

0 favorites 0 likes
#hidden-state-probing

Skill-RAG: Failure-State-Aware Retrieval Augmentation via Hidden-State Probing and Skill Routing

arXiv cs.CL · 2026-04-20 Cached

Skill-RAG is a failure-aware RAG framework that uses hidden-state probing and skill routing to diagnose and correct query-evidence misalignment in retrieval-augmented generation. The approach detects retrieval failures and selectively applies targeted skills (query rewriting, question decomposition, evidence focusing) to improve accuracy on hard cases and out-of-distribution datasets.

0 favorites 0 likes
← Back to home

Submit Feedback