hidden-states

#hidden-states

Where Reliability Lives in Vision-Language Models: A Mechanistic Study of Attention, Hidden States, and Causal Circuits

arXiv cs.AI ↗ · 2d ago Cached

This paper challenges the 'Attention-Confidence Assumption' by demonstrating that attention map sharpness is a poor predictor of correctness in Vision-Language Models. Instead, it shows that reliability is better indicated by hidden-state geometry and self-consistency, with significant findings on architectural differences between late-fusion and early-fusion models.

0 favorites 0 likes

#hidden-states

LLM Agents Already Know When to Call Tools -- Even Without Reasoning

Hugging Face Daily Papers ↗ · 4d ago Cached

This paper introduces When2Tool, a benchmark to study when LLM agents actually need to call tools, and reveals that models already know tool necessity from hidden states but fail to act. The proposed Probe&Prefill method reduces unnecessary tool calls by 48% with minimal accuracy loss.

0 favorites 0 likes

#hidden-states

@rohanpaul_ai: Frozen LLMs still carry readable behavior signals deep inside their hidden states. And Proprioceptive AI has created Cy…

X AI KOLs Following ↗ · 6d ago

Proprioceptive AI released Cygnus, a tool that equips frozen LLMs with self-sensing adapters reading internal hidden states via gl(4,R) Lie algebra to isolate dark modes, boosting Qwen-32B's ARC-Challenge score from 82.2% to 94.97% on a single RTX 3090 without retraining.

0 favorites 0 likes

hidden-states

Where Reliability Lives in Vision-Language Models: A Mechanistic Study of Attention, Hidden States, and Causal Circuits

LLM Agents Already Know When to Call Tools -- Even Without Reasoning

@rohanpaul_ai: Frozen LLMs still carry readable behavior signals deep inside their hidden states. And Proprioceptive AI has created Cy…

Submit Feedback