metacognition

Tag

Cards List
#metacognition

Humans Disengage, Reasoning Models Persist: Separating Difficulty Registration from Deliberation Allocation

arXiv cs.AI · 4d ago Cached

This paper dissociates difficulty registration from deliberation allocation in large reasoning models (LRMs) and humans, finding that LRMs spend more tokens on problems they get wrong while humans spend less time on failures, revealing opposite within-item patterns despite similar cross-item difficulty correlations.

0 favorites 0 likes
#metacognition

Faithful uncertainty in LLM agents: calibration vs utility tradeoff in practice[D]

Reddit r/MachineLearning · 2026-06-04

A practitioner discusses the calibration vs. utility tradeoff in LLM agents, sharing experience with a verifier-based pipeline that reduces hallucinated tool calls by ~60% but introduces latency costs and drops easy correct answers.

0 favorites 0 likes
#metacognition

Making LLMs tell you how confident they really are through probe-targeted fine tuning.[R]

Reddit r/MachineLearning · 2026-05-29

This research presents probe-targeted fine-tuning (LoRA) to make LLMs verbally express their internal confidence, achieving causal control over confidence outputs and demonstrating that models often know when they are right or wrong but fail to articulate it.

0 favorites 0 likes
#metacognition

Can LLMs Introspect? A Reality Check

arXiv cs.AI · 2026-05-27 Cached

This paper argues that recent claims about LLMs' ability to introspect are not justified, as behavioral evidence alone cannot distinguish genuine introspection from pattern matching on surface-level cues. The authors re-examine two evaluation paradigms and find that models rely on input-level features rather than genuine access to internal states.

0 favorites 0 likes
#metacognition

LLMs Show No Signs Of Individuated Metacognition

arXiv cs.LG · 2026-05-26 Cached

This paper investigates whether frontier LLMs exhibit individuated metacognition—the ability to assess their own item-level capabilities beyond shared signals. Through factor analysis and pairwise calibration across 20 models and six benchmarks, the authors find no evidence of such metacognition; confidence differences reduce to a single shared difficulty factor, suggesting models rely on a common difficulty signal rather than model-specific self-knowledge.

0 favorites 0 likes
#metacognition

@rohanpaul_ai: New Google paper says LLMs should stop pretending certainty and instead clearly show when they are unsure. Hallucinatio…

X AI KOLs Following · 2026-05-25 Cached

A new Google paper argues that LLMs should focus on expressing uncertainty honestly rather than aiming for perfect factuality, proposing 'faithful uncertainty' to build trust.

0 favorites 0 likes
#metacognition

Metacognition as Reward: Reinforcing LLM Reasoning via Knowledge and Regulation Signals

arXiv cs.CL · 2026-05-25 Cached

Introduces Metacognition-as-Reward (MaR), a reinforcement learning framework that guides LLM reasoning via metacognitive knowledge and regulation signals, achieving up to 11% improvement over vanilla methods on reasoning benchmarks.

0 favorites 0 likes
#metacognition

@IntuitMachine: https://x.com/IntuitMachine/status/2058141021842571510

X AI KOLs Timeline · 2026-05-23 Cached

This essay argues that evaluation is the hardest problem in production AI, not generation, and decomposes AI self-knowledge into calibration, discrimination, and expression, with implications for system design.

0 favorites 0 likes
#metacognition

Position: Artificial Intelligence Needs Meta Intelligence -- the Case for Metacognitive AI

arXiv cs.AI · 2026-05-18 Cached

This position paper argues that incorporating metacognition as a design principle can lead to more accurate, secure, and efficient AI systems, and demonstrates the concept through a Federated Learning case study and a software framework for experimentation.

0 favorites 0 likes
#metacognition

LLMs Know When They Know, but Do Not Act on It: A Metacognitive Harness for Test-time Scaling

arXiv cs.LG · 2026-05-15 Cached

This paper proposes a metacognitive harness that separates monitoring from reasoning in LLMs, using pre-solve feeling-of-knowing and post-solve judgment-of-learning signals to control when to trust, retry, or aggregate answers, improving accuracy on text, code, and multimodal benchmarks without parameter updates.

0 favorites 0 likes
#metacognition

TRIAGE: Evaluating Prospective Metacognitive Control in LLMs under Resource Constraints

arXiv cs.AI · 2026-05-14 Cached

Introduces TRIAGE, a framework for evaluating LLMs' prospective metacognitive control under token budgets, finding substantial gaps in their ability to allocate compute efficiently across problems.

0 favorites 0 likes
#metacognition

Decomposing and Steering Functional Metacognition in Large Language Models

arXiv cs.CL · 2026-05-12 Cached

This research paper investigates functional metacognition in Large Language Models, demonstrating that internal states like evaluation awareness and self-assessed capability are linearly decodable from residual stream activations. The authors propose a mechanistic framework to steer these states, showing causal control over reasoning behaviors, verbosity, and safety responses.

0 favorites 0 likes
#metacognition

Domain-level metacognitive monitoring in frontier LLMs: A 33-model atlas

arXiv cs.CL · 2026-05-11 Cached

This study presents a 33-model atlas analyzing domain-level metacognitive monitoring in frontier LLMs using MMLU benchmarks, revealing significant variations in confidence calibration across different knowledge domains that are obscured by aggregate metrics.

0 favorites 0 likes
#metacognition

From Intention to Text: AI-Supported Goal Setting in Academic Writing

arXiv cs.CL · 2026-04-20 Cached

This paper presents WriteFlow, an AI voice-based writing assistant designed to support reflective academic writing through goal-oriented interaction, addressing limitations of efficiency-focused writing tools by scaffolding metacognitive regulation and goal articulation. Findings from a Wizard-of-Oz study with 12 expert users demonstrate that the system effectively supports iterative goal refinement and goal-text alignment during the drafting process.

0 favorites 0 likes
#metacognition

The Metacognitive Monitoring Battery: A Cross-Domain Benchmark for LLM Self-Monitoring

arXiv cs.CL · 2026-04-20 Cached

A new cross-domain benchmark (Metacognitive Monitoring Battery) with 524 items evaluates LLM self-monitoring capabilities across six cognitive domains using human psychometric methodology. Applied to 20 frontier LLMs, it reveals three distinct metacognitive profiles and shows that accuracy rank and metacognitive sensitivity rank are largely inverted.

0 favorites 0 likes
← Back to home

Submit Feedback