mmlu

Tag

Cards List
#mmlu

Capability Conditioned Scaffolding for Professional Human LLM Collaboration

arXiv cs.CL · 19h ago Cached

Introduces Capability Conditioned Scaffolding, a framework for LLM collaboration that adapts intervention based on user expertise domains to prevent Professional Domain Drift, with pilot evaluation on MMLU subsets.

0 favorites 0 likes
#mmlu

Domain-level metacognitive monitoring in frontier LLMs: A 33-model atlas

arXiv cs.CL · 2026-05-11 Cached

This study presents a 33-model atlas analyzing domain-level metacognitive monitoring in frontier LLMs using MMLU benchmarks, revealing significant variations in confidence calibration across different knowledge domains that are obscured by aggregate metrics.

0 favorites 0 likes
← Back to home

Submit Feedback