positional-bias

Tag

Cards List
#positional-bias

The Readout Shortcut: Positional Number Copying Dominates Arithmetic CoT Readout in Small Language Models

arXiv cs.LG · 2026-05-25 Cached

This paper identifies a 'positional copying' shortcut where small language models answer arithmetic questions by copying the last number before the answer delimiter, bypassing actual reasoning. This effect explains why shuffling CoT steps retains performance; it accounts for 89-92% of teacher-forcing accuracy in 1-3B models on GSM8K.

0 favorites 0 likes
#positional-bias

Self-Describing Structured Data with Dual-Layer Guidance: A Lightweight Alternative to RAG for Precision Retrieval in Large-Scale LLM Knowledge Navigation

arXiv cs.CL · 2026-04-23 Cached

SDSR proposes lightweight self-describing structured data with dual-layer guidance to exploit LLM primacy bias, achieving 100% routing accuracy without vector DBs.

0 favorites 0 likes
← Back to home

Submit Feedback