coupling

Lying Is Just a Phase: The Hidden Alignment Transition in Language Model Scaling

arXiv cs.LG · 2026-05-20

This paper identifies a phase transition in language model scaling: below a critical parameter count, reasoning and truthfulness are anticorrelated, while above it they cooperate. It also provides diagnostics and interventions for improving alignment across model families.
