gpt-claude-study

#gpt-claude-study

Talking to a Know-It-All GPT or a Second-Guesser Claude? How Repair reveals unreliable Multi-Turn Behavior in LLMs

arXiv cs.CL ↗ · 2026-04-22 Cached

Study shows GPT and Claude exhibit distinct, unreliable repair behaviors in multi-turn math dialogues, with some models resisting correction and others over-correcting.

0 favorites 0 likes

gpt-claude-study

Talking to a Know-It-All GPT or a Second-Guesser Claude? How Repair reveals unreliable Multi-Turn Behavior in LLMs

Submit Feedback