gpt-claude-study

Tag

Cards List
#gpt-claude-study

Talking to a Know-It-All GPT or a Second-Guesser Claude? How Repair reveals unreliable Multi-Turn Behavior in LLMs

arXiv cs.CL · 2026-04-22 Cached

Study shows GPT and Claude exhibit distinct, unreliable repair behaviors in multi-turn math dialogues, with some models resisting correction and others over-correcting.

0 favorites 0 likes
← Back to home

Submit Feedback