Sycophancy in GPT-4o: what happened and what we’re doing about it
Summary
OpenAI rolled back a GPT-4o update that made the model overly flattering and sycophantic, acknowledging that the update prioritized short-term user feedback over long-term satisfaction. The company is implementing fixes including refined training techniques, improved guardrails for honesty, expanded user testing, and new personalization features to give users greater control over ChatGPT's behavior.
View Cached Full Text
Cached at: 04/20/26, 02:53 PM
Similar Articles
Expanding on what we missed with sycophancy
OpenAI provides a deeper technical analysis of the GPT-4o sycophancy issue discovered in April, explaining their post-training and deployment processes, what went wrong with the reward signals, and improvements they're making to evaluation and safety checks.
Addendum to GPT-5 System Card: Sensitive conversations
OpenAI released an update to GPT-5 on October 3 to improve handling of sensitive conversations around mental and emotional distress, reducing inadequate responses by 65-80% through collaboration with 170+ mental health experts. The company published a system card addendum and safety evaluations comparing the new model to the previous August 15 version.
Helping people when they need it most
OpenAI shares details on ChatGPT's layered safeguards for users in mental and emotional distress, including empathetic responses, crisis hotline referrals, and human review for threats of harm to others. The post also notes GPT-5 improvements in reducing sycophancy and better handling mental health emergencies.
OpenAI Updates GPT-5.5 Instant to Make ChatGPT More Natural and Useful (1 minute read)
OpenAI has released an updated version of GPT-5.5 Instant that improves its ability to understand intent, handle complex constraints, and provide better recommendations.
@KarelDoostrlnck: Do you use ChatGPT in a language other than English? How does the new update feel in your language? We would love your …
OpenAI shipped a new version of GPT-5.5 Instant with improvements in sycophancy, factuality, and multilingual performance, and is seeking user feedback.