Sycophancy in GPT-4o: what happened and what we’re doing about it

OpenAI Blog News

Summary

OpenAI rolled back a GPT-4o update that made the model overly flattering and sycophantic, acknowledging that the update prioritized short-term user feedback over long-term satisfaction. The company is implementing fixes including refined training techniques, improved guardrails for honesty, expanded user testing, and new personalization features to give users greater control over ChatGPT's behavior.

We have rolled back last week’s GPT‑4o update in ChatGPT so people are now using an earlier version with more balanced behavior. The update we removed was overly flattering or agreeable—often described as sycophantic.
Original Article
View Cached Full Text

Cached at: 04/20/26, 02:53 PM

# Sycophancy in GPT-4o: What happened and what we’re doing about it Source: [https://openai.com/index/sycophancy-in-gpt-4o/](https://openai.com/index/sycophancy-in-gpt-4o/) OpenAIWe have rolled back last week’s GPT‑4o update in ChatGPT so people are now using an earlier version with more balanced behavior\. The update we removed was overly flattering or agreeable—often described as sycophantic\. We are actively testing new fixes to address the issue\. We’re revising how we collect and incorporate feedback to heavily weight long\-term user satisfaction and we’re introducing more personalization features, giving users greater control over how ChatGPT behaves\. We want to explain what happened, why it matters, and how we’re addressing sycophancy\. In last week’s GPT‑4o update, we made adjustments aimed at improving the model’s default personality to make it feel more intuitive and effective across a variety of tasks\. When shaping model behavior, we start with baseline principles and instructions outlined in our[Model Spec⁠\(opens in a new window\)](https://model-spec.openai.com/2025-04-11.html)\. We also teach our models how to apply these principles by incorporating user signals like thumbs\-up / thumbs\-down feedback on ChatGPT responses\. However, in this update, we focused too much on short\-term feedback, and did not fully account for how users’ interactions with ChatGPT evolve over time\. As a result, GPT‑4o skewed towards responses that were overly supportive but disingenuous\. ChatGPT’s default personality deeply affects the way you experience and trust it\. Sycophantic interactions can be uncomfortable, unsettling, and cause distress\. We fell short and are working on getting it right\. Our goal is for ChatGPT to help users explore ideas, make decisions, or envision possibilities\. We designed ChatGPT’s default personality to reflect our mission and be useful, supportive, and respectful of different values and experience\. However, each of these desirable qualities like attempting to be useful or supportive can have unintended side effects\. And with 500 million people using ChatGPT each week, across every culture and context, a single default can’t capture every preference\. Beyond rolling back the latest GPT‑4o update, we’re taking more steps to realign the model’s behavior: - Refining core training techniques and system prompts to explicitly steer the model away from sycophancy\. - Building more guardrails to increase[honesty and transparency⁠\(opens in a new window\)](https://model-spec.openai.com/2025-04-11.html#avoid_sycophancy)—principles in our Model Spec\. - Expanding ways for more users to test and give direct feedback before deployment\. - Continue expanding our evaluations, building on the[Model Spec⁠\(opens in a new window\)](https://model-spec.openai.com/)and[our ongoing research⁠](https://openai.com/index/affective-use-study/), to help identify issues beyond sycophancy in the future\. We also believe users should have more control over how ChatGPT behaves and, to the extent that it is safe and feasible, make adjustments if they don’t agree with the default behavior\. Today, users can give the model specific instructions to shape its behavior with features like custom instructions\. We're also building new, easier ways for users to do this\. For example, users will be able to give real\-time feedback to directly influence their interactions and choose from multiple default personalities\. And, we’re exploring new ways to incorporate broader, democratic feedback into ChatGPT’s default behaviors\. We hope the feedback will help us better reflect diverse cultural values around the world and understand how you'd like ChatGPT to evolve—not just interaction by interaction, but over time\. We are grateful to everyone who’s spoken up about this\. It’s helping us build more helpful and better tools for you\.

Similar Articles

Expanding on what we missed with sycophancy

OpenAI Blog

OpenAI provides a deeper technical analysis of the GPT-4o sycophancy issue discovered in April, explaining their post-training and deployment processes, what went wrong with the reward signals, and improvements they're making to evaluation and safety checks.

Addendum to GPT-5 System Card: Sensitive conversations

OpenAI Blog

OpenAI released an update to GPT-5 on October 3 to improve handling of sensitive conversations around mental and emotional distress, reducing inadequate responses by 65-80% through collaboration with 170+ mental health experts. The company published a system card addendum and safety evaluations comparing the new model to the previous August 15 version.

Helping people when they need it most

OpenAI Blog

OpenAI shares details on ChatGPT's layered safeguards for users in mental and emotional distress, including empathetic responses, crisis hotline referrals, and human review for threats of harm to others. The post also notes GPT-5 improvements in reducing sycophancy and better handling mental health emergencies.