Can prompting reduce AI sycophancy or is it mostly model behavior?

Reddit r/artificial 06/04/26, 06:08 AM News

Summary

A user explores whether prompt engineering can reduce AI sycophancy in models like Gemini, ChatGPT, and Claude, or whether it's fundamentally a model alignment issue. The discussion touches on differences between models in handling disagreement and objective criticism.

I’ve noticed that Gemini often comes across as very agreeable. Even when I ask for an objective opinion, it sometimes seems to validate my assumptions first instead of directly challenging them. For example, when I ask whether my reasoning is flawed, it tends to open with something like “That’s a valid concern” or “You’re making a good point” before giving any criticism, which makes the criticism feel softened and less direct.

I’m curious whether prompting can meaningfully improve this, for example by asking the model to be more critical, or whether sycophancy is mostly a model alignment and personality issue. I’d also like to know whether Gemini, ChatGPT, Claude, and other models differ in how they handle disagreement and objective criticism.
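One rough way to check this yourself is to send the same question with and without a critique-first instruction and see whether the reply still opens with a softening phrase. The sketch below is only an illustration: the helper names and the instruction wording are hypothetical, the opener list contains just the two phrases quoted in the post, and no particular chatbot API is assumed (paste the replies in by hand or from whatever client you use).

```python
# Hypothetical helpers for comparing replies; not tied to any chatbot API.

# The two softening openers quoted in the post (straight apostrophes).
VALIDATION_OPENERS = (
    "that's a valid concern",
    "you're making a good point",
)

# Assumed wording for a critique-first instruction; adjust to taste.
CRITICAL_INSTRUCTION = (
    "State the strongest objection to my reasoning first. "
    "Do not open with praise or reassurance."
)


def build_critical_prompt(question: str) -> str:
    """Prepend the critique-first instruction to the user's question."""
    return f"{CRITICAL_INSTRUCTION}\n\n{question.strip()}"


def opens_with_validation(reply: str) -> bool:
    """Return True if the reply starts with one of the softening openers."""
    # Normalize curly apostrophes and drop leading quote marks so that
    # both typographic and plain-ASCII replies are matched.
    text = reply.replace("\u2019", "'").strip().lstrip("\"'\u201c").lower()
    return text.startswith(VALIDATION_OPENERS)
```

Running `opens_with_validation` over a handful of replies from each model, once with the plain question and once with `build_critical_prompt`, gives a crude signal of whether the instruction changes the opening. It says nothing about whether the criticism itself got more substantive.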

Original Article

I keep seeing people give up on AI because it gives them generic junk. 9 times out of 10 it's the prompt. I coach professionals on getting AI actually working for their job, and the same fix solves most of it.

Reddit r/ArtificialInteligence

Advice on improving AI outputs by crafting better prompts and building reusable systems, rather than generic requests.

Similar Articles

Observing sycophantic AI validate others reduces its appeal but not its persuasiveness

Do cloud chatbot's system prompts make them stupider?

@dbreunig: https://x.com/dbreunig/status/2069455716478603536

Should AI prompt human more?

