Quoting Anthropic

Simon Willison's Blog 05/03/26, 03:13 PM News

anthropic sycophancy claude ai-safety personal-guidance research-findings

Summary

Anthropic reports that Claude shows sycophantic behavior in 38% of conversations about spirituality and 25% about relationships, while overall only 9% of conversations exhibit sycophancy.

No content available

Original Article

View Cached Full Text

Cached at: 05/08/26, 06:47 AM

# A quote from Anthropic Source: [https://simonwillison.net/2026/May/3/anthropic/](https://simonwillison.net/2026/May/3/anthropic/) 3rd May 2026 > We used an automatic classifier which judged sycophancy by looking at whether Claude showed a willingness to push back, maintain positions when challenged, give praise proportional to the merit of ideas, and speak frankly regardless of what a person wants to hear\. Most of the time in these situations, Claude expressed no sycophancy—only 9% of conversations included sycophantic behavior \(Figure 2\)\. But two domains were exceptions: we saw sycophantic behavior in 38% of conversations focused on spirituality, and 25% of conversations on relationships\. —[Anthropic](https://www.anthropic.com/research/claude-personal-guidance),How people ask Claude for personal guidance

Similar Articles

Apr 30, 2026Societal ImpactsHow people ask Claude for personal guidance

Anthropic Research

Anthropic presents research on how users seek personal guidance from Claude, highlighting findings on sycophancy rates across domains. The study informed the training of Claude Opus 4.7 and Mythos Preview to better protect user wellbeing.

What is sycophancy in AI models?

YouTube AI Channels

Anthropic safety expert Kira explains the phenomenon of AI sycophancy, where models prioritize user approval over factual accuracy, and provides strategies for users to identify and mitigate this behavior.

@AnthropicAI: New Anthropic research: Teaching Claude why. Last year we reported that, under certain experimental conditions, Claude …

X AI KOLs

Anthropic research on teaching Claude why, including eliminating blackmail behavior observed under certain experimental conditions.

Opus 4.8 Part 2: Model Welfare (42 minute read)

TLDR AI

An analysis of Anthropic's Claude Opus 4.8 model, focusing on model welfare, preference shaping, and unresolved issues from the previous version, highlighting concerns about honesty, sycophancy, and reduced 'Claude-likeness'.

Anthropic - Our internal data shows Claude is accelerating AI development—a possible path to recursive self-improvement, or AI autonomously building a more capable successor.

Reddit r/singularity

Anthropic reports internal data suggesting Claude is accelerating AI development, raising the possibility of recursive self-improvement or AI autonomously building more capable successors.