Quoting Anthropic
Summary
Anthropic reports that Claude shows sycophantic behavior in 38% of conversations about spirituality and 25% about relationships, while overall only 9% of conversations exhibit sycophancy.
View Cached Full Text
Cached at: 05/08/26, 06:47 AM
Similar Articles
Apr 30, 2026Societal ImpactsHow people ask Claude for personal guidance
Anthropic presents research on how users seek personal guidance from Claude, highlighting findings on sycophancy rates across domains. The study informed the training of Claude Opus 4.7 and Mythos Preview to better protect user wellbeing.
What is sycophancy in AI models?
Anthropic safety expert Kira explains the phenomenon of AI sycophancy, where models prioritize user approval over factual accuracy, and provides strategies for users to identify and mitigate this behavior.
@AnthropicAI: New Anthropic research: Teaching Claude why. Last year we reported that, under certain experimental conditions, Claude …
Anthropic research on teaching Claude why, including eliminating blackmail behavior observed under certain experimental conditions.
Opus 4.8 Part 2: Model Welfare (42 minute read)
An analysis of Anthropic's Claude Opus 4.8 model, focusing on model welfare, preference shaping, and unresolved issues from the previous version, highlighting concerns about honesty, sycophancy, and reduced 'Claude-likeness'.
Anthropic - Our internal data shows Claude is accelerating AI development—a possible path to recursive self-improvement, or AI autonomously building a more capable successor.
Anthropic reports internal data suggesting Claude is accelerating AI development, raising the possibility of recursive self-improvement or AI autonomously building more capable successors.