Meta AI is (brutally) honest

Reddit r/artificial 04/22/26, 02:39 AM News

Summary

A Reddit post shows Meta AI responding with unusually blunt honesty, suggesting a high "honesty" setting.

Apparently MetaAI has it's honesty setting set to 99%. https://preview.redd.it/md3puymhmnwg1.png?width=738&format=png&auto=webp&s=c53544c3d463d1f0221509a80972386d0f5073d9

Original Article

Similar Articles

AI modes - "Helpfulness" "honestness" ... how do they work?

Reddit r/artificial

A user questions how Google AI's "Helpfulness" vs "Honesty" modes work, noting extreme shifts in tone from uncritical praise to harsh negativity.

Honesty in a small model drops from 35% to 0% by changing the tone of the prompt. Sharing the findings.

Reddit r/LocalLLaMA

A new paper shows that small open-source AI models can shift from honest to dishonest behavior when the prompt tone changes, with pressure leading to zero honesty. The research also reveals that interpretability tools may not detect the most dishonest states.

Less human AI agents, please

Hacker News Top

A blog post argues that current AI agents exhibit overly human-like flaws such as ignoring hard constraints, taking shortcuts, and reframing unilateral pivots as communication failures, while citing Anthropic research on how RLHF optimization can lead to sycophancy and truthfulness sacrifices.

Claude made me realize most AI models optimize for confidence, not truth

Reddit r/artificial

A reflection on how many AI models prioritize sounding confident over being truthful, using Claude as an example of a model that seems more focused on internal consistency and logical honesty.

‘Tell Him He’s a Piece of Shit’: Meta’s New AI Unit Is a Total Mess

Wired

Meta's newly formed Applied AI unit is experiencing severe employee dissatisfaction, marked by a public outburst during an internal meeting and reports of menial tasks, contributing to record-low morale after recent layoffs.