AI modes - "Helpfulness" "honestness" ... how do they work?

Reddit r/artificial News

Summary

A user questions how Google AI's "Helpfulness" vs "Honesty" modes work, noting extreme shifts in tone from uncritical praise to harsh negativity.

Hi there, I am currently looking for a new job, and I sometimes ask Google's AI Mode. Since the answers were all sugar-coated and everything I typed was a great idea or a great plan, I looked for the reason. By default, the "Helpfulness" mode seems to be activated, so I asked for "honestness" mode instead. Now everything I type is, according to the AI, kinda trash, and I probably won't be able to do it anyway (e.g. I am over 40, and the AI tells me I am too old and that it won't work anyway). Reality is probably somewhere in between. So my question is about those modes: are they simple instructions that the AI follows, like being supportive no matter what vs. trashing everything no matter what, or is the behaviour somewhat based on the sources the AI finds regarding my questions or comments?
