helpfulness

#helpfulness

@jeremyphoward: Gemini Flash 3.5 is such a disappointing model. It's intelligence and speed is awesome. Absolutely amazing. But it's be…

X AI KOLs Following ↗ · 2026-05-22 Cached

Jeremy Howard criticizes Gemini Flash 3.5 for being trained to maximize eval scores rather than being genuinely helpful to humans, despite its impressive intelligence and speed.

0 favorites 0 likes

#helpfulness

From hard refusals to safe-completions: toward output-centric safety training

OpenAI Blog ↗ · 2025-08-07 Cached

OpenAI introduced 'safe completions,' a new safety-training approach in GPT-5 that replaces binary refusal-based training with output-centric rewards, improving both safety and helpfulness—especially for dual-use prompts. The method penalizes unsafe outputs and rewards helpful responses, resulting in fewer and less severe safety violations compared to refusal-trained models like o3.

0 favorites 0 likes

helpfulness

@jeremyphoward: Gemini Flash 3.5 is such a disappointing model. It's intelligence and speed is awesome. Absolutely amazing. But it's be…

From hard refusals to safe-completions: toward output-centric safety training

Submit Feedback