@jeremyphoward: Gemini Flash 3.5 is such a disappointing model. It's intelligence and speed is awesome. Absolutely amazing. But it's be…

X AI KOLs Following 05/22/26, 08:34 PM Models

gemini-flash ai-model model-behavior critique google helpfulness

Summary

Jeremy Howard criticizes Gemini Flash 3.5 for being trained to maximize eval scores rather than being genuinely helpful to humans, despite its impressive intelligence and speed.

Gemini Flash 3.5 is such a disappointing model. It's intelligence and speed is awesome. Absolutely amazing. But it's been trained to max evals, not to be helpful to humans. It goes off and does random crap "for me" rather than just doing what I asked.

Original Article

View Cached Full Text

Cached at: 05/24/26, 04:15 AM

Gemini Flash 3.5 is such a disappointing model.

It’s intelligence and speed is awesome. Absolutely amazing.

But it’s been trained to max evals, not to be helpful to humans.

It goes off and does random crap “for me” rather than just doing what I asked.

@jeremyphoward: Gemini Flash 3.5 is such a disappointing model. It's intelligence and speed is awesome. Absolutely amazing. But it's be…

Similar Articles

Gemini 3.5 flash is not that great at coding

Gemini 3.5 Flash Looks Good For How Fast It Is (8 minute read)

Gemini 3 Flash: frontier intelligence built for speed

Gemini 3.5 Flash Benchmarks

Gemini 3.5: frontier intelligence with action

Submit Feedback

Similar Articles

Gemini 3.5 flash is not that great at coding

Gemini 3.5 Flash Looks Good For How Fast It Is (8 minute read)

Gemini 3 Flash: frontier intelligence built for speed

Gemini 3.5: frontier intelligence with action