@jeremyphoward: Gemini Flash 3.5 is such a disappointing model. It's intelligence and speed is awesome. Absolutely amazing. But it's be…

Summary

Jeremy Howard criticizes Gemini Flash 3.5 for being trained to maximize eval scores rather than being genuinely helpful to humans, despite its impressive intelligence and speed.

Gemini Flash 3.5 is such a disappointing model. Its intelligence and speed are awesome. Absolutely amazing. But it's been trained to max evals, not to be helpful to humans. It goes off and does random crap "for me" rather than just doing what I asked.

Cached at: 05/24/26, 04:15 AM

Similar Articles

Gemini 3 Flash: frontier intelligence built for speed

Google DeepMind Blog

Google has released Gemini 3 Flash, a fast, cost-effective AI model that combines Pro-grade reasoning with Flash-level speed for tasks like coding, complex analysis, and agentic workflows.

Gemini 3.5 Flash Benchmarks

Reddit r/singularity

Benchmark results for the Gemini 3.5 Flash model are discussed, likely showcasing its performance across various AI tasks.

Gemini 3.5: frontier intelligence with action

Google DeepMind Blog

Google announces Gemini 3.5, a new family of AI models focused on agentic workflows and coding, starting with 3.5 Flash which delivers frontier performance at high speed.