@jeremyphoward: Gemini Flash 3.5 is such a disappointing model. It's intelligence and speed is awesome. Absolutely amazing. But it's be…
Summary
Jeremy Howard criticizes Gemini Flash 3.5 for being trained to maximize eval scores rather than being genuinely helpful to humans, despite its impressive intelligence and speed.
View Cached Full Text
Cached at: 05/24/26, 04:15 AM
Gemini Flash 3.5 is such a disappointing model.
It’s intelligence and speed is awesome. Absolutely amazing.
But it’s been trained to max evals, not to be helpful to humans.
It goes off and does random crap “for me” rather than just doing what I asked.
Similar Articles
Gemini 3.5 flash is not that great at coding
The article discusses evaluation results from Cursor suggesting that Gemini 3.5 Flash underperforms in coding tasks compared to expectations.
Gemini 3.5 Flash Looks Good For How Fast It Is (8 minute read)
Google released Gemini 3.5 Flash, a hybrid speed model that rivals Opus 4.7 and GPT-5.5 in speed and cost while performing well on agentic and coding benchmarks.
Gemini 3 Flash: frontier intelligence built for speed
Google has released Gemini 3 Flash, a fast, cost-effective AI model that combines Pro-grade reasoning with Flash-level speed for tasks like coding, complex analysis, and agentic workflows.
Gemini 3.5 Flash Benchmarks
Benchmark results for the Gemini 3.5 Flash model are discussed, likely showcasing its performance across various AI tasks.
Gemini 3.5: frontier intelligence with action
Google announces Gemini 3.5, a new family of AI models focused on agentic workflows and coding, starting with 3.5 Flash which delivers frontier performance at high speed.