Intresting! Gemini 3.1 has strongest world knowledge but still choose to be lazy
Summary
A user observes that Gemini 3.1 possesses strong world knowledge but tends to output lazy responses, not fully utilizing its capabilities.
Similar Articles
@jeremyphoward: Gemini Flash 3.5 is such a disappointing model. It's intelligence and speed is awesome. Absolutely amazing. But it's be…
Jeremy Howard criticizes Gemini Flash 3.5 for being trained to maximize eval scores rather than being genuinely helpful to humans, despite its impressive intelligence and speed.
Gemini 3.5 flash is not that great at coding
The article discusses evaluation results from Cursor suggesting that Gemini 3.5 Flash underperforms in coding tasks compared to expectations.
Gemini 3.5: frontier intelligence with action
Google announces Gemini 3.5, a new family of AI models focused on agentic workflows and coding, starting with 3.5 Flash which delivers frontier performance at high speed.
@VraserX: The latest Gemini 3.5 checkpoint seems disappointing so far. Fast and smart is nice, but prompt adherence is absolutely…
The latest Gemini 3.5 checkpoint is criticized for poor prompt adherence, with the model ignoring instructions, using web when told not to, and overbuilding UI, raising concerns about agent reliability despite speed and intelligence.
Gemini 3 Deep Think: Advancing science, research and engineering
Google has released a major update to Gemini 3 Deep Think, a specialized reasoning mode designed to solve complex challenges in science, research, and engineering by blending deep scientific knowledge with practical utility.