Gemini 3.5 Flash is twice as expensive as ChatGPT 5.5 on GitHub Copilot. Also, Gemini reasoning models are MoE

Reddit r/singularity News

Summary

Gemini 3.5 Flash is priced twice as high as ChatGPT 5.5 on GitHub Copilot. Additionally, the article notes that Gemini reasoning models are Mixture of Experts (MoE), which may make them less competitive for advanced tasks.

Also FYI, Gemini reasoning models (2.5 Pro, 3.0 Pro and 3.1 Pro) were MoE. I don't know why this isn't more broadly discussed. As long as Google keeps doing that, they are bound to be out-competed by OpenAI and Anthropic, and deliver largely useless models for advanced applications (e.g. research). Clearly, Google is more concerned with developing fast and cheap* LLMs they can integrate across their products, especially google search, rather than with being the best model for advanced tasks. (*Ironically Flash is not cheap, at least for now)
Original Article

Similar Articles

Gemini 3 Flash: frontier intelligence built for speed

Google DeepMind Blog

Google has released Gemini 3 Flash, a fast, cost-effective AI model that combines Pro-grade reasoning with Flash-level speed for tasks like coding, complex analysis, and agentic workflows.

Introducing Gemini 2.5 Flash

Google DeepMind Blog

Google announces Gemini 2.5 Flash, a new hybrid reasoning model available in preview through the Gemini API. The model features toggleable thinking capabilities, fine-grained thinking budgets for quality-cost-latency tradeoffs, and maintains fast inference speeds while improving performance over 2.0 Flash.