Gemini 3.5 Flash is twice as expensive as ChatGPT 5.5 on GitHub Copilot. Also, Gemini reasoning models are MoE

Reddit r/singularity 05/20/26, 01:05 PM News

gemini chatgpt pricing mixture-of-experts google openai model-comparison

Summary

Gemini 3.5 Flash is priced twice as high as ChatGPT 5.5 on GitHub Copilot. Additionally, the article notes that Gemini reasoning models are Mixture of Experts (MoE), which may make them less competitive for advanced tasks.

Also FYI, Gemini reasoning models (2.5 Pro, 3.0 Pro and 3.1 Pro) were MoE. I don't know why this isn't more broadly discussed. As long as Google keeps doing that, they are bound to be out-competed by OpenAI and Anthropic, and deliver largely useless models for advanced applications (e.g. research). Clearly, Google is more concerned with developing fast and cheap* LLMs they can integrate across their products, especially google search, rather than with being the best model for advanced tasks. (*Ironically Flash is not cheap, at least for now)

Original Article

Similar Articles

Gemini 3.5 flash costs 3 times more than the previous version and 30x more than gemini 1.5 flash.

Reddit r/singularity

Google's Gemini 3.5 Flash is priced three times higher than its predecessor and 30 times more expensive than Gemini 1.5 Flash, raising concerns about cost scaling for future models.

Gemini 3.5 Flash looks worse than it seems on Artificial Analysis

Reddit r/singularity

Comparison showing that Gemini 3.5 Flash scores slightly lower than Gemini 3.1 Pro in Artificial Analysis benchmarks and has a higher total benchmark cost despite lower per-token API pricing.

Gemini 3 Flash: frontier intelligence built for speed

Google DeepMind Blog

Google has released Gemini 3 Flash, a fast, cost-effective AI model that combines Pro-grade reasoning with Flash-level speed for tasks like coding, complex analysis, and agentic workflows.

Introducing Gemini 2.5 Flash

Google DeepMind Blog

Google announces Gemini 2.5 Flash, a new hybrid reasoning model available in preview through the Gemini API. The model features toggleable thinking capabilities, fine-grained thinking budgets for quality-cost-latency tradeoffs, and maintains fast inference speeds while improving performance over 2.0 Flash.

Gemini 3.5 Flash Looks Good For How Fast It Is (8 minute read)

TLDR AI

Google released Gemini 3.5 Flash, a hybrid speed model that rivals Opus 4.7 and GPT-5.5 in speed and cost while performing well on agentic and coding benchmarks.