Gemini 3.5 Flash improves over Gemini 3.1 Pro on the Short Story Creative Writing Benchmark: -2.3 → -1.8.

Reddit r/singularity 05/20/26, 03:51 PM Models

gemini creative-writing benchmark model-comparison llm performance

Summary

Gemini 3.5 Flash scores -1.8 on a short story creative writing benchmark, a 0.5-point improvement over Gemini 3.1 Pro's -2.3.

This benchmark uses head-to-head comparisons of stories written in response to the same constrained creative briefs. The target range is 600-800 words. More info: [https://github.com/lechmazur/writing/](https://github.com/lechmazur/writing/)
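As a rough illustration of the setup described above, the sketch below checks a story against the stated 600-800 word target and averages signed head-to-head margins into a single number. The post does not describe how the benchmark actually computes its score, so `mean_margin` and its sign convention are assumptions made for illustration, not the benchmark's method. The function names are hypothetical.

```python
from statistics import mean

# Target word range stated in the post.
TARGET_MIN_WORDS = 600
TARGET_MAX_WORDS = 800


def in_target_range(story: str) -> bool:
    """Return True if the whitespace-delimited word count is within 600-800."""
    return TARGET_MIN_WORDS <= len(story.split()) <= TARGET_MAX_WORDS


def mean_margin(margins: list[float]) -> float:
    """Average signed margin over head-to-head comparisons.

    Assumed convention (not from the source): a margin is positive when the
    model's story was preferred and negative when the opponent's story was.
    """
    if not margins:
        raise ValueError("need at least one comparison")
    return mean(margins)
```

Under this assumed convention, a score moving from -2.3 to -1.8 would mean the model still loses more comparisons than it wins on average, but by a smaller margin.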

Similar Articles

Gemini 3.5 Flash looks worse than it seems on Artificial Analysis

Reddit r/singularity

Comparison showing that Gemini 3.5 Flash scores slightly lower than Gemini 3.1 Pro in Artificial Analysis benchmarks and has a higher total benchmark cost despite lower per-token API pricing.

Gemini 3.5 flash is not that great at coding

Reddit r/singularity

The article discusses evaluation results from Cursor suggesting that Gemini 3.5 Flash underperforms in coding tasks compared to expectations.

Gemini 3.5 Flash Benchmarks

Reddit r/singularity

Benchmark results for the Gemini 3.5 Flash model are discussed, likely showcasing its performance across various AI tasks.

Gemini 3.5 Flash Looks Good For How Fast It Is (8 minute read)

TLDR AI

Google released Gemini 3.5 Flash, a hybrid speed model that rivals Opus 4.7 and GPT-5.5 in speed and cost while performing well on agentic and coding benchmarks.

Gemini 3.5 Flash (Low) (1 minute read)

TLDR AI

Google introduces Gemini 3.5 Flash (Low), a new model variant that uses about 45% fewer tokens than the Medium version while outperforming the older Gemini 3 Flash (High) on SWE tasks. Google has also reset quotas for all paid plans.
