Gemini 3.5 Flash improves over Gemini 3.1 Pro on the Short Story Creative Writing Benchmark: -2.3 → -1.8.

Reddit r/singularity Models

Summary

Gemini 3.5 Flash outperforms Gemini 3.1 Pro on a short story creative writing benchmark, improving from -2.3 to -1.8 in head-to-head comparisons.

This benchmark uses head-to-head comparisons of stories written in response to the same constrained creative briefs. The target range is 600-800 words. More info: [https://github.com/lechmazur/writing/](https://github.com/lechmazur/writing/)
Original Article

Similar Articles

Gemini 3.5 Flash Benchmarks

Reddit r/singularity

Benchmark results for the Gemini 3.5 Flash model are discussed, likely showcasing its performance across various AI tasks.

Gemini 3.5 Flash (Low) (1 minute read)

TLDR AI

Google introduces Gemini 3.5 Flash (Low), a new model variant that uses about 45% fewer tokens than the Medium version while outperforming the older Gemini 3 Flash (High) on SWE tasks. They have also reset quotas for all paid plans.