benchmaxing


Fable 5 below even Gemini 3.1 on LiveBench

Reddit r/singularity · 6d ago

A discussion of LiveBench results showing Fable 5 scoring below Gemini 3.1, asking whether the benchmark is flawed or whether Anthropic is optimizing for benchmarks.
