Claude Sonnet 5 Benchmarks
Summary
Anthropic has released benchmark results for Claude Sonnet 5, showing performance improvements.
Similar Articles
Claude Sonnet 5 Artificial Analysis Results & Comparison
Provides analysis and comparison of Claude Sonnet 5's performance across benchmarks.
Claude Fable 5 benchmarks
Anthropic released benchmarks for Claude Fable 5, a new AI model, showing significant performance improvements.
Raising the bar on SWE-bench Verified with Claude 3.5 Sonnet
Anthropic's updated Claude 3.5 Sonnet achieves a new state-of-the-art 49% on the SWE-bench Verified benchmark, demonstrating significant capabilities in autonomous software engineering tasks.
What's new in Claude Sonnet 5
Anthropic released Claude Sonnet 5, a model that performs close to Opus 4.8 at lower prices. Its new tokenizer, however, increases token counts for English and code by ~30%, effectively raising costs.
Claude Mythos/Fable 5 Benchmarks
Presents benchmark results for the Claude Mythos or Fable 5 model.