@natolambert: The crazy jump in perf for Claude 5 Fable is vindication for people who say Opus 4.5 and were like "yeah I should (most…
Summary
Nathan Lambert highlights the significant performance improvement of Claude 5 Fable, suggesting it validates the shift away from manual coding.
Similar Articles
Claude Fable 5 benchmarks
Anthropic released benchmarks for Claude Fable 5, a new AI model, showing significant performance improvements.
Differences Between Claude Opus 4.8 and Claude Fable 5 on MineBench
A detailed comparison of Claude Opus 4.8 and Claude Fable 5 on the MineBench benchmark, highlighting trade-offs in inference time, cost, build quality, and prompting sensitivity.
ProgramBench result for Fable 5 is in, doubling Opus 4.8 even with 4.8 fallback "99% of the runs"
ProgramBench results show Fable 5 achieving double the performance of Opus 4.8, even with fallback to 4.8 in 99% of runs.
Initial impressions of Claude Fable 5
Claude Fable 5 and Claude Mythos 5 have been released by Anthropic, offering a 1 million token context window and doubled pricing compared to Opus 4.8. Fable 5 includes strict safety guardrails, while Mythos 5 lacks them. Initial impressions describe it as a powerful and capable model.
@bentossell: wait… if most people think 5.5 is better than 4.7, i assume that’s due to terminal coding benchmark… 4.8 is still outpe…
The tweet discusses the release of Claude Opus 4.8, which improves upon Opus 4.7 with sharper judgment and longer independent work, though it notes that version 5.5 still outperforms it on a terminal coding benchmark.