Mythos can improve speed of training code 52x (compared to human 4x at 4-8hrs)
Summary
Anthropic's Mythos system achieved a 52x speedup in optimizing training code compared to a human's 4x speedup over 4-8 hours on the same task, with the caveat that absolute multiples depend heavily on starting code quality. The like-for-like comparison shows roughly 3x–52x improvement across models over the past year.
Similar Articles
@AnthropicAI: Each time we release a model, we run the same test: give it code that trains a small AI model, ask the new model to spe…
Anthropic shares internal benchmark results showing dramatic AI coding improvement: while Claude Opus 4 averaged ~3x speedup on an ML code optimization task in May 2024, the new Mythos Preview model achieved ~52x speedup this April, compared to 4-8 hours for a skilled human to reach 4x.
New Mythos checkpoint shows continued improvement: “On a 32-step corporate network attack we estimate takes a human expert ~20 hours, this checkpoint completes the full attack in 6 /10 attempts.”
Mythos releases a new checkpoint that can complete a 32-step corporate network attack in 6 out of 10 attempts, compared to ~20 hours for a human expert.
More evidence of Mythos's strength in Cybersecurity/Hacking - compared to 5.5, it got 18/41 n-day exploits, vs 1/41. Open Source/Weights models get nothing
Mythos demonstrates strong performance in cybersecurity hacking, achieving 18 out of 41 n-day exploits compared to 1 for version 5.5, while open-source models get none.
Anthropic likely to release Mythos in the "near future"
Anthropic is expected to release a new AI model called Mythos in the near future.
Mythos 1 has been spotted in Claude Code
Mythos 1, a new AI model, has been spotted in Claude Code, Anthropic's coding assistant, hinting at a potential new release.