Mythos can improve speed of training code 52x (compared to human 4x at 4-8hrs)

Reddit r/singularity 06/04/26, 07:37 PM News

Summary

Anthropic's Mythos system achieved a 52x speedup in optimizing training code compared to a human's 4x speedup over 4-8 hours on the same task, with the caveat that absolute multiples depend heavily on starting code quality. The like-for-like comparison shows roughly 3x–52x improvement across models over the past year.

[https://www.anthropic.com/institute/recursive-self-improvement](https://www.anthropic.com/institute/recursive-self-improvement) Edit: The footnote reads: «How large the speedup gets depends heavily on how much room for improvement the starting code leaves, and it should not be read as a real-world training speedup. So the absolute multiple is not the figure to anchor on here. What is more informative is the like-for-like comparison that this experimental setup makes possible, both across models (\~3x to \~52x over the past year) and against a skilled human (\~4x in four to eight hours on the same task).»

Original Article

Similar Articles

@AnthropicAI: Each time we release a model, we run the same test: give it code that trains a small AI model, ask the new model to spe…

X AI KOLs

Anthropic shares internal benchmark results showing dramatic AI coding improvement: while Claude Opus 4 averaged ~3x speedup on an ML code optimization task in May 2024, the new Mythos Preview model achieved ~52x speedup this April, compared to 4-8 hours for a skilled human to reach 4x.

New Mythos checkpoint shows continued improvement: “On a 32-step corporate network attack we estimate takes a human expert ~20 hours, this checkpoint completes the full attack in 6 /10 attempts.”

Reddit r/singularity

Mythos releases a new checkpoint that can complete a 32-step corporate network attack in 6 out of 10 attempts, compared to ~20 hours for a human expert.

More evidence of Mythos's strength in Cybersecurity/Hacking - compared to 5.5, it got 18/41 n-day exploits, vs 1/41. Open Source/Weights models get nothing

Reddit r/singularity

Mythos demonstrates strong performance in cybersecurity hacking, achieving 18 out of 41 n-day exploits compared to 1 for version 5.5, while open-source models get none.

Anthropic likely to release Mythos in the "near future"

Reddit r/singularity

Anthropic is expected to release a new AI model called Mythos in the near future.

Mythos 1 has been spotted in Claude Code

Reddit r/singularity

Mythos 1, a new AI model, has been spotted in Claude Code, Anthropic's coding assistant, hinting at a potential new release.

Similar Articles

@AnthropicAI: Each time we release a model, we run the same test: give it code that trains a small AI model, ask the new model to spe…

New Mythos checkpoint shows continued improvement: “On a 32-step corporate network attack we estimate takes a human expert ~20 hours, this checkpoint completes the full attack in 6 /10 attempts.”

More evidence of Mythos's strength in Cybersecurity/Hacking - compared to 5.5, it got 18/41 n-day exploits, vs 1/41. Open Source/Weights models get nothing

Anthropic likely to release Mythos in the "near future"

Mythos 1 has been spotted in Claude Code

Submit Feedback