New Mythos checkpoint shows continued improvement: “On a 32-step corporate network attack we estimate takes a human expert ~20 hours, this checkpoint completes the full attack in 6 /10 attempts.”

Reddit r/singularity Models

Summary

Mythos releases a new checkpoint that can complete a 32-step corporate network attack in 6 out of 10 attempts, compared to ~20 hours for a human expert.

No content available
Original Article

Similar Articles

Mythos can improve speed of training code 52x (compared to human 4x at 4-8hrs)

Reddit r/singularity

Anthropic's Mythos system achieved a 52x speedup in optimizing training code compared to a human's 4x speedup over 4-8 hours on the same task, with the caveat that absolute multiples depend heavily on starting code quality. The like-for-like comparison shows roughly 3x–52x improvement across models over the past year.

Project Glasswing: what Mythos showed us

Hacker News Top

Cloudflare tested Anthropic's Mythos Preview LLM, designed for security vulnerability research, and found it capable of chaining multiple bugs into exploits and generating working proofs, representing a significant advancement over general-purpose frontier models.