@AnthropicAI: AI research is a series of next-step decisions. We looked at sessions where a human researcher took a wrong turn, showe…

X AI KOLs Models

Summary

Anthropic's Mythos Preview model outperformed human researchers in correcting wrong-turn decisions 64% of the time, a major improvement from 22% in 2024, showcasing Claude's advancing research assistance capabilities.

AI research is a series of next-step decisions. We looked at sessions where a human researcher took a wrong turn, showed Claude the session up to that point, and asked it what to do next. Mythos Preview improved on humans 64% of the time—up from 22% in 2024.
Original Article

Similar Articles