Mythos-class models will diffuse throughout the world by 2029 (7 minute read)

TLDR AI News

Summary

Saagar Pateder analyzes the diminishing marginal returns of AI intelligence for consumer and enterprise tasks, and predicts that open-weight models will diffuse globally by 2029, based on historical trends in model performance and cost.

Model performance has only improved over time. There's currently no reason it shouldn't continue to improve in the future. Open weight models are only a few months behind frontier models on benchmarks. If current trends continue, it is likely that a Claude Fable 5-level open model that can run on a device with 16 GB of RAM will be possible by early 2029.
Original Article
View Cached Full Text

Cached at: 06/12/26, 02:50 PM

# Mythos-class models will diffuse throughout the world by 2029 — Saagar Pateder Source: [https://spateder.com/projects/20260611/openweightmodels](https://spateder.com/projects/20260611/openweightmodels) ## Model capabilities improve over time, but open\-weight models lag the frontier I often ask Claude mundane questions about cooking, fitness, and cars, among other things, and I can’t say I’ve found Fable 5 to be some magical step change vs\. previous Claude models \(e\.g\., Opus 4\.7\) at answering my day\-to\-day questions\. I was already in awe of the fact that for $20/month I can have functionally unlimited access to incredible intelligence in my pocket; Fable 5 may be smarter, but it’s probably not going to help me plan a date night dinner any better\. There are diminishing marginal returns to intelligence; the majority of my \(and probably most consumers’\) day\-to\-day AI usage isn’t going to really benefit from a smarter model\. Let’s shift focus to the enterprise\. There’s a vast array of jobs to be done and people to do them: lawyers and executive assistants and nurses and customer service workers and account managers and accountants\. Seriously,[there is a LOT of white\-collar work being done today in the US](https://data.bls.gov/oes/#/area/0000000/2025)\. You could imagine some tier\-system that bucketed these types of work into difficulty levels: manual data\-entry would probably be pretty low on the list; \(some\) work done by biology researchers or lawyers or software engineers would probably be higher up on the list\. But the same law of diminishing marginal returns applies: beyond a certain point, hiring a smarter\-than\-necessary human doesn’t really improve performance\. And if you wanted to augment or automate this labor – diminishing marginal returns applies to model intelligence also\. But again, there’s a diversity of tasks, and new models can continue to push the frontier forward for some while not being materially better on others\. Fable 5 is clearly a[gamechanger for hardcore software engineering](https://www.anthropic.com/news/claude-fable-5-mythos-5)and beating Pokemon; I haven’t seen notable performance improvements in my Chipotle burrito\-bowl ordering workflow\. Model performance has[only improved over time](https://artificialanalysis.ai/trends?license=proprietary&model-creators=openai%2Cgoogle%2Canthropic#frontier-language-model-intelligence-over-time), and I see no reason why it shouldn’t continue to improve in the future\. Let’s turn our task difficulty tier list into a y\-axis and show model performance over time\. This is just illustrative; a precise mapping from AAII score to capabilities on real world tasks is unclear, and I’m not trying to make a prediction that doctors or lawyers or software engineers will be automated by 20XX\. I’m merely saying that \(1\) the frontier models have gotten better over time, that \(2\) they’ll probably continue to do so, and that \(3\) as they get better and better, more and more tasks will reach the asymptote for diminishing marginal returns to model intelligence\. Behind the frontier lies open\-weight models: models that theoretically anyone could run with the right compute hardware\. Open\-weight models are usually substantially cheaper vs\. models from Google / Anthropic / OpenAI, but are also[less intelligent](https://artificialanalysis.ai/trends?license=proprietary%2Copen&model-creators=openai%2Cgoogle%2Canthropic#progress-in-open-weights-vs-proprietary-intelligence)\. How far behind open\-weight models are vs\. the frontier is up for debate, but for now let’s assume the answer is ~4 months or so on benchmarks \[1\]\. Open\-weight models also come in a variety of sizes\. For example, the Gemma 4 family of open\-weight models from Google comes in E2B, E4B, 12B, 26B A4B, and 31B sizes\. Understanding the alphabet soup isn’t important, but[larger models \(more parameters\) typically correlates to more intelligence](https://artificialanalysis.ai/models/open-source?model-filters=reasoning-models%2Copen-source&models=gemma-4-31b%2Cdeepseek-v4-pro%2Cdeepseek-v4-flash%2Cminimax-m2-7%2Cnvidia-nemotron-3-ultra-550b-a55b%2Ckimi-k2-6%2Cmimo-v2-5-pro%2Cglm-5-1%2Cqwen3-6-35b-a3b%2Cqwen3-6-27b#intelligence-vs-total-parameters), while smaller models can run on smaller and less expensive devices \(e\.g\., phones, laptops\)\. Let’s add two more lines to our graph above: one for the cutting edge of open\-weight models, and another for what could feasibly run on an average laptop\. If you’re interested in how I arrived at these numbers, you can find a full analysis[here](https://drive.google.com/file/d/1CGZylAT0EhTwhK-IuxvwRVtz4eieZL3I/view?usp=sharing)\(download the file and open it in Chrome\), and the full data and Python scripts behind it[here](https://drive.google.com/drive/folders/1_JtTmSbvT1qLVEtKl_J59BDzCy4Sy_1T?usp=sharing)\. ## What does Fable 5 being diffuse throughout the economy entail? I doubt consumers will care much about running on\-device models\. ChatGPT Free\-tier consumers probably don’t care about having access to the smartest models and probably aren’t running into rate limits all that often; they probably do care about ease of use \(not having to set anything up\), a strong memory system, and access to multimodal outputs \(image generation has clearly caught on with the consumer crowd\)\. Seeing ads here and there won’t be much of a turn off \(see: Instagram, Google Search\)\. Paid consumers probably won’t care much about on\-device models either: if you care about model intelligence, you’re sticking with the closed\-weight frontier, if you care about rate limits, I imagine a more built out ads engine can solve that \(would you rather wait for your limits to reset, or press on with ads if the option were presented to you?\)\. It’s a different story in the enterprise\. Excluding FOMO\-driven tokenmaxxing, enterprises make decisions by looking at basic ROI calculations, and if the[90th percentile of businesses are spending $7200/year/employee on AI spend](https://ramp.com/data/ai-index)\[2\], there’s going to be a pretty strong incentive to switch over to an open\-weight model that costs ~20% of that or to a local model that’s free\. The unknowable trillion\-dollar\-question is for what workloads frontier models will continue to command positive ROI over their open\-weight and local counterparts\. I can see a world where frontier models continue to be worth their price in fields like life sciences, healthcare, finance, law, and engineering \(whether physical or digital\) over the next handful of years\. I also can see a world where e\.g\., Opus 5\.5 is good enough for the vast majority of tasks done in the vast majority of enterprises, and companies that run the numbers conclude that buying every power user a ~$5,000 laptop with an[RTX Spark](https://nvidianews.nvidia.com/news/nvidia-microsoft-windows-pcs-agents-rtx-spark)inside is the right capex\-opex tradeoff\. And though I hate to end on a sour note, anyone having easy \(I took me 30 minutes and 4 prompts to get Claude to install an open weight model on my machine\) access to the cybersecurity capabilities of a Mythos\-class model is certainly a terrifying thought\. Sufficiently empowered, just one bad actor can ruin a lot of people’s day\.

Similar Articles

Can tech companies learn to love cheaper AI models? 

TechCrunch AI

TechCrunch reports on a potential industry shift as companies consider switching to cheaper, smaller AI models instead of always using the most powerful ones, driven by escalating costs. Predictions like Brian Armstrong's suggest 80% of workloads could run on 99% cheaper models within 12-18 months, which would significantly impact major AI labs like OpenAI and Anthropic.