@FinanceYF5: Regret not using Fable 5 before it was discontinued? He ran it for 6 days straight. 1/ Most people treat Fable 5 as a faster chat box. Someone let a Fable 5 agent run for 6 days without anyone at the helm before writing the conclusion: 90% of people only use 10% of its capabilities. It was built for 'running for days'...
Summary
A user let a Fable 5 agent run continuously for 6 days without human intervention, concluding that most people only use 10% of its capacity.
View Cached Full Text
Cached at: 06/16/26, 05:38 PM
Regret not using Fable 5 before it was canceled? He ran it continuously for 6 days 👇
1/ 🚀 Most people treat Fable 5 like a faster chat box
Someone let a Fable 5 agent run continuously for 6 days, with no human steering, before writing down the conclusion: 90% of people only use 10% of its capability.
It was built to “run for days,” yet people only use it for minutes. https://t.co/bCqmgKh0Vp
Regret not using Fable 5 before it was canceled? He ran it continuously for 6 days
1/ Most people treat Fable 5 like a faster chat box
Someone let a Fable 5 agent run continuously for 6 days, with no human steering, before writing down the conclusion: 90% of people only use 10% of its capability.
It was built to “run for days,” yet people only use it for minutes.
2/ The real dividing line: self-learning vs self-improvement
Self-learning means the model modifies its own weights — Fable 5 doesn’t do that, and no production model currently does.
Self-improvement means the system around the model compounds: each run writes lessons into memory, skills sharpen with use. The model stays the same, but the environment gets smarter with every run.
3/ Building the compounding stack from the bottom up, four layers
Bottom layer: primitives — Fable 5, sub-agents, worktree. Most people only touch this layer. Second layer: orchestration — goal loops, dynamic workflows, cloud Routines. Third layer: memory — state files, Skills, knowledge bases. Top layer: self-improvement — visual self-checks, evaluation loops, rule distillation.
4/ Don’t throw everything at Fable 5
It costs about 5x more per token than Opus 4.8 ($10/M input, $50/M output).
Let Fable 5 be the orchestrator, Sonnet 4.6 handle the heavy lifting, Haiku 4.5 act as the grader, and fall back to Opus 4.8 automatically when blocked by safety classifiers.
5/ Never let the model grade itself
Anthropic’s own experiments: the version with an independent verifier dared to make bigger changes, pushing a failed experiment all the way to maximum results; the self-grading version only tweaked one safety parameter and gave up early.
The agent that writes code should never be the one grading it.
6/ Five stages of memory: fail → investigate → verify → distill → reference
Sonnet 4.6 mostly stops at stage one, piling up failure notes no one ever reads again. Fable 5 can go the full distance — at its peak, verification coverage hit over 70%, distilling facts into reusable rules.
The gap isn’t the model — it’s whether you have state files.
7/ Self-improvement is a property of the system, not the model
In every experiment that proves this, the models on both sides are identical. What changes is the system around them: verifiers, state files, evaluation loops.
Pick a layer you haven’t built yet, add it tomorrow, then add the next.
Original post:
That’s all
If you like this topic:
- Follow me (@FinanceYF5)
- Like + repost the first post below
1/ a16z’s four charts for one week: AI is repricing everything
Manufacturing reshoring, Americans drinking, search clicks, SaaS stock prices — four seemingly unrelated things point to the same divergence.
Those who can’t tell an AI story are being left behind.
Similar Articles
@FinanceYF5: Oh my god... Fable 5 is back, and it's insanely powerful. Someone asked Fable to make a game called 'Super Smart Racing'... With just 4 prompts and $173 worth of tokens, Fable 5 created this game. (Prompts below)
Fable 5 model only used 4 prompts and $173 worth of tokens to create a game called 'Super Smart Racing', demonstrating its extremely strong generative capabilities.
@FinanceYF5: 1/ Someone ran 3 tasks with Fable 5. The $200 Max membership, 5-hour quota depleted 73%. He said he was stunned. Not because it's expensive — because Opus 4.8 never produced anything like this. A thread explains clearly
A user ran 3 tasks using the Fable 5 model, consuming 73% of the Max membership's 5-hour quota, sparking discussions as Opus 4.8 never consumed resources in this way.
@RookieRicardoR: Fable 5 Max, five tasks, 3300 lines of code, ran for 90 minutes, is this right?
Discusses the performance of Fable 5 Max (five tasks, 3300 lines of code, 90 minutes) and notes that with the latest version of Claude Code (170), Fable 5's cost is twice that of Ops 4.8.
@FinanceYF5: Generate a game with one sentence, build a city with one sentence, recreate a classic with one sentence. Fable 5's first day of release is insane!
Fable 5 has been released, capable of generating games, cities, or recreating classics with just one sentence, demonstrating the powerful ability of text-to-3D content.
@FinanceYF5: 1/ Within hours of Fable 5's release, Twitter is in chaos. Karpathy says it's a major version leap. Some call it their "singularity moment." Others say they're starting to fear the future of software engineering. Today's top 10 reactions to watch.
The release of Fable 5 has sparked widespread discussion. Andrej Karpathy calls it a major version leap, some consider it their "singularity moment," while others worry about the future of software engineering. This article summarizes 10 most noteworthy reactions.