11.67% ARC-AGI-2 Local Eval on a Single 4090: The TOPAS Recursive Architecture

Reddit r/LocalLLaMA 05/07/26, 08:56 PM Models

arc-agi recursive-architecture reasoning efficient-ai consumer-hardware benchmark

Summary

The authors present TOPAS, a recursive AI architecture achieving 11.67% on ARC-AGI-2 using a single RTX 4090, aiming to demonstrate that architectural efficiency can outweigh raw compute power.

I'm not sure too many people care about the ARC-AGI-2 competition anymore, but still...I thought some might find this interesting. They're running it one last time this year. Everyone is currently leaderboard-stuffing using the winning open-source code from last year. That's why if you take a peak it's really just the same scores clogging it up. We're doing something a bit different though, building a highly efficient, deep-recursion model from scratch. We just hit 11.67% on the public LB, but that's with a massive asterisk. We don't have a cluster. We have **one RTX 4090**. And we're only 14 days or so into training a 100m parameter model. Locally, this checkpoint actually hit 36%. On the Kaggle submission, our TTT is computationally heavy because of the recursive loops. To avoid a total submission timeout, we set the thresholds too high, and the model ended up outputting \[\] (null) for nearly half the puzzles...hence the 11.67%. We're trying to show that ARC isn't just a Compute War, but an architecture war. Small models using biological memory models can punch way above their weight class if they can handle the reasoning loops. We're tuning the time-management logic tonight and expect to put a 20% score up tomorrow once we let the model actually finish the thought process. And beyond that...the actual model is still in training, in the Grokking phase. We strongly believe that if we give it another 3-5 weeks to fully train we could drop something really groundbreaking on that leaderboard. If you're interested in how we're scaling recursive reasoning on consumer metal, we'd love to answer questions about it.

Original Article

11.67% ARC-AGI-2 Local Eval on a Single 4090: The TOPAS Recursive Architecture

Similar Articles

Sakana Fugu (3 minute read)

Claude Opus 4.8 scores over 1% on ARC-AGI 3 !!

Getting peak TOPS on a Ryzen AI 7 350 NPU

@Recursive_SI: https://x.com/Recursive_SI/status/2064980090702962699

@rohanpaul_ai: Quite a massive inferencing rack breakthrough from @TensordyneInc . They just announced an AI-inference rack, claiming …

Submit Feedback

Similar Articles

Claude Opus 4.8 scores over 1% on ARC-AGI 3 !!
Claude Opus 4.8 achieves a score of over 1% on the ARC-AGI 3 benchmark, demonstrating slight progress on a difficult AI reasoning test.

Getting peak TOPS on a Ryzen AI 7 350 NPU

@Recursive_SI: https://x.com/Recursive_SI/status/2064980090702962699

@rohanpaul_ai: Quite a massive inferencing rack breakthrough from @TensordyneInc . They just announced an AI-inference rack, claiming …