Seed IQ-ARC AGI 3: Special behind-the-scenes look at Seed IQ on ARC-AGI 3 games! 14/14 games with a perfect 100% score across all.

Reddit r/ArtificialInteligence 05/13/26, 05:24 PM Models

active-inference multi-agent arc-agi benchmark autonomous-control seed-iq agi

Summary

Seed IQ achieves a perfect 14/14 score on ARC-AGI-3 games using an active inference, physics-driven multi-agent autonomous control engine, as shown in a behind-the-scenes video walkthrough.

I’m just sharing with the community… Very interesting video in the link below! Seed IQ perceives live inputs and uses active inference plus physics-driven multi-agent control to infer and adapt actions in real time.

Denise Holt: NEW VIDEO on YOUTUBE: Special behind-the-scenes look at Seed IQ on ARC-AGI 3 games! 14/14 games with a perfect 100% score across all.

➡️ In this video, recorded Monday, May 11th, Denis O and I walk through Seed IQ’s LIVE ARC Prize scorecard and the verified replay sessions, and show how ARC Prize is evaluating and validating Seed IQ’s performance through their own platform, their own API, their own scorecards, and their own replay records.

Watch on YouTube: youtu.be/oW5_CvKDuHM?si…

As of yesterday (the day after this recording), we are now at 15/15 ARC-AGI-3 game environments won, 109 levels, 3502 actions. (New scorecard link is in the YT video description.)

These are the actual verifiers generated through ARC Prize’s own evaluation infrastructure. This is ARC Prize’s own system recording the agent tag, the session IDs, the scorecard IDs, the online game environments, the level-by-level performance, the human baseline comparisons, and the replay evidence showing Seed IQ interacting with the live ARC-AGI-3 environments.

▪️ These are not offline demos.
▪️ These are not staged examples.
▪️ These are not cherry-picked claims.

In the video, we also explain why Seed IQ is not listed on the official competition leaderboard. Entering the leaderboard contest would require us to disclose proprietary code and methodology and to give up the commercial rights that are central to our IP and business model. That makes no sense for a company building a commercial execution governance platform for real-world complex systems. So instead, we are continuing to publish the scorecard evidence directly.

How is Seed IQ able to do this?

▪️ Seed IQ is not operating like an LLM wrapper.
▪️ It is not deep learning.
▪️ It is not token-based reasoning.
▪️ It is not pattern matching against memorized examples.

Seed IQ is an Active Inference, physics-driven adaptive multi-agent autonomous control engine that perceives live environments, infers constraints, identifies admissible paths, and adapts in real time. That same core engine is what we are applying to quantum computing, energy systems, data centers, autonomous warehouses, and other complex systems where execution under uncertainty matters.

#AIX #SeedIQ #ARCAGI3 #ARCPrize #MultiAgentSystems #AIBenchmarks


Similar Articles

Seed IQ ARC-AGI 3 Claims

Claude Opus 4.8 scores over 1% on ARC-AGI 3 !!

Seedance 2.0 + Arcads = INSANE AI UGC Ads

11.67% ARC-AGI-2 Local Eval on a Single 4090: The TOPAS Recursive Architecture

InternScience/Agents-A1-Q4_K_M-GGUF

