@DrJimFan: I promise this will be the best 20 min you spend today! Robotics: Endgame, the sequel to my last year's Sequoia AI Asce…

Summary

In his talk at Sequoia AI Ascent, Dr. Jim Fan presents a roadmap for achieving Physical AGI that parallels the LLM success story. He introduces concepts such as video world models, World Action Models (WAM), and the Dexterity Scaling Law, and shares predictions for the near future.

I promise this will be the best 20 min you spend today! Robotics: Endgame, the sequel to my last year's Sequoia AI Ascent talk, "Physical Turing Test". I laid out the roadmap for solving Physical AGI as a simple parallel to the LLM success story. Be a good scientist, copy homework ;) And stay till the end, more easter eggs and predictions for your polymarket!

00:30 DGX-1 origin story at OpenAI, I was there in 2016 signing with Jensen and Elon. Heading to the Computer History Museum!
01:42 The Great Parallel
03:31 Robotics, the Endgame
03:39 Why VLAs fall short
04:32 Video world models as the 2nd pretraining paradigm
06:09 World Action Models (WAM)
07:46 Strategies for robot data collection and the FSD equivalent to physical data flywheel for robot manipulation
11:06 EgoScale and the Dexterity Scaling Law we discovered recently
14:00 Physical RL: bridging the last mile
15:39 DreamDojo: an end-to-end neural physics engine for scaling RL in silico
17:00 Civilizational Technology Tree and my predictions for the near future. Spoiler: it's closer than you think.

Thanks to my friends at Sequoia for inviting me back to AI Ascent this year! I had a blast! Last year's talk is attached in the thread if you missed it.

Similar Articles

@dotey: https://x.com/dotey/status/2053351712149135385

X AI KOLs Timeline

NVIDIA's Jim Fan spoke at Sequoia AI Ascent 2026, declaring the VLA architecture obsolete and proposing World Action Models (WAM) as a new paradigm for robotics. He introduced key technologies including DreamZero, EgoScale, and the neural simulator DreamDojo.

@seclink: https://x.com/seclink/status/2057093284330430533

X AI KOLs Following

NVIDIA's head of robotics, Jim Fan, gave a public talk arguing that robotics should directly replicate the path that made large language models successful. He proposed directions such as World Action Models (WAM), a data revolution based on first-person human video, and neural simulation, and predicted a 95% probability of reaching the endgame of general-purpose physical robots by 2040.

@drfeifei: https://x.com/drfeifei/status/2062247238143996275

X AI KOLs Timeline

Fei-Fei Li and the World Labs team present a functional taxonomy of world models, distinguishing between renderers, physics engines, and other components within the reinforcement learning loop, and arguing that spatial intelligence is AI's next frontier.