@swyx: This pod was an incredible gift to the community: not only our first pod about @xAI, but Ethan really indulged on all o…
Summary
A tweet praising a podcast episode where former xAI world model lead Ethan He provides deep insights into training SOTA video generation world models, covering Grok Imagine, Cosmos, and the parallels between video and coding agents.
View Cached Full Text
Cached at: 06/01/26, 05:47 PM
This pod was an incredible gift to the community:
not only our first pod about @xAI, but Ethan really indulged on all our questions on how to train a SOTA Videogen world model, including specific areas (consistent extending/editing, voice) that Grok @Imagine is still SOTA, https://t.co/Sl4AqAt7RA
Latent.Space (@latentspacepod): 🆕Grok Imagine’s Video Agent Moment: Cosmos, xAI, World Models, Generative UI, & the Codex Phase for Video!
https://t.co/Z3qhj368Tu
@EthanHe_42, former @xai world model lead and @nvidia Cosmos researcher, explains why AI video may follow the same path as coding agents, how Grok
Similar Articles
@EthanHe_42: In @latentspacepod podcast, I shared my view on video generation, world models, LLMs, agents, continual learning and wh…
Ethan He shares his insights from a Latent Space podcast, discussing key ideas about video generation, world models, LLMs, agents, continual learning, and the next frontiers in AI.
@swyx: full writeup and links here
A Latent Space podcast episode discusses the thesis that video models derive intelligence from LLMs, and that the next frontier is video agents. Guest Ethan He, who built Grok Imagine at xAI, shares insights on building frontier image and video systems.
@aiDotEngineer: Tokenmaxxing, Productivity, & internal AI Platforms @swyx in conversation with @GergelyOrosz, Editor of The Pragmatic E…
Podcast discussion on "tokenmaxxing," real-world AI productivity gains, and how internal AI platforms are reshaping software engineering roles.
@ycombinator: We're entering a new era of software where a single person, working with AI agents, can build products that previously …
A LightconePod podcast episode discussing the rise of AI coding agents like Claude Code and OpenClaw, exploring how single developers can now build products that previously required entire teams, along with emerging workflows and the concept of "tokenmaxxing".
Why Video Agent models are next — Ethan He, xAI Grok Imagine (98 minute read)
Ethan He from xAI discusses why video agent models are the next frontier, arguing that video models derive intelligence from LLMs and that the evolution of video generation will mirror AI coding, shifting from one-shot output to multi-turn planning and execution.