World models: how close are we to something usable in a real product?

Reddit r/singularity News

Summary

An indie developer building a voice-first learning game for kids asks whether interactive world models will be production-ready within 12–18 months or if pre-rendered assets plus real-time avatars are the better near-term path.

I'm a dad of two (8 and 10) building a voice-first learning game for kids 6-12. Think Carmen Sandiego, but the kid is inside the adventure, talking to characters and solving the plot as they learn. Today I'm using 2D Rive animations driven by LLM reactions. Kids engage, but the ceiling is low. What I actually want is a real-time rendered character and world that the agent can direct moment to moment. So I've been tracking Genie 3, Odyssey, World Labs, and the avatar side (Runway, Anam). My working thesis is that within 18 months, the convergence of interactive real-time world models and real-time avatars will reach a usable production level. Is anyone here actually shipping or prototyping on a world model today, outside demos? Does 12-18 months feel reasonable, or am I being optimistic? And for a scripted-adventure use case (known characters, recurring world, narrative beats), is a world model the right primitive, or is it overkill vs. stitched pre-gen assets + a real-time avatar layer?
Original Article

Similar Articles

Do you think World Models will lead to AGI?

Reddit r/ArtificialInteligence

A discussion on whether world models, which learn internal environment representations to simulate physics and plan actions, could lead to AGI by overcoming the limitations of reactive predictive text models like LLMs.

Genie 3: A new frontier for world models

Google DeepMind Blog

DeepMind announces Genie 3, a general-purpose world model capable of generating interactive environments from text prompts at 24fps in 720p with improved consistency and real-time interactivity compared to previous versions.

@drfeifei: https://x.com/drfeifei/status/2062247238143996275

X AI KOLs Timeline

Fei-Fei Li and the World Labs team present a functional taxonomy of world models, distinguishing between renderers, physics engines, and other components within the reinforcement learning loop, and arguing that spatial intelligence is AI's next frontier.