world-models

#world-models

Summary: Gemini Co-Lead on World Models, RL's Next Domains & Continual Learning

Reddit r/artificial ↗ · 6h ago Cached

A summary of Oriol Vinyals' discussion on Google's Gemini models, world models, multimodal AI, agents, and challenges like continual learning and true innovation.

1 favorites 1 likes

#world-models

Conformal Orbit-Valid Trust Horizons for Equivariant World Models

arXiv cs.LG ↗ · 8h ago Cached

This paper proposes a method to certify the trust horizon of latent world models with known group symmetries by calibrating a raw error-propagation curve using split-conformal prediction and leveraging equivariance to transport certificates over the entire group orbit. The approach provides finite-sample guarantees and demonstrates non-vacuous certificates on symmetric 2D and 3D substrates.

0 favorites 0 likes

#world-models

When Do Conservation Laws Survive Learned Representations? Certified Horizons for Latent World Models

arXiv cs.LG ↗ · 8h ago Cached

This paper studies when conservation laws can be certified in learned latent world models, proposing bounded horizons that guarantee how long rollouts stay on physical invariant level sets using measurable model defects.

0 favorites 0 likes

#world-models

@rohanpaul_ai: New Microsoft paper argues that transformers generalize better when they learn compact internal states, not just next t…

X AI KOLs Timeline ↗ · yesterday Cached

Microsoft's NextLat paper proposes a self-supervised training method where transformers predict their next hidden state instead of just the next token, leading to more compact world models, better planning and reasoning, and up to 3.3x faster generation.

0 favorites 0 likes

#world-models

Qwen-AgentWorld: Language World Models for General Agents

Hacker News Top ↗ · yesterday Cached

Qwen-AgentWorld introduces language world models for agentic environments, covering seven domains with long chain-of-thought reasoning. The work includes a new benchmark, AgentWorldBench, and shows that world modeling improves downstream agent performance.

0 favorites 0 likes

#world-models

Causal-rCM: A Unified Teacher-Forcing and Self-Forcing Open Recipe for Autoregressive Diffusion Distillation in Streaming Video Generation and Interactive World Models

Hugging Face Daily Papers ↗ · yesterday Cached

This paper introduces Causal-rCM, a unified teacher-forcing and self-forcing framework for autoregressive diffusion distillation in streaming video generation and interactive world models, achieving state-of-the-art performance with fast convergence.

0 favorites 0 likes

#world-models

@AlphaSignalAI: https://x.com/AlphaSignalAI/status/2069424192274252094

X AI KOLs Timeline ↗ · yesterday Cached

Microsoft's NextLat introduces a training objective that rewards belief-state representations instead of relying solely on next-token prediction, pushing models toward compact world models for better generalization.

0 favorites 0 likes

#world-models

@garridoq_: After 4.5 formative years at FAIR, I am thrilled to join AMI Labs as a Member of Technical Staff ! I'm looking forward …

X AI KOLs Timeline ↗ · 2d ago Cached

After 4.5 years at FAIR, a researcher joins AMI Labs to work on JEPA and World Models.

0 favorites 0 likes

#world-models

@rohanpaul_ai: Can LLM agents actually discover hidden rules by interacting? The answer is uncomfortable. The more complicated the hid…

X AI KOLs Following ↗ · 3d ago Cached

This paper investigates whether LLM agents can infer hidden world models through interaction, finding that they struggle to build stable internal models as complexity increases.

0 favorites 0 likes

#world-models

Reward as An Agent for Embodied World Models

arXiv cs.AI ↗ · 5d ago Cached

This paper introduces Reward as an Agent and DynDiff-GRPO to address reward hacking and limited exploration in reinforcement learning for embodied world models, achieving significant accuracy gains.

0 favorites 0 likes

#world-models

@gkxspace: LLM is likely just the first stop for AI large models. Professor Biwei Huang divides AI paradigms into four generations: First generation (1990s): Small models learn correlations. Second generation (2010s): Small models learn causation. Third generation (current LLMs): Large models learn correlations. Fourth generation (next step): Large models learn causation. Over 30 years, models have grown from small to large...

X AI KOLs Timeline ↗ · 2026-06-18 Cached

Professor Biwei Huang proposes a four-generation theory of AI paradigms, believing LLMs are just the first step, and the future lies in causal world models. Aether AI has completed a $20 million funding round, dedicated to building causal world models.

0 favorites 0 likes

#world-models

Current World Models Lack a Persistent State Core

Hugging Face Daily Papers ↗ · 2026-06-18 Cached

This paper argues that current world models lack a persistent state core, proposing a hybrid approach that adds temporal-causal structure via η-pseudo-unitary operator dynamics to convert pretrained GPT-2 into a time-reasoning model.

0 favorites 0 likes

#world-models

Lin Junyang AI Lab Closes Round at $2B Valuation

Reddit r/LocalLLaMA ↗ · 2026-06-17 Cached

Lin Junyang, former head of Alibaba's Qianwen team, closed his AI lab's first financing round at a $2B post-money valuation, with Gao Rong and Sequoia China each investing $100M and Tencent adding $20M. The lab will focus on world models and embodied intelligence rather than general LLMs.

0 favorites 0 likes

#world-models

@odysseyml: We’ve raised a $310M Series B to accelerate world models! We believe AI that can understand and simulate the world will…

X AI KOLs Following ↗ · 2026-06-17 Cached

OdysseyML announces a $310M Series B funding round to advance world models, with backing from Natural Capital, Amazon, GV, AMD, and IQT.

0 favorites 0 likes

#world-models

Next-Latent Prediction Transformers [R]

Reddit r/MachineLearning ↗ · 2026-06-17

Microsoft Research introduces Next-Latent Prediction (NextLat), a self-supervised method that trains transformers to predict their own next latent state, enabling compact world models for reasoning and planning and achieving up to 3.3x faster inference via self-speculative decoding.

0 favorites 0 likes

#world-models

@rohanpaul_ai: Language had a strange advantage robotics does not: Text is already a compressed, shared interface for human thought, w…

X AI KOLs Following ↗ · 2026-06-16 Cached

Discusses the challenges facing embodied AI and robotics, including a 100,000-year data gap and lack of shared benchmarks, and highlights startup opportunities in data loops, eval systems, and deployment.

0 favorites 0 likes

#world-models

@dair_ai: Can an LLM agent actually build a model of an environment it cannot see? This work makes the question gradeable. An age…

X AI KOLs Following ↗ · 2026-06-16 Cached

A research paper proposes agentic automata learning to evaluate whether LLM agents can infer hidden world models through interaction, finding that performance drops sharply as task complexity increases and that reasoning models outperform non-reasoning ones but still struggle.

0 favorites 0 likes

#world-models

How Should World Models Be Evaluated? A Decision-Making-Centric Position

arXiv cs.LG ↗ · 2026-06-16 Cached

This paper surveys evaluation methods for world models and argues for a decision-making-centric framework that prioritizes counterfactual reasoning, planning, and policy optimization over visual quality. It introduces an L0–L7 evaluation ladder and a benchmark protocol to align evaluation with claimed utility.

0 favorites 0 likes

#world-models

@DengHokin: I am super excited to share that I launch a weekly Video Model Journal Club. Every week we pick one paper and go deep, …

X AI KOLs Timeline ↗ · 2026-06-16 Cached

The author launches a weekly Video Model Journal Club covering video generation, world models, physical reasoning, diffusion, flow matching, etc. The first in-person talk will be by Yilun Du on Embodied Reasoning with World Models.

0 favorites 0 likes

#world-models

Kairos: A Native World Model Stack for Physical AI

Hugging Face Daily Papers ↗ · 2026-06-16 Cached

Kairos is a native world model framework for Physical AI that learns from diverse experiences using a cross-embodiment data curriculum, maintains persistent states with hybrid temporal attention, and supports efficient deployment on server and consumer hardware.

0 favorites 0 likes

world-models

Submit Feedback