depth-guided-warping

Tag

Cards List
#depth-guided-warping

Latent Spatial Memory for Video World Models

Hugging Face Daily Papers · 2026-06-08 Cached

This paper introduces latent spatial memory for video world models, storing 3D scene information directly in diffusion latent space to avoid costly pixel-space reconstruction. The proposed Mirage framework achieves up to 10.57x faster generation and 55x memory reduction while achieving state-of-the-art performance on WorldScore and RealEstate10K.

0 favorites 0 likes
← Back to home

Submit Feedback