video-world-models

Tag

Cards List
#video-world-models

StressDream: Steering Video World Models for Robust Policy Evaluation and Improvement

Hugging Face Daily Papers · 2026-05-29 Cached

StressDream enhances video world models by steering diffusion-based imaginations toward high-impact yet plausible outcomes through optimized noise initialization with semantic and plausibility objectives, enabling robust policy evaluation and improvement.

0 favorites 0 likes
#video-world-models

minWM: A Full-Stack Open-Source Framework for Real-Time Interactive Video World Models

Hugging Face Daily Papers · 2026-05-28 Cached

minWM is a full-stack open-source framework that converts bidirectional video diffusion models into real-time interactive video world models with controllable camera, low-latency rollout, and modular architecture.

0 favorites 0 likes
#video-world-models

Incantation: Natural Language as the Action Interface for Multi-Entity Video World Models

Hugging Face Daily Papers · 2026-05-18 Cached

Incantation presents an interactive video world model that uses natural language as the action interface for fine-grained multi-entity control and cross-entity generalization, achieving high performance and real-time streaming through novel attention and distillation techniques.

0 favorites 0 likes
#video-world-models

MultiWorld: Scalable Multi-Agent Multi-View Video World Models

Hugging Face Daily Papers · 2026-04-20 Cached

MultiWorld is a unified framework for multi-agent multi-view video world modeling that achieves accurate control of multiple agents while maintaining multi-view consistency through a Multi-Agent Condition Module and Global State Encoder.

0 favorites 0 likes
← Back to home

Submit Feedback