interactive-video

Light Interaction: Training-Free Inference Acceleration for Interactive Video World Models

Hugging Face Daily Papers · 2026-05-29

Light Interaction introduces a training-free inference acceleration framework for interactive video world models. It combines adaptive context management, denoising cache acceleration, and 3D block sparse attention to achieve a speedup of up to 2.59x while maintaining competitive visual quality.

Incantation: Natural Language as the Action Interface for Multi-Entity Video World Models

Hugging Face Daily Papers · 2026-05-18

Incantation presents an interactive video world model that uses natural language as the action interface for fine-grained multi-entity control and cross-entity generalization, achieving high performance and real-time streaming through novel attention and distillation techniques.

Echo-Forcing: A Scene Memory Framework for Interactive Long Video Generation

Hugging Face Daily Papers · 2026-05-15

Echo-Forcing introduces a scene memory framework for interactive long video generation, using hierarchical temporal memory, scene recall frames, and difference-aware memory decay to handle prompt switching and long-term recall. The method is training-free and achieves strong performance on VBench-Long.
