real-time-streaming

Incantation: Natural Language as the Action Interface for Multi-Entity Video World Models

Hugging Face Daily Papers · 2026-05-18

Incantation is an interactive video world model that uses natural language as its action interface, enabling fine-grained multi-entity control and cross-entity generalization. It achieves high performance and real-time streaming through novel attention and distillation techniques.
