interactive-world-models

Tag

Cards List
#interactive-world-models

WBench: A Comprehensive Multi-turn Benchmark for Interactive Video World Model Evaluation

Hugging Face Daily Papers · 2026-05-25 Cached

WBench is a comprehensive multi-turn benchmark for evaluating interactive world models across five dimensions using 289 test cases and 1,058 interaction turns, providing automatic sub-metrics and diagnostic insights. It reveals that no single model excels across all dimensions.

0 favorites 0 likes
← Back to home

Submit Feedback