visual-world-models

Trimming the Long-Tail of Visual World Modeling Evaluation

Hugging Face Daily Papers · 2026-06-23

This paper introduces Tailor-Bench, a benchmark that systematically evaluates visual world models on irregular physical interactions. It reveals a long-tail generalization gap: models perform well on common scenarios but degrade on unconventional and impossible ones.
