Tag
SCOPE is a specification-guided framework for text-to-image generation that tracks semantic commitments to better fulfill complex visual intents. It introduces the Gen-Arena benchmark and demonstrates strong performance on complex generation tasks.