Tag
Qwen-Image-Bench, a new creator-centric benchmark for text-to-image generation, evaluates models on real-world fidelity and creative generation using a hierarchical taxonomy of 56 verifiable facets, each scored by a unified judge model.