SPRITE: From Static Mockups to Engine-Ready Game UI
Summary
SPRITE introduces a pipeline that converts static game UI screenshots into editable engine assets using vision-language models and YAML to handle complex layouts and nesting.
View Cached Full Text
Cached at: 04/22/26, 06:17 AM
Paper page - SPRITE: From Static Mockups to Engine-Ready Game UI
Source: https://huggingface.co/papers/2604.18591
Abstract
SPRITE enables automated conversion of game UI screenshots into editable engine assets by combining vision-language models with structured YAML representation to handle complex layouts and nesting.
Game UIimplementation requires translating stylized mockups into interactive engine entities. However, current “Screenshot-to-Code” tools often struggle with the irregular geometries and deep visual hierarchies typical of game interfaces. To bridge this gap, we introduce SPRITE, a pipeline that transforms static screenshots into editableengine assets. By integratingVision-Language Models(VLMs) with a structuredYAMLintermediate representation, SPRITE explicitly captures complex container relationships and non-rectangular layouts. We evaluated SPRITE against a curatedGame UIbenchmark and conducted expert reviews with professional developers to assess reconstruction fidelity and prototyping efficiency. Our findings demonstrate that SPRITE streamlines development by automating tedious coding and resolving complex nesting. By facilitating rapid in-engine iteration, SPRITE effectively blurs the boundaries betweenartistic designandtechnical implementationin game development. Project page: https://baiyunshu.github.io/sprite.github.io/
View arXiv pageView PDFProject pageAdd to collection
Get this paper in your agent:
hf papers read 2604\.18591
Don’t have the latest CLI?curl \-LsSf https://hf\.co/cli/install\.sh \| bash
Models citing this paper0
No model linking this paper
Cite arxiv.org/abs/2604.18591 in a model README.md to link it from this page.
Datasets citing this paper0
No dataset linking this paper
Cite arxiv.org/abs/2604.18591 in a dataset README.md to link it from this page.
Spaces citing this paper0
No Space linking this paper
Cite arxiv.org/abs/2604.18591 in a Space README.md to link it from this page.
Collections including this paper0
No Collection including this paper
Add this paper to acollectionto link it from this page.
Similar Articles
@VraserX: This looks like a pretty big step for AI design. What stands out to me is that this is not just about generating pretty…
Genspark launches Genspark Design, an AI design tool powered by Claude Opus 4.7 that can create UI prototypes, posters, videos, HTML animations, and convert designs into code, aiming to be a full creative production tool.
Making Graphics Like it's 1993
A developer details building Catlantean 3D, a first-person shooter using 1993-era graphics techniques (256 colors, 320x240 resolution, hand-crafted assets, no AI) with a planned Steam release, focusing on palette rendering and asset creation.
@Designarena: Announcing Agentic Game Development on Design Arena - our newest multi-file, multi-turn evaluation. A sneak peek of wha…
Design Arena announces Agentic Game Development, a new multi-file, multi-turn evaluation with access to asset catalogs, built-in libraries like Howler and Tween.js, and expanded tool calls for sprite generation and asset discovery.
GUICrafter: Weakly-Supervised GUI Agent Leveraging Massive Unannotated Screenshots
GUICrafter introduces a weakly-supervised GUI agent that leverages massive unannotated screenshots and a two-stage curriculum learning framework to reduce reliance on expensive human annotations, achieving competitive performance with advanced systems like UI-TARS using only 0.1% of its data.
Genspark Design
Genspark Design is an AI tool that generates UI prototypes, videos, and posters.