Tag
JamSet and JamBench are introduced as a dataset and benchmark for project-level game code generation on the Godot engine, derived from Game Jam projects, with evaluation showing a capability cliff for AI models as project scale increases.