Tag
This paper introduces SEIG, a framework that uses pretrained vision-language models to reconstruct 3D scenes from single images as editable Blender programs through progressive refinement of geometry, materials, composition, and lighting.
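One way to picture the progressive refinement described above is a loop that passes over the four named aspects and appends statements to a growing scene script at each pass. The sketch below is illustrative only and is not SEIG's implementation: the names `SceneProgram`, `stub_proposer`, and `refine`, the image path, and the stage order (taken from the order in which the aspects are listed) are all assumptions, and the vision-language model is replaced by a stub that emits placeholder comments.

```python
from dataclasses import dataclass, field
from typing import Callable, List

# Assumed stage order: the order in which the summary lists the aspects.
STAGES = ["geometry", "materials", "composition", "lighting"]


@dataclass
class SceneProgram:
    """Hypothetical container: an editable scene script kept as source lines."""
    lines: List[str] = field(default_factory=lambda: ["import bpy"])
    completed: List[str] = field(default_factory=list)

    def source(self) -> str:
        # The script is stored as text only; it is never executed here.
        return "\n".join(self.lines)


# Hypothetical proposer signature: (image_path, stage, current_source) -> new lines.
Proposer = Callable[[str, str, str], List[str]]


def stub_proposer(image_path: str, stage: str, current_source: str) -> List[str]:
    """Stand-in for a vision-language model; returns one placeholder comment."""
    return [f"# TODO({stage}): statements proposed for {image_path}"]


def refine(image_path: str, propose: Proposer = stub_proposer) -> SceneProgram:
    """Run one refinement pass per stage, each seeing the script built so far."""
    program = SceneProgram()
    for stage in STAGES:
        program.lines.extend(propose(image_path, stage, program.source()))
        program.completed.append(stage)
    return program


program = refine("scene.png")  # "scene.png" is a placeholder path
print(program.source())
```

Because the result stays a plain script, a user could edit any stage's lines by hand afterwards, which is the sense in which the reconstructed scene is "editable".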