Tag
VGGT-Edit is a feed-forward framework for text-conditioned, native 3D scene editing. It combines depth-synchronized text injection with residual field prediction, and is reported to outperform 2D-lifting approaches in both quality and efficiency.