Tag
PermaVid introduces a multi-modal context memory that disentangles appearance and geometric structure to maintain long-term video consistency after editing operations, outperforming prior methods.