Triangle Splats from Video Diffusion Latents (5 minute read)
Summary
FLAT is a method that directly decodes explicit triangle splats from compressed video diffusion latents in a single forward pass, improving geometric accuracy while enabling fast rasterization and physics-based interaction.
View Cached Full Text
Cached at: 06/25/26, 05:08 PM
Similar Articles
FLAT: Feedforward Latent Triangle Splatting for Geometrically Accurate Scene Generation
FLAT proposes a method to decode explicit triangle splats directly from video diffusion latents for geometrically accurate 3D scene generation. It introduces a ray-centered rotation parameterization and a product window function to improve gradient flow, achieving better geometric accuracy than prior feedforward methods while supporting real-time rendering.
reconstructing different angles from live footage
4D Gaussian Splatting is a technique that converts flat 2D images into three-dimensional spatial data, enabling reconstruction of different angles from live footage.
TriSplat: Simulation-Ready Feed-Forward 3D Scene Reconstruction
TriSplat is a feed-forward 3D reconstruction network that uses oriented triangle primitives to directly generate simulation-ready meshes from single images, bypassing expensive post-processing steps. It achieves geometry-faithful reconstructions while maintaining competitive novel-view rendering quality.
VidSplat: Gaussian Splatting Reconstruction with Geometry-Guided Video Diffusion Priors
VidSplat is a training-free generative reconstruction framework that uses video diffusion priors to recover complete 3D scenes from sparse inputs by synthesizing novel views.
GlobalSplat: Efficient Feed-Forward 3D Gaussian Splatting via Global Scene Tokens
GlobalSplat introduces an efficient feed-forward framework for 3D Gaussian splatting that achieves compact and consistent scene reconstruction using global scene tokens, reducing computational overhead and inference time to under 78ms. The method uses a coarse-to-fine training approach to prevent representation bloat while maintaining competitive novel-view synthesis performance with significantly fewer Gaussians (16K) compared to dense baselines.