Triangle Splats from Video Diffusion Latents (5 minute read)

TLDR AI Papers

Summary

FLAT is a method that directly decodes explicit triangle splats from compressed video diffusion latents in a single forward pass, improving geometric accuracy while enabling fast rasterization and physics-based interaction.

Google's FLAT introduces a feedforward method that decodes triangle splats directly from video diffusion latents, improving geometric accuracy over 3D Gaussian-based approaches.
Original Article
View Cached Full Text

Cached at: 06/25/26, 05:08 PM

# FLAT | Feedforward Latent Triangle Splatting Source: [https://flat-splat.github.io/](https://flat-splat.github.io/) Feedforward Latent Triangle Splatting for geometrically accurate scene generation\. Decode explicit surface\-aligned triangle splats from video diffusion latents in a single forward pass\. Orest Kupyn1,2, Goutam Bhat1, Philipp Henzler1, Fabian Manhardt1, Christian Rupprecht1,2, Federico Tombari1,3 1Google Research 2University of Oxford, Visual Geometry Group 3Technical University of Munich FLAT shows that compressed video diffusion latents can be mapped directly to explicit non\-volumetric scene parameters\. Instead of decoding 3D Gaussians, it predicts triangle splats in one pass, improving geometric accuracy while preserving competitive visual quality and enabling rasterization with simple triangle renderers and physics\-based interaction after lightweight refinement\. **Direct Triangle Decoding**FLAT turns compressed video diffusion latents into explicit triangle splats directly, avoiding the usual generate\-then\-optimize path used by many feedforward scene pipelines\. **Geometry\-Specific Training**Ray\-centered triangle parameterization and a product window rendering function stabilize triangle regression, where small orientation errors would otherwise break gradient flow\. **Refinement to Opaque Assets**A lightweight test\-time refinement step converts the predicted triangle soup into a fully opaque representation that fits standard rendering and game\-engine\-style interaction\. ## Inspect generated scenes as explicit triangle geometry\. FLAT outputs scenes that can be explored immediately with a simple triangle renderer\. This makes the viewer fast and portable across devices, without depending on a heavy rendering engine\. On touch devices, drag inside the scene to look around and use the on\-screen movement buttons to navigate\. **Navigation**WASDmove, drag to look,Rto reset\. **Tip**Double\-click anywhere in the viewport to snap back to the default view\. ## Appearance and surface structure stay aligned\. We target geometric accuracy, not only image realism\. These paired renders show that FLAT's novel views and surface normals stay consistent across viewpoints, making the geometry signal legible instead of hiding it behind appearance alone\. Novel View Surface Normals 01 / 07 **BibTeX** ``` @misc{kupyn2026flat, title = {FLAT: Feedforward Latent Triangle Splatting for Geometrically Accurate Scene Generation}, author = {Orest Kupyn and Goutam Bhat and Philipp Henzler and Fabian Manhardt and Christian Rupprecht and Federico Tombari}, year = {2026}, note = {Preprint} } ```

Similar Articles

FLAT: Feedforward Latent Triangle Splatting for Geometrically Accurate Scene Generation

Hugging Face Daily Papers

FLAT proposes a method to decode explicit triangle splats directly from video diffusion latents for geometrically accurate 3D scene generation. It introduces a ray-centered rotation parameterization and a product window function to improve gradient flow, achieving better geometric accuracy than prior feedforward methods while supporting real-time rendering.

TriSplat: Simulation-Ready Feed-Forward 3D Scene Reconstruction

Hugging Face Daily Papers

TriSplat is a feed-forward 3D reconstruction network that uses oriented triangle primitives to directly generate simulation-ready meshes from single images, bypassing expensive post-processing steps. It achieves geometry-faithful reconstructions while maintaining competitive novel-view rendering quality.

GlobalSplat: Efficient Feed-Forward 3D Gaussian Splatting via Global Scene Tokens

Hugging Face Daily Papers

GlobalSplat introduces an efficient feed-forward framework for 3D Gaussian splatting that achieves compact and consistent scene reconstruction using global scene tokens, reducing computational overhead and inference time to under 78ms. The method uses a coarse-to-fine training approach to prevent representation bloat while maintaining competitive novel-view synthesis performance with significantly fewer Gaussians (16K) compared to dense baselines.