ZipSplat: Fewer Gaussians, Better Splats
Summary
ZipSplat is a token-based feed-forward 3D Gaussian Splatting model that uses k-means clustering to decouple Gaussian placement from the pixel grid, achieving ~6x fewer Gaussians while setting new state-of-the-art results on DL3DV and RealEstate10K without requiring ground-truth poses or intrinsics.
Similar Articles
GlobalSplat: Efficient Feed-Forward 3D Gaussian Splatting via Global Scene Tokens
GlobalSplat introduces an efficient feed-forward framework for 3D Gaussian splatting that achieves compact and consistent scene reconstruction using global scene tokens, reducing computational overhead and inference time to under 78ms. The method uses a coarse-to-fine training approach to prevent representation bloat while maintaining competitive novel-view synthesis performance with significantly fewer Gaussians (16K) compared to dense baselines.
Gaussian Point Splatting
Researchers propose Gaussian Point Splatting, a stochastic rendering method using pixel-sized opaque points and 64-bit GPU atomics that scales to hundreds of millions of Gaussians in real time. The method, accepted at SIGGRAPH 2026, employs hierarchical culling and parallel programming primitives to achieve even workload distribution with only minor noise differences compared to original Gaussian splatting.
Gaussian Splat of a Strawberry
A Gaussian splat of a strawberry, created from 90 perspectives with focus stacking, using slang-splat for training and SuperSplat for viewing.
VidSplat: Gaussian Splatting Reconstruction with Geometry-Guided Video Diffusion Priors
VidSplat is a training-free generative reconstruction framework that uses video diffusion priors to recover complete 3D scenes from sparse inputs by synthesizing novel views.
SplatWeaver: Learning to Allocate Gaussian Primitives for Generalizable Novel View Synthesis
SplatWeaver is a feed-forward novel view synthesis framework that dynamically allocates 3D Gaussian primitives based on spatial complexity, improving rendering quality and efficiency over fixed-allocation methods. It leverages cardinality Gaussian experts and a pixel-level routing scheme guided by high-frequency priors to adaptively distribute primitives across complex and smooth scene regions.