BRDFusion: Physics Meets Generation for Urban Scene Inverse Rendering
Summary
BRDFusion combines physical modeling with generative priors to achieve high-quality inverse and forward rendering of urban scenes, enabling applications like novel-view relighting and dynamic object insertion.
View Cached Full Text
Cached at: 06/16/26, 11:34 AM
Paper page - BRDFusion: Physics Meets Generation for Urban Scene Inverse Rendering
Source: https://huggingface.co/papers/2606.17049
Abstract
BRDFusion combines physical modeling and generative priors to achieve high-quality inverse and forward rendering of urban scenes with precise control and artifact reduction.
Inverse renderingof urban scenes from captured videos enables numerous applications, including content creation and autonomous driving simulation. Physically-based rendering methods follow and control lighting physics, but suffer from reconstruction and rendering artifacts. Whilegenerative modelsproduce realistic videos, they offer limited consistency and controllability. We presentBRDFusion, a unified framework that combines two complementary models for inverse andforward rendering. Specifically,BRDFusionrecovers explicit, consistentscene propertieswithphysical modelingand alleviatesoptimization ambiguitywith generative priors. Duringforward rendering, the physical model providescontrollable renderingfrom the scene configuration, and the generative model denoises and fixes artifacts. Therefore, our method produces high-quality videos while allowing precise control, outperforming baselines in real and synthetic scenes. Moreover,BRDFusionsupportsnovel-view relighting,night simulation, anddynamic object insertion/editing. Project page: https://shigon255.github.io/brdfusion-page/
View arXiv pageView PDFProject pageGitHub10Add to collection
Models citing this paper1
#### Shigon/BRDFusion_checkpoints Updatedabout 8 hours ago
Datasets citing this paper2
#### Shigon/BRDFusion_data Viewer• Updatedabout 8 hours ago • 112k • 83 #### Shigon/BRDFusion_videos Viewer• Updatedabout 8 hours ago • 495 • 24
Spaces citing this paper0
No Space linking this paper
Cite arxiv.org/abs/2606.17049 in a Space README.md to link it from this page.
Collections including this paper0
No Collection including this paper
Add this paper to acollectionto link it from this page.
Similar Articles
GenRecon: Bridging Generative Priors for Multi-View 3D Scene Reconstruction
GenRecon introduces a method for 3D scene reconstruction that integrates generative 3D priors with multi-view image conditioning, achieving high-fidelity, editable mesh reconstructions of indoor environments and outperforming existing methods by 16%.
PhysiFormer: Learning to Simulate Mechanics in World Space
PhysiFormer uses coordinate-space diffusion to generate physically-plausible 3D object motions without explicit inductive biases, enabling efficient multi-object reasoning and generalization to complex materials and geometries.
CityRAG: Stepping Into a City via Spatially-Grounded Video Generation
CityRAG introduces a video generative model that produces long, physically grounded, 3D-consistent videos of real-world cities using geo-registered data, enabling realistic navigation and simulation for robotics and autonomous driving.
WildRelight: A Real-World Benchmark and Physics-Guided Adaptation for Single-Image Relighting
This paper introduces WildRelight, a new real-world benchmark dataset for single-image relighting that addresses the gap between synthetic and natural scenes. It proposes a physics-guided adaptation framework using diffusion posterior sampling and test-time adaptation to improve model performance on real-world data.
ABot-Earth 0.5: Generative 3D Earth Model
ABot-Earth 0.5 is a generative 3D framework that synthesizes realistic 3D urban environments from satellite imagery using 3D Gaussian Splatting, enabling real-time visualization and closed-loop UAV navigation at low cost.