BRDFusion: Physics Meets Generation for Urban Scene Inverse Rendering

Hugging Face Daily Papers Papers

Summary

BRDFusion combines physical modeling with generative priors to achieve high-quality inverse and forward rendering of urban scenes, enabling applications like novel-view relighting and dynamic object insertion.

Inverse rendering of urban scenes from captured videos enables numerous applications, including content creation and autonomous driving simulation. Physically-based rendering methods follow and control lighting physics, but suffer from reconstruction and rendering artifacts. While generative models produce realistic videos, they offer limited consistency and controllability. We present BRDFusion, a unified framework that combines two complementary models for inverse and forward rendering. Specifically, BRDFusion recovers explicit, consistent scene properties with physical modeling and alleviates optimization ambiguity with generative priors. During forward rendering, the physical model provides controllable rendering from the scene configuration, and the generative model denoises and fixes artifacts. Therefore, our method produces high-quality videos while allowing precise control, outperforming baselines in real and synthetic scenes. Moreover, BRDFusion supports novel-view relighting, night simulation, and dynamic object insertion/editing. Project page: https://shigon255.github.io/brdfusion-page/
Original Article
View Cached Full Text

Cached at: 06/16/26, 11:34 AM

Paper page - BRDFusion: Physics Meets Generation for Urban Scene Inverse Rendering

Source: https://huggingface.co/papers/2606.17049

Abstract

BRDFusion combines physical modeling and generative priors to achieve high-quality inverse and forward rendering of urban scenes with precise control and artifact reduction.

Inverse renderingof urban scenes from captured videos enables numerous applications, including content creation and autonomous driving simulation. Physically-based rendering methods follow and control lighting physics, but suffer from reconstruction and rendering artifacts. Whilegenerative modelsproduce realistic videos, they offer limited consistency and controllability. We presentBRDFusion, a unified framework that combines two complementary models for inverse andforward rendering. Specifically,BRDFusionrecovers explicit, consistentscene propertieswithphysical modelingand alleviatesoptimization ambiguitywith generative priors. Duringforward rendering, the physical model providescontrollable renderingfrom the scene configuration, and the generative model denoises and fixes artifacts. Therefore, our method produces high-quality videos while allowing precise control, outperforming baselines in real and synthetic scenes. Moreover,BRDFusionsupportsnovel-view relighting,night simulation, anddynamic object insertion/editing. Project page: https://shigon255.github.io/brdfusion-page/

View arXiv pageView PDFProject pageGitHub10Add to collection

Models citing this paper1

#### Shigon/BRDFusion_checkpoints Updatedabout 8 hours ago

Datasets citing this paper2

#### Shigon/BRDFusion_data Viewer• Updatedabout 8 hours ago • 112k • 83 #### Shigon/BRDFusion_videos Viewer• Updatedabout 8 hours ago • 495 • 24

Spaces citing this paper0

No Space linking this paper

Cite arxiv.org/abs/2606.17049 in a Space README.md to link it from this page.

Collections including this paper0

No Collection including this paper

Add this paper to acollectionto link it from this page.

Similar Articles

PhysiFormer: Learning to Simulate Mechanics in World Space

Hugging Face Daily Papers

PhysiFormer uses coordinate-space diffusion to generate physically-plausible 3D object motions without explicit inductive biases, enabling efficient multi-object reasoning and generalization to complex materials and geometries.

ABot-Earth 0.5: Generative 3D Earth Model

Hugging Face Daily Papers

ABot-Earth 0.5 is a generative 3D framework that synthesizes realistic 3D urban environments from satellite imagery using 3D Gaussian Splatting, enabling real-time visualization and closed-loop UAV navigation at low cost.