Decoupled Residual Denoising Diffusion Models for Unified and Data Efficient Image-to-Image Translation

Hugging Face Daily Papers 05/31/26, 06:38 AM Papers

Summary

This paper proposes Decoupled Residual Denoising Diffusion Models (DRDD) for unified and data-efficient image-to-image translation, decoupling noise diffusion for domain harmonization from residual diffusion for semantic mapping.

We propose Decoupled Residual Denoising Diffusion models (DRDD) for unified and data-efficient image-to-image (I2I) translation. While diffusion models have advanced I2I translation in terms of quality and diversity, we uncover a previously under-explored property in diffusion models. Crucially, beyond its conventional role of manifold lifting (i.e., moving data off low-dimensional manifolds), injecting Gaussian noise facilitates domain harmonization by implicitly aligning feature distributions across domains, a property particularly advantageous for unified I2I translation. However, existing diffusion models prematurely erode this harmonization effect, as noise and residuals are simultaneously removed in a single coupled diffusion process. To address this, DRDD decouples the diffusion process into two sequential and independent diffusion stages: (1) a stochastic noise diffusion for domain harmonization and manifold lifting, and (2) a deterministic residual diffusion that learns the core semantic mapping entirely within the fixed-noise domain. This decoupling preserves harmonization and manifold lifting effects throughout the transformation, substantially simplifying the learning of unified mappings across diverse tasks and domains. Notably, the noise diffusion stage is trained exclusively on abundant, unpaired target-domain images, greatly improving data efficiency. Comprehensive theoretical and empirical analysis demonstrates that DRDD is broadly compatible with mainstream diffusion models and consistently delivers robust, unified I2I translation, even under limited paired data. Our code is available at https://github.com/HKU-HealthAI/DRDD.

Original Article

View Cached Full Text

Cached at: 06/03/26, 07:36 AM

Paper page - Decoupled Residual Denoising Diffusion Models for Unified and Data Efficient Image-to-Image Translation

Source: https://huggingface.co/papers/2606.01048

Abstract

Decoupled Residual Denoising Diffusion models (DRDD) improve unified image-to-image translation by separating noise diffusion for domain harmonization from residual diffusion for semantic mapping, enhancing data efficiency and performance.

We propose Decoupled Residual DenoisingDiffusion models(DRDD) for unified and data-efficient image-to-image (I2I) translation. Whilediffusion modelshave advanced I2I translation in terms of quality and diversity, we uncover a previously under-explored property indiffusion models. Crucially, beyond its conventional role ofmanifold lifting(i.e., moving data off low-dimensional manifolds), injecting Gaussian noise facilitatesdomain harmonizationby implicitly aligning feature distributions across domains, a property particularly advantageous forunified I2I translation. However, existingdiffusion modelsprematurely erode this harmonization effect, as noise and residuals are simultaneously removed in a single coupled diffusion process. To address this, DRDD decouples the diffusion process into two sequential and independent diffusion stages: (1) a stochasticnoise diffusionfordomain harmonizationandmanifold lifting, and (2) a deterministicresidual diffusionthat learns the core semantic mapping entirely within the fixed-noise domain. This decoupling preserves harmonization andmanifold liftingeffects throughout the transformation, substantially simplifying the learning of unified mappings across diverse tasks and domains. Notably, thenoise diffusionstage is trained exclusively on abundant, unpaired target-domain images, greatly improvingdata efficiency. Comprehensive theoretical and empirical analysis demonstrates that DRDD is broadly compatible with mainstreamdiffusion modelsand consistently delivers robust,unified I2I translation, even under limited paired data. Our code is available at https://github.com/HKU-HealthAI/DRDD.

View arXiv page View PDF GitHub6 Add to collection

Get this paper in your agent:

hf papers read 2606\.01048

Don’t have the latest CLI?curl \-LsSf https://hf\.co/cli/install\.sh \| bash

Models citing this paper0

No model linking this paper

Cite arxiv.org/abs/2606.01048 in a model README.md to link it from this page.

Datasets citing this paper0

No dataset linking this paper

Cite arxiv.org/abs/2606.01048 in a dataset README.md to link it from this page.

Spaces citing this paper0

No Space linking this paper

Cite arxiv.org/abs/2606.01048 in a Space README.md to link it from this page.

Collections including this paper0

No Collection including this paper

Add this paper to acollectionto link it from this page.

Decoupled Residual Denoising Diffusion Models for Unified and Data Efficient Image-to-Image Translation

Paper page - Decoupled Residual Denoising Diffusion Models for Unified and Data Efficient Image-to-Image Translation

Abstract

Models citing this paper0

Datasets citing this paper0

Spaces citing this paper0

Collections including this paper0

Similar Articles

UniDDT: Unifying Multimodal Understanding and Generation with Decoupled Diffusion Transformer

RepFusion: Leveraging Multimodal Priors for Denoising in Representation Space

Uniform Diffusion Models Revisited: Leave-One-Out Denoiser and Absorbing State Reformulation

Drifting Objectives for Refining Discrete Diffusion Language Models

MMDiff: Extending Diffusion Transformers for Multi-Modal Generation

Submit Feedback

Similar Articles

UniDDT: Unifying Multimodal Understanding and Generation with Decoupled Diffusion Transformer

RepFusion: Leveraging Multimodal Priors for Denoising in Representation Space

Uniform Diffusion Models Revisited: Leave-One-Out Denoiser and Absorbing State Reformulation

Drifting Objectives for Refining Discrete Diffusion Language Models

MMDiff: Extending Diffusion Transformers for Multi-Modal Generation