Orthogonal Concept Erasure for Diffusion Models

arXiv cs.AI Papers

Summary

This paper introduces Orthogonal Concept Erasure (OCE), a method for precisely removing target concepts from diffusion models using multiplicative orthogonal parameter updates, enabling efficient single- and multi-concept erasure up to 100 concepts in seconds.

arXiv:2605.28902v1 Announce Type: new Abstract: Concept erasure has emerged as a promising approach to mitigate undesired or unsafe content in diffusion models, yet existing methods still face significant limitations. While training-based methods are effective, their high computational cost limits scalability. Editing-based methods are more efficient and deployment-friendly, yet they struggle to simultaneously achieve precise concept erasure and preserve overall generative capacity. We identify this core limitation of the editing-based methods as reliance on additive parameter updates. Our empirical analysis reveals that concept semantics primarily depend on neuron direction rather than neuron magnitude, while overall generative capacity relies on the angular geometry of neurons. As additive updates inherently entangle direction, magnitude, and angular geometry, they inevitably introduce unintended interference between concept erasure and overall generation performance. To address this, we propose Orthogonal Concept Erasure (OCE), which reformulates editing-based erasure as multiplicative parameter updates from a geometric perspective. Specifically, OCE applies layer-wise orthogonal transformations derived from a closed-form solution to the parameters, enabling precise concept erasure while preserving the neuron magnitude and angular geometry. Furthermore, to address conflicting constraints in multi-concept erasure, OCE introduces a subspace-level objective with structured subspace manipulation, yielding a more effective and scalable erasure. Extensive experiments on single- and multi-concept erasure demonstrate that OCE outperforms existing methods in concept erasure and non-target preservation, erasing up to 100 concepts in 4.3 s. Code: https://github.com/HansSunY/OCE.
Original Article
View Cached Full Text

Cached at: 05/29/26, 09:11 AM

# Orthogonal Concept Erasure for Diffusion Models
Source: [https://arxiv.org/html/2605.28902](https://arxiv.org/html/2605.28902)
###### Abstract

Concept erasure has emerged as a promising approach to mitigate undesired or unsafe content in diffusion models, yet existing methods still face significant limitations\. While training\-based methods are effective, their high computational cost limits scalability\. Editing\-based methods are more efficient and deployment\-friendly, yet they struggle to simultaneously achieve precise concept erasure and preserve overall generative capacity\. We identify this core limitation of the editing\-based methods as reliance on additive parameter updates\. Our empirical analysis reveals that concept semantics primarily depend on*neuron direction*rather than*neuron magnitude*, while overall generative capacity relies on the*angular geometry*of neurons\. As additive updates inherently entangle direction, magnitude, and angular geometry, they inevitably introduce unintended interference between concept erasure and overall generation performance\. To address this, we proposeOrthogonal Concept Erasure \(OCE\), which reformulates editing\-based erasure as multiplicative parameter updates from a geometric perspective\. Specifically, OCE applies layer\-wise orthogonal transformations derived from a closed\-form solution to the parameters, enabling precise concept erasure while preserving the neuron magnitude and angular geometry\. Furthermore, to address conflicting constraints in multi\-concept erasure, OCE introduces a subspace\-level objective with structured subspace manipulation, yielding a more effective and scalable erasure\. Extensive experiments on single\- and multi\-concept erasure demonstrate that OCE outperforms existing methods in concept erasure and non\-target preservation, erasing up to 100 concepts in 4\.3 s\. Code:[https://github\.com/HansSunY/OCE](https://github.com/HansSunY/OCE)\.

Machine Learning, ICML

![[Uncaptioned image]](https://arxiv.org/html/2605.28902v1/x1.png)

Figure 1:Our proposed method, OCE, achieves strong performance in both single\-concept and multi\-concept erasure\. \(a\) OCE effectively removestarget conceptsin object and artistic style erasure while preservingnon\-target concepts\. \(b\) OCE supports efficient large\-scale multi\-concept erasure of up to100 conceptsat once, requiring only4\.3 s\. \(c\) OCE reformulates previous additive editing as multiplicative orthogonal editing, introducing a new editing\-based concept erasure paradigm\.
## 1Introduction

Text\-to\-image \(T2I\) diffusion models\(Hoet al\.,[2020](https://arxiv.org/html/2605.28902#bib.bib1); Songet al\.,[2020a](https://arxiv.org/html/2605.28902#bib.bib3),[b](https://arxiv.org/html/2605.28902#bib.bib2); Rombachet al\.,[2022](https://arxiv.org/html/2605.28902#bib.bib4); Ho and Salimans,[2022](https://arxiv.org/html/2605.28902#bib.bib5); Nicholet al\.,[2021](https://arxiv.org/html/2605.28902#bib.bib6); Rameshet al\.,[2022](https://arxiv.org/html/2605.28902#bib.bib7)\)have achieved remarkable performance in synthesizing high\-fidelity and diverse images from text prompts\. However, their impressive generative capacity can lead to the generation of undesirable concepts, including copyrighted content\(Jianget al\.,[2023](https://arxiv.org/html/2605.28902#bib.bib8)\), offensive or sensitive visual attributes\(Schramowskiet al\.,[2023](https://arxiv.org/html/2605.28902#bib.bib9); Zhanget al\.,[2024c](https://arxiv.org/html/2605.28902#bib.bib10)\), and identity\-related information\(Carliniet al\.,[2023](https://arxiv.org/html/2605.28902#bib.bib11); Mirsky and Lee,[2021](https://arxiv.org/html/2605.28902#bib.bib12)\), raising concerns about safety, ethics, and privacy\. Consequently, concept erasure has emerged as a critical research direction\.

The task of concept erasure involves precisely removing the specific concepts from the pretrained model while preserving the prior concepts\. Existing concept erasure techniques can be broadly categorized into inference\-time intervention, training\-based and editing\-based methods\. Inference\-time intervention methods\(Schramowskiet al\.,[2023](https://arxiv.org/html/2605.28902#bib.bib9); Yoonet al\.,[2024](https://arxiv.org/html/2605.28902#bib.bib17); Wanget al\.,[2025](https://arxiv.org/html/2605.28902#bib.bib18)\)adjust sampling trajectories without modifying model parameters, but they are vulnerable to bypassing\. Training\-based methods\(Kumariet al\.,[2023](https://arxiv.org/html/2605.28902#bib.bib13); Gandikotaet al\.,[2023](https://arxiv.org/html/2605.28902#bib.bib14); Lyuet al\.,[2024](https://arxiv.org/html/2605.28902#bib.bib15); Luet al\.,[2024](https://arxiv.org/html/2605.28902#bib.bib16)\)achieve concept erasure by fine\-tuning a subset of model parameters with carefully designed objectives to remove target concepts\. While effective, these approaches typically require multiple rounds of optimization, leading to substantial computational time and overhead, which significantly limits their practicality in real\-world scenarios\. By contrast, editing\-based methods\(Gandikotaet al\.,[2024](https://arxiv.org/html/2605.28902#bib.bib19); Gonget al\.,[2024](https://arxiv.org/html/2605.28902#bib.bib20); Liet al\.,[2025c](https://arxiv.org/html/2605.28902#bib.bib21)\)operate directly on model parameters, such as the projection weights in cross\-attention layers, using closed\-form solutions\. This makes editing\-based methods more efficient and scalable to real\-world scenarios like multi\-concept erasure\. However, despite their efficiency, existing editing\-based methods still suffer from several limitations\. In particular, they often exhibit insufficient erasure precision, struggle to reliably preserve prior concepts, and often rely on relatively complex erasure pipelines, which makes them less concise and principled than desired\.

In this work, we argue that the core limitation of existing editing\-based methods\(Gandikotaet al\.,[2024](https://arxiv.org/html/2605.28902#bib.bib19); Gonget al\.,[2024](https://arxiv.org/html/2605.28902#bib.bib20); Liet al\.,[2025c](https://arxiv.org/html/2605.28902#bib.bib21)\)lies in how concept erasure is formulated\. Most existing methods define concept erasure as additive updates of model parametersW\+ΔW\+\\Delta, as shown in Fig\.[1](https://arxiv.org/html/2605.28902#S0.F1)\(c\)\. While simple and flexible, such additive updates simultaneously perturb neurons in both magnitude and direction\. More critically, even small additive changes can arbitrarily alter the angular geometry of neurons, including inter\-neuron angles and correlations\. Inspired by previous studies on hyperspherical energy\(Liuet al\.,[2017](https://arxiv.org/html/2605.28902#bib.bib22),[2018](https://arxiv.org/html/2605.28902#bib.bib23); Chenet al\.,[2020](https://arxiv.org/html/2605.28902#bib.bib24); Qiuet al\.,[2023](https://arxiv.org/html/2605.28902#bib.bib25)\), we design a toy experiment, which reveals that \(1\)*neuron direction*is crucial for encoding concept semantics, whereas magnitude has little effect, and \(2\) preserving the*angular geometry*is essential for maintaining the model’s overall generative capabilities\. Consequently, additive updates inevitably entangle direction, magnitude, and angular geometry, resulting in unstable erasure and poor preservation of prior concepts\.This motivates a geometric perspective on concept erasure: rather than performing unconstrained additive corrections, effective erasure should directly manipulate neuron directions, which encode concept semantics, while preserving their magnitudes and intrinsic angular geometry\.

From this geometric perspective, we proposeOrthogonalConceptErasure \(OCE\) for diffusion model\. Instead of additive updates, OCE applies a layer\-wise orthogonal transformation derived from a closed\-form solution to model parameters, precisely rotating neuron directions for concept erasure while preserving the neuron magnitude and angular geometry\. Furthermore, to address conflicting constraints in multi\-concept erasure, we extend the vector\-wise erasure objective to a subspace\-level projection objective\. Specifically, we represent the target and anchor concepts as their respective subspaces and then minimize the components of the target subspace that lie outside the orthogonal complement of the anchor subspace, which provides a more structured and gentle form of concept erasure\. By explicitly controlling neuron directions through a multiplicative and structured update, OCE achieves precise erasure of target concepts while preserving the generation ability of non\-target concepts\. It provides a principled, and efficient closed\-form solution for both single\- and multi\-concept erasure scenarios\. The main contributions of this work are summarized as follows:

- •We propose OCE, a geometry\-driven, editing\-based method that performs concept erasure via layer\-wise orthogonal transformations, enabling precise removal of target concepts while preserving the intrinsic angular geometry of pretrained models\.
- •We introduce a structured subspace\-level projection objective with closed\-form solutions\. By extending vector\-wise erasure to a subspace projection formulation, OCE provides a principled, scalable, and efficient solution for both single\- and multi\-concept erasure, achieving improved erasure precision and prior concept preservation\.
- •Extensive experiments across diverse erasure tasks, demonstrating that OCE consistently outperforms existing training\-based and editing\-based methods in terms of erasure effectiveness and overall generation quality\.

## 2Related Work

### 2\.1Concept Erasure in Diffusion Models

Text\-to\-image diffusion models can generate copyrighted, offensive and privacy content, raising serious concerns for real\-world deployment\. To better regulate the content generated by these models, early approaches focused on retraining with curated datasets\(Rombach,[2022](https://arxiv.org/html/2605.28902#bib.bib26)\)or introducing output filters\(Randoet al\.,[2022](https://arxiv.org/html/2605.28902#bib.bib27)\)to block undesirable generations\. However, retraining is computationally expensive, while output filters can be easily bypassed\. This has led to increasing interest in concept erasure, which selectively removes target concepts from models, categorized into three paradigms\. Inference\-time intervention methods\(Schramowskiet al\.,[2023](https://arxiv.org/html/2605.28902#bib.bib9); Yoonet al\.,[2024](https://arxiv.org/html/2605.28902#bib.bib17); Jainet al\.,[2024](https://arxiv.org/html/2605.28902#bib.bib36); Wanget al\.,[2025](https://arxiv.org/html/2605.28902#bib.bib18)\)modify sampling trajectories or guidance signals during generation to suppress undesired concepts\. Training\-based methods\(Kumariet al\.,[2023](https://arxiv.org/html/2605.28902#bib.bib13); Gandikotaet al\.,[2023](https://arxiv.org/html/2605.28902#bib.bib14); Zhanget al\.,[2024a](https://arxiv.org/html/2605.28902#bib.bib28); Lyuet al\.,[2024](https://arxiv.org/html/2605.28902#bib.bib15); Luet al\.,[2024](https://arxiv.org/html/2605.28902#bib.bib16); Zhanget al\.,[2024b](https://arxiv.org/html/2605.28902#bib.bib30); Srivatsanet al\.,[2025](https://arxiv.org/html/2605.28902#bib.bib31); Nguyenet al\.,[2025](https://arxiv.org/html/2605.28902#bib.bib32); Sunet al\.,[2025](https://arxiv.org/html/2605.28902#bib.bib58)\)fine\-tune model parameters with specific erasure objectives and regularization terms to preserve general generation quality\. Editing\-based method\(Orgadet al\.,[2023](https://arxiv.org/html/2605.28902#bib.bib34); Gandikotaet al\.,[2024](https://arxiv.org/html/2605.28902#bib.bib19); Gonget al\.,[2024](https://arxiv.org/html/2605.28902#bib.bib20); Liet al\.,[2025c](https://arxiv.org/html/2605.28902#bib.bib21); Linet al\.,[2025](https://arxiv.org/html/2605.28902#bib.bib35)\)directly modify model parameters with closed\-form updates for more efficient concept removal\. In contrast to existing additive update paradigms in editing\-based approaches, our method reformulates the concept erasure paradigm as an orthogonal transformation from a geometric perspective, enabling more structured and effective concept erasure\.

### 2\.2Orthogonalization in Parameter Space

Orthogonalization has been widely explored in parameter space as a principled way to fine\-tuning models and reducing interference across objectives\. In continual and multi\-task learning, orthogonal gradient methods\(Bennaniet al\.,[2020](https://arxiv.org/html/2605.28902#bib.bib37); Farajtabaret al\.,[2020](https://arxiv.org/html/2605.28902#bib.bib38)\)project gradients onto the orthogonal complement of past task subspaces or decompose updates to mitigate negative transfer between tasks\. More recently, orthogonal transformations have been introduced for parameter\-efficient fine\-tuning\. Orthogonal Fine\-Tuning \(OFT\)\(Qiuet al\.,[2023](https://arxiv.org/html/2605.28902#bib.bib25)\)and its variants\(Liuet al\.,[2023](https://arxiv.org/html/2605.28902#bib.bib39); Maet al\.,[2024](https://arxiv.org/html/2605.28902#bib.bib40); Qiuet al\.,[2025](https://arxiv.org/html/2605.28902#bib.bib41)\)apply structured orthogonal transformations to weight matrices, preserving the intrinsic pretrained structure\. Despite their success, existing parameter\-space orthogonalization methods primarily focus on stabilizing training dynamics or text\-to\-image customization\. In contrast, our work leverages orthogonal transform as a geometric tool for closed\-form concept erasure, enabling targeted concept suppression through a structured transformation in parameter space, while maintaining overall generation capacity of the model\.

## 3Geometric Analysis of Concept Erasure

In this section, we analyze concept erasure from a geometric perspective and argue that its effectiveness and stability are fundamentally governed by the angular structure of the neuron\. Our analysis is centered around two key claims:\(C1\)Concept semantics in diffusion models are more strongly associated with neuron directions than with magnitudes\.\(C2\)Preserving inter\-neuron angular geometry is critical for maintaining the overall generation quality\.

### 3\.1Concept Expression in Neuron Angular Geometry

We analyze controlled geometric transformations of the cross\-attention projection matrices in diffusion models to disentangle the roles of magnitude, direction, and inter\-neuron geometry in concept expression\. Specifically, we focus on the key and value matrices, denoted asWW, which play a central role in mapping text embeddings to visual semantics\. We design three controlled modification ofWW:

Case A: Magnitude\-only scaling\.We scale the weights by a scalar factorα\\alphaas

W~=α​W,α∈\(0,1\),\\tilde\{W\}=\\alpha W,\\quad\\alpha\\in\(0,1\),\(1\)which changes neuron magnitudes only\.

![Refer to caption](https://arxiv.org/html/2605.28902v1/x2.png)Figure 2:A toy experiment to demonstrate the importance of the angular information for concept expression in diffusion models\.Case B: Neuron\-wise orthogonal rotation\.We apply independent orthogonal transformations to individual neurons:

w~i=Qi​wi,Qi⊤​Qi=I,\\tilde\{w\}\_\{i\}=Q\_\{i\}w\_\{i\},\\quad Q\_\{i\}^\{\\top\}Q\_\{i\}=I,\(2\)wherewiw\_\{i\}denotes theii\-th column ofWWand eachQiQ\_\{i\}is a neuron\-wise orthogonal matrix\. This preserves neuron magnitudes but breaks the directions and inter\-neuron angular geometry\.

Case C: Layer\-wise orthogonal rotation\.We apply a shared orthogonal transformationQQto the entire layer:

W~=Q​W,Q⊤​Q=I,\\tilde\{W\}=QW,\\quad Q^\{\\top\}Q=I,\(3\)which changes neuron directions only\.

We evaluate the effects of these three modifications on the generation of the concept “cat” as shown in Fig\.[2](https://arxiv.org/html/2605.28902#S3.F2)\. For Case A, scaling the weights byα=0\.5\\alpha=0\.5has negligible effect on the generation of concept “cat”\. By contrast, applying a small layer\-wise orthogonal rotation \(Case C\) results in a clear semantic shift of the “cat” concept in the generated images, confirming that neuron directions are the primary carriers of concept semantics \(C1\)\. In addition, applying independent neuron\-wise rotations \(Case B\) substantially degrades overall image quality, highlighting that maintaining the relative angular geometry among neurons is crucial for generation capability \(C2\)\. These observations are also consistent with prior studies on hyperspherical energy\(Liuet al\.,[2017](https://arxiv.org/html/2605.28902#bib.bib22),[2018](https://arxiv.org/html/2605.28902#bib.bib23); Chenet al\.,[2020](https://arxiv.org/html/2605.28902#bib.bib24); Qiuet al\.,[2023](https://arxiv.org/html/2605.28902#bib.bib25)\)\.

### 3\.2Additive vs\. Orthogonal Updates

Based on the above discussion, we analyze why layer\-wise orthogonal update is a principled choice for concept erasure\.

Additive updates\.An additive update takes the form

W∗=W\+Δ,wi∗=wi\+δi,W^\{\*\}=W\+\\Delta,\\quad w\_\{i\}^\{\*\}=w\_\{i\}\+\\delta\_\{i\},\(4\)for each neuronwiw\_\{i\}\. These updates primarily rotate neuron directionsc​o​s​θi∗≠cos⁡θicos\\theta\_\{i\}^\{\*\}\\neq\\cos\\theta\_\{i\}, but also alter neuron magnitudes‖wi∗‖≠‖wi‖\\\|w\_\{i\}^\{\*\}\\\|\\neq\\\|w\_\{i\}\\\|and inter\-neuron anglescos⁡ϕi​j∗≠cos⁡ϕi​j\\cos\\phi\_\{ij\}^\{\*\}\\neq\\cos\\phi\_\{ij\}\. These combined effects explain why additive updates often fail to precisely erase target concepts and why they can disrupt non\-target concepts by disturbing the angular geometry that encodes semantic structure\. As a result, additive updates often fail to precisely remove target concepts, since magnitude changes contribute little to semantic erasure, and they also degrade the preservation of non\-target concepts due to changes in angular geometry\.

Layer\-wise orthogonal updates\.A layer\-wise orthogonal update takes the form

W∗=Q​W,Q⊤​Q=I,wi∗=Q​wi\.W^\{\*\}=QW,\\quad Q^\{\\top\}Q=I,\\quad w\_\{i\}^\{\*\}=Qw\_\{i\}\.Such a multiplicative update rotates all neurons in the layer by a shared orthogonal transformation\. As a result, neuron magnitudes are preserved,‖wi∗‖=‖wi‖\\\|w\_\{i\}^\{\*\}\\\|=\\\|w\_\{i\}\\\|, and pairwise inter\-neuron angles remain unchanged,cos⁡ϕi​j∗=cos⁡ϕi​j\\cos\\phi\_\{ij\}^\{\*\}=\\cos\\phi\_\{ij\}\. The update modifies directions in a structured manner, without introducing changes to magnitude or angular geometry\.

In summary, additive updates entangle changes in direction, magnitude, and angular geometry, making precise erasure and preservation of non\-target concepts difficult\. In contrast, layer\-wise orthogonal updates modify neuron directions while strictly preserving magnitudes and angular geometry, which better matches the geometric structure underlying semantic representations\. This makes orthogonal transformations a more principled choice for concept erasure\.

## 4Problem Formulation

In text\-to\-image diffusion models, semantic concepts are encoded through text embeddingsccprocessed by a pretrained CLIP text encoder\. During image generation, these embeddings are injected into the denoising U\-Net with cross\-attention layers, where the key and value projection matrices, denoted asWkW\_\{k\}andWvW\_\{v\}, play a central role in shaping semantic alignment\. For simplicity, we denote a generic projection matrix asWW\.

### 4\.1Additive Closed\-Form Concept Erasure

Editing\-based concept erasure aims to modify the projection matrixWWsuch that target concepts are suppressed while non\-target concepts remain unaffected\. Following prior approaches, we consider three sets of concept embeddings: an erasure setℰ=\{c1\(i\)\}i=1NE\\mathcal\{E\}=\\\{c^\{\(i\)\}\_\{1\}\\\}\_\{i=1\}^\{N\_\{E\}\}containing target concepts to be removed, a retain setℛ=\{c0\(j\)\}j=1NR\\mathcal\{R\}=\\\{c^\{\(j\)\}\_\{0\}\\\}\_\{j=1\}^\{N\_\{R\}\}containing non\-target concepts to be preserved, and an anchor set𝒜=\{c∗\(i\)\}i=1NE\\mathcal\{A\}=\\\{c^\{\(i\)\}\_\{\\ast\}\\\}\_\{i=1\}^\{N\_\{E\}\}, where each target conceptc1\(i\)c^\{\(i\)\}\_\{1\}is mapped onto an anchor conceptc∗\(i\)c^\{\(i\)\}\_\{\\ast\}that serves as a semantically similar surrogate\. We stack the embeddings of the erasure, anchor and retain sets into matricesC1C\_\{1\},C∗C\_\{\\ast\}andC0C\_\{0\}respectively\. Additive erasure methods, with UCE as a representative example, optimize a parameter perturbationΔ\\DeltaonWWthrough the following least\-squares formulation:

minΔ⁡‖\(W\+Δ\)​C1−W​C∗‖F2\+‖\(W\+Δ\)​C0−W​C0‖F2\.\\min\_\{\\Delta\}\\,\\\|\(W\+\\Delta\)C\_\{1\}\-WC\_\{\\ast\}\\\|\_\{F\}^\{2\}\+\\\|\(W\+\\Delta\)C\_\{0\}\-WC\_\{0\}\\\|\_\{F\}^\{2\}\.\(5\)The first term enforces effective erasure by mapping each target concept to its anchor, while the second term is designed to preserve non\-target concept\. This objective provides a closed\-form solution, enabling efficient parameter updates without iterative optimization\. Despite its simplicity and efficiency, as shown in Sec\. 3, additive erasure suffers from intrinsic geometric limitations, which constrain both erasure precision and the preservation of non\-target concepts\.

### 4\.2From Additive to Orthogonal Erasure

Motivated by the geometric analysis in Sec\. 3, we reformulate concept erasure from additive parameter perturbations to multiplicative orthogonal transformations\. We seek an orthogonal matrixPPthat modifies the projection space in a structured manner\. This leads to the following simple orthogonal erasure objective:

minP⊤​P=I⁡‖P​W​C1−W​C∗‖F2\+‖P​W​C0−W​C0‖F2\.\\min\_\{P^\{\\top\}P=I\}\\,\\\|PWC\_\{1\}\-WC\_\{\\ast\}\\\|\_\{F\}^\{2\}\+\\\|PWC\_\{0\}\-WC\_\{0\}\\\|\_\{F\}^\{2\}\.\(6\)
Compared to additive formulations, this orthogonal perspective provides a geometrically principled foundation for concept erasure, aligning with the observations in Sec\. 3 that concept semantics are direction\-sensitive and that preserving angular geometry is critical for maintaining generation quality\. The solution and further refinement of this objective are presented in the method section\.

## 5Orthogonal Concept Erasure \(OCE\)

### 5\.1Closed\-Form Solution for Orthogonal Update

##### Solving the Vector\-wise Objective\.

After defining the initial vector\-wise objective for orthogonal erasure in Eq\.[6](https://arxiv.org/html/2605.28902#S4.E6), we first discuss how to solve this problem in closed form\.

First, we stack the two terms in Eq\.[6](https://arxiv.org/html/2605.28902#S4.E6)into block matrices:

A=\[W​C1,W​C0\],B=\[W​C∗,W​C0\]\.A=\[WC\_\{1\},\\;WC\_\{0\}\],\\quad B=\[WC\_\{\\ast\},\\;WC\_\{0\}\]\.\(7\)The objective is equivalently written as

minP⊤​P=I⁡‖P​A−B‖F2\.\\min\_\{P^\{\\top\}P=I\}\\\|PA\-B\\\|\_\{F\}^\{2\}\.\(8\)This is equivalent to the following trace maximization:

maxP⊤​P=I⁡tr​\(P​M\),M=B​A⊤\.\\max\_\{P^\{\\top\}P=I\}\\;\\mathrm\{tr\}\(PM\),\\qquad M=BA^\{\\top\}\.\(9\)Detailed derivation is shown in Appendix\.[A\.1](https://arxiv.org/html/2605.28902#A1.SS1)\. The closed\-form solution follows from the classical Procrustes theorem\(Schönemann,[1966](https://arxiv.org/html/2605.28902#bib.bib56)\)\. Specifically, we compute the cross\-covariance matrixMMand perform its Singular Value Decomposition \(SVD\):

M=U​Σ​V⊤,M=U\\Sigma V^\{\\top\},\(10\)from which the optimal orthogonal transformation is

##### Construction ofMM\.

When calculatingMM, we expand it in terms of the original blocks gives

M\\displaystyle M=\(W​C∗\)​\(W​C1\)⊤\+\(W​C0\)​\(W​C0\)⊤\\displaystyle=\(WC\_\{\\ast\}\)\(WC\_\{1\}\)^\{\\top\}\+\(WC\_\{0\}\)\(WC\_\{0\}\)^\{\\top\}\(12\)=W​\(C∗​C1⊤\+C0​C0⊤\)​W⊤\.\\displaystyle=W\(C\_\{\\ast\}C\_\{1\}^\{\\top\}\+C\_\{0\}C\_\{0\}^\{\\top\}\)W^\{\\top\}\.\(13\)For the term involvingC0​C0⊤C\_\{0\}C\_\{0\}^\{\\top\}, we further decompose the retain set into two subsets: a set of generic conceptsℛg\\mathcal\{R\}\_\{g\}that should be universally preserved, and a set of local neighboring conceptsℛn\\mathcal\{R\}\_\{n\}that are specific to the current erasure task\. Correspondingly, we writeC0=\[Cg,Cn\]C\_\{0\}=\[\\,C\_\{g\},\\;C\_\{n\}\\,\], whereCgC\_\{g\}andCnC\_\{n\}stack the embeddings fromℛg\\mathcal\{R\}\_\{g\}andℛn\\mathcal\{R\}\_\{n\}, respectively\. The cross\-covariance term then becomes

W​\(C0​C0⊤\)​W⊤=W​\(Cg​Cg⊤\+Cn​Cn⊤\)​W⊤\.W\(C\_\{0\}C\_\{0\}^\{\\top\}\)W^\{\\top\}=W\(C\_\{g\}C\_\{g\}^\{\\top\}\+C\_\{n\}C\_\{n\}^\{\\top\}\)W^\{\\top\}\.\(14\)The generic preservation termCg​Cg⊤C\_\{g\}C\_\{g\}^\{\\top\}is independent of the target concept and can therefore be precomputed once and reused across different erasure tasks\. Concretely, we compute all the token embeddingsccover the COCO\-30k dataset and precompute

K0=Cg​Cg⊤=𝔼c​\[c​c⊤\],K\_\{0\}=C\_\{g\}C\_\{g\}^\{\\top\}=\\mathbb\{E\}\_\{c\}\[cc^\{\\top\}\],\(15\)which serves as a global preservation prior shared by all erasure instances\. Finally,MMcan be written as

M=W​\(C∗​C1⊤\+K0\+Cn​Cn⊤\)​W⊤\.M=W\(C\_\{\\ast\}C\_\{1\}^\{\\top\}\+K\_\{0\}\+C\_\{n\}C\_\{n\}^\{\\top\}\)W^\{\\top\}\.\(16\)This objective achieves strict vector\-wise alignment between target embeddings and anchors for erasure and preservation of the generation capacity\. It is effective for single\-concept erasure\. However, in multi\-concept erasure scenario, simultaneously enforcing exact alignment for multiple targets introduces conflicting constraints, which can interfere with each other and degrade erasure performance\. This motivates a more global, subspace\-level erasure objective\.

### 5\.2Subspace\-level Erasure Objective

Instead of enforcing vector\-wise alignment between target and anchor embeddings, we further reformulate concept erasure as a geometric subspace suppression problem, where erase directions are discouraged from lying in the orthogonal complement of the anchor subspace\. This formulation better respects the intrinsic geometry of the latent space and enables a more structured erasure\.

##### Target and Anchor Subspaces\.

We first define the target and anchor subspaces in the transformed feature space induced byWW\. Given sets of concept embeddingsC1C\_\{1\}andC∗C\_\{\*\}, we first map each embedding throughWWand normalize the resulting vectors\. Then, an orthonormal basis for each subspace is obtained via Gram–Schmidt orthogonalization\. Formally, we denoteG=orth​\(W​C1\)G=\\mathrm\{orth\}\(WC\_\{1\}\)andG∗=orth​\(W​C∗\)G\_\{\\ast\}=\\mathrm\{orth\}\(WC\_\{\*\}\)

whereorth​\(⋅\)\\mathrm\{orth\}\(\\cdot\)returns an orthonormal basis spanning the column space of its argument\. The corresponding orthogonal projection matrices are given by

R=G​G⊤,R∗=G∗​G∗⊤\.R=GG^\{\\top\},\\qquad R\_\{\\ast\}=G\_\{\\ast\}G\_\{\\ast\}^\{\\top\}\.\(17\)
Table 1:Evaluation of Object Erasure on CIFAR\-10\. We reportA​c​ceAcc\_\{e\}\(lower indicates more successful erasure\),A​c​csAcc\_\{s\}\(higher indicates better preservation of non\-target concepts\) andHoH\_\{o\}\(higher indicates better overall performance\) across the first five categories\. All values are expressed as percentages \(%\)\. The original model’s performance is included for reference\.ConceptAirplaneAutomobileBirdCatDeerAverageAcce↓\\text\{Acc\}\_\{e\}\\downarrowAccs↑\\text\{Acc\}\_\{s\}\\uparrow\\cellcolororange\!40Ho↑H\_\{o\}\\uparrowAcce↓\\text\{Acc\}\_\{e\}\\downarrowAccs↑\\text\{Acc\}\_\{s\}\\uparrow\\cellcolororange\!40Ho↑H\_\{o\}\\uparrowAcce↓\\text\{Acc\}\_\{e\}\\downarrowAccs↑\\text\{Acc\}\_\{s\}\\uparrow\\cellcolororange\!40Ho↑H\_\{o\}\\uparrowAcce↓\\text\{Acc\}\_\{e\}\\downarrowAccs↑\\text\{Acc\}\_\{s\}\\uparrow\\cellcolororange\!40Ho↑H\_\{o\}\\uparrowAcce↓\\text\{Acc\}\_\{e\}\\downarrowAccs↑\\text\{Acc\}\_\{s\}\\uparrow\\cellcolororange\!40Ho↑H\_\{o\}\\uparrowAcce↓\\text\{Acc\}\_\{e\}\\downarrowAccs↑\\text\{Acc\}\_\{s\}\\uparrow\\cellcolororange\!40Ho↑H\_\{o\}\\uparrowSD v1\.496\.0698\.92\\cellcolororange\!407\.5895\.7598\.95\\cellcolororange\!408\.1599\.7298\.51\\cellcolororange\!400\.5698\.9398\.60\\cellcolororange\!402\.1299\.8798\.49\\cellcolororange\!400\.2698\.0798\.69\\cellcolororange\!403\.79CA96\.2498\.55\\cellcolororange\!407\.2494\.4198\.47\\cellcolororange\!4010\.5899\.5598\.53\\cellcolororange\!400\.9098\.9498\.63\\cellcolororange\!402\.1099\.4598\.47\\cellcolororange\!401\.0997\.7298\.53\\cellcolororange\!404\.46ESD7\.3885\.48\\cellcolororange\!4088\.9130\.2991\.02\\cellcolororange\!4078\.9513\.1786\.17\\cellcolororange\!4086\.5011\.7791\.45\\cellcolororange\!4089\.8118\.1473\.81\\cellcolororange\!4077\.6316\.1585\.59\\cellcolororange\!4084\.71FMN96\.7698\.32\\cellcolororange\!406\.2795\.0896\.86\\cellcolororange\!409\.3699\.4698\.13\\cellcolororange\!401\.0794\.8997\.97\\cellcolororange\!409\.7198\.9594\.13\\cellcolororange\!402\.0897\.0397\.08\\cellcolororange\!405\.77UCE40\.3298\.79\\cellcolororange\!4074\.414\.7399\.02\\cellcolororange\!4097\.1110\.7198\.35\\cellcolororange\!4093\.602\.3598\.02\\cellcolororange\!4097\.8311\.8898\.39\\cellcolororange\!4092\.9714\.0098\.51\\cellcolororange\!4091\.83MACE9\.0695\.39\\cellcolororange\!4093\.116\.9795\.18\\cellcolororange\!4094\.099\.8897\.45\\cellcolororange\!4093\.642\.2298\.85\\cellcolororange\!4098\.3113\.4797\.71\\cellcolororange\!4091\.788\.3296\.92\\cellcolororange\!4094\.23RECE15\.0198\.76\\cellcolororange\!4091\.3636\.1399\.04\\cellcolororange\!4077\.667\.9598\.01\\cellcolororange\!4094\.9415\.7698\.51\\cellcolororange\!4090\.829\.6298\.27\\cellcolororange\!4094\.1616\.8998\.52\\cellcolororange\!4090\.16SPEED57\.7898\.74\\cellcolororange\!4059\.1510\.3398\.52\\cellcolororange\!4093\.8938\.4898\.23\\cellcolororange\!4075\.660\.3698\.57\\cellcolororange\!4099\.102\.1798\.31\\cellcolororange\!4098\.0721\.8298\.47\\cellcolororange\!4087\.16Ours6\.8998\.85\\cellcolororange\!4095\.898\.7999\.07\\cellcolororange\!4094\.983\.4098\.30\\cellcolororange\!4097\.440\.1598\.71\\cellcolororange\!4099\.283\.8198\.45\\cellcolororange\!4097\.314\.6198\.68\\cellcolororange\!4097\.01

Table 2:Evaluation of Artistic Style Erasure\. We report CLIP Score \(CS\) for each style generation, where lower CS indicates better erasure\. We also report FID and CS on MSCOCO\-30k, where lower FID and higher CS indicate better preservation quality\.MethodVan GoghPicassoMonetCS↓\\downarrowMSCOCOCS↓\\downarrowMSCOCOCS↓\\downarrowMSCOCOFID↓\\downarrowCS↑\\uparrowFID↓\\downarrowCS↑\\uparrowFID↓\\downarrowCS↑\\uparrowCA25\.018\.1126\.2626\.337\.4026\.2824\.207\.0226\.25ESD20\.1013\.6026\.2921\.1012\.2025\.8219\.7313\.5125\.87FMN22\.3613\.8526\.0621\.2414\.6326\.0025\.2314\.2526\.09UCE21\.2214\.5326\.4522\.847\.6526\.2423\.1714\.4425\.77MACE23\.4912\.2826\.2123\.7712\.0826\.2421\.7012\.2826\.25RECE22\.788\.0526\.4924\.017\.6726\.2222\.308\.6525\.79SPEED21\.687\.4926\.2023\.247\.5226\.3121\.917\.5126\.27Ours21\.087\.1526\.5220\.928\.5726\.4621\.357\.3026\.32

##### Subspace Suppression\.

Instead of enforcing point\-wise alignment betweenW​C1WC\_\{1\}andW​C∗WC\_\{\\ast\}, we consider a subspace\-level suppression objective that discourages the transformed target subspace from collapsing into the orthogonal complement of the anchor subspace\. Formally, we seek an orthogonal transformationPPthat pushes the target subspace away from directions orthogonal to the anchor space, while preserving the retain setC0C\_\{0\}through vector\-wise constraints:

minP⊤​P=I−‖P​R−R∗,⟂‖F2\+‖P​W​C0−W​C0‖F2,\\min\_\{P^\{\\top\}P=I\}\-\\\|PR\-R\_\{\\ast,\\perp\}\\\|\_\{F\}^\{2\}\+\\\|PWC\_\{0\}\-WC\_\{0\}\\\|\_\{F\}^\{2\},\(18\)whereR∗,⟂=I−R∗R\_\{\\ast,\\perp\}=I\-R\_\{\\ast\}is the projector onto the orthogonal complement of the anchor subspace\.

This objective can then be equivalently rewritten as a trace maximization:

maxP⊤​P=I⁡tr​\(P​Mtotal\),\\max\_\{P^\{\\top\}P=I\}\\mathrm\{tr\}\(PM\_\{\\mathrm\{total\}\}\),\(19\)whereMtotalM\_\{\\mathrm\{total\}\}combines contributions from target suppression and local retain:

Mtotal=−R​\(I−R∗\)\+W​\(K0\+Cn​Cn⊤\)​W⊤\.M\_\{\\mathrm\{total\}\}=\-R\(I\-R\_\{\\ast\}\)\+W\(K\_\{0\}\+C\_\{n\}C\_\{n\}^\{\\top\}\)W^\{\\top\}\.\(20\)Detailed derivation is shown in Appendix\.[A\.2](https://arxiv.org/html/2605.28902#A1.SS2)\. Here, we still writeC0=\[Cg,Cn\]C\_\{0\}=\[C\_\{g\},C\_\{n\}\]as Eq\.[14](https://arxiv.org/html/2605.28902#S5.E14)and incorporate the global preservation priorK0K\_\{0\}to enforce global retention\.

Eq\.[18](https://arxiv.org/html/2605.28902#S5.E18)also corresponds to a classical orthogonal Procrustes problem\. Following the same procedure as Eq\.[10](https://arxiv.org/html/2605.28902#S5.E10)and[11](https://arxiv.org/html/2605.28902#S5.E11), we perform SVD onMtotalM\_\{\\mathrm\{total\}\}and compute the closed\-form solution asP=U​V⊤P=UV^\{\\top\}\.

The proposed subspace\-level formulation generalizes vector\-wise erasure by lifting the objective from vector\-wise alignment to subspace energy suppression\. This design mitigates interference among multiple erased concepts and enables a unified closed\-form solution, balancing targeted erasure with global representation stability\.

Notably, we impose erasure at the subspace level while enforcing preservation in a vector\-wise manner\. This asymmetric design is intentional and principled\. Compared with erasure, preservation requires finer\-grained control\. Vector\-wise preservation ensures high\-fidelity retention of individual non\-target concepts and does not suffer from cross\-concept interference\. Using subspace\-level constraints for preservation would be unnecessarily restrictive\. Our ablation studies in Sec\.[6\.4](https://arxiv.org/html/2605.28902#S6.SS4)further confirm that this asymmetric formulation achieves a better balance between effective erasure and faithful preservation\.

## 6Experiments

In this section, we evaluate OCE on three representative concept erasure settings: single concept erasure, multi\-concept erasure and implicit concept erasure \(Appendix\.[6\.3](https://arxiv.org/html/2605.28902#S6.SS3)\)\. We compare OCE with several commonly adopted baselines, including CA\(Kumariet al\.,[2023](https://arxiv.org/html/2605.28902#bib.bib13)\), ESD\(Gandikotaet al\.,[2023](https://arxiv.org/html/2605.28902#bib.bib14)\), FMN\(Zhanget al\.,[2024a](https://arxiv.org/html/2605.28902#bib.bib28)\), UCE\(Gandikotaet al\.,[2024](https://arxiv.org/html/2605.28902#bib.bib19)\), MACE\(Luet al\.,[2024](https://arxiv.org/html/2605.28902#bib.bib16)\), RECE\(Gonget al\.,[2024](https://arxiv.org/html/2605.28902#bib.bib20)\), SPEED\(Liet al\.,[2025c](https://arxiv.org/html/2605.28902#bib.bib21)\)\. Our evaluation measures bothefficacyandspecificity:efficacymeasures the target concept erasure, whilespecificitymeasures the preservation of unrelated concepts\. All experiments are conducted on111[https://github\.com/CompVis/stable\-diffusion](https://github.com/CompVis/stable-diffusion)SD v1\.4\. More implementation details can be found in Appendix\.[C](https://arxiv.org/html/2605.28902#A3)\.

### 6\.1Single Concept Erasure

![Refer to caption](https://arxiv.org/html/2605.28902v1/x3.png)Figure 3:Quantitative comparison of the Multi\-Concept Erasure in erasing celebrities\. Red boxes indicate the erased target celebrity, while green boxes denote the preserved non\-target celebrity\. Our method achieves precise erasure of up to 100 concepts while effectively preserving non\-target concepts\. Notably, our method exhibits stable performance across varying erasure scales\.Table 3:Evaluation of Multi\-Concept Erasure for celebrity with 10, 50, 100 target concepts\. We reportA​c​ceAcc\_\{e\},A​c​crAcc\_\{r\},HoH\_\{o\}and Time \(s\)\. Our method consistently achieves strong erasure performance across different scales, effectively erasing up to 100 concepts while preserving non\-target concepts with minimal degradation\. It also demonstrates competitive efficiency, erasing 100 concepts in only 4\.3 s on a single A100 GPU\.Erase 10 CelebritiesMSCOCOErase 50 CelebritiesMSCOCOErase 100 CelebritiesMSCOCOMethodAcce\\text\{Acc\}\_\{e\}↓\\downarrowAccs\\text\{Acc\}\_\{s\}↑\\uparrow\\cellcolororange\!40HoH\_\{o\}↑\\uparrow\\cellcolorblue\!20Time↓\\downarrowCS↑\\uparrowFID↓\\downarrowAcce\\text\{Acc\}\_\{e\}↓\\downarrowAccs\\text\{Acc\}\_\{s\}↑\\uparrow\\cellcolororange\!40HoH\_\{o\}↑\\uparrow\\cellcolorblue\!20Time↓\\downarrowCS↑\\uparrowFID↓\\downarrowAcce\\text\{Acc\}\_\{e\}↓\\downarrowAccs\\text\{Acc\}\_\{s\}↑\\uparrow\\cellcolororange\!40HoH\_\{o\}↑\\uparrow\\cellcolorblue\!20Time↓\\downarrowCS↑\\uparrowFID↓\\downarrowSD v1\.494\.4397\.51\\cellcolororange\!4010\.54\\cellcolorblue\!20–26\.56–97\.1197\.51\\cellcolororange\!405\.61\\cellcolorblue\!20–26\.56–99\.1797\.51\\cellcolororange\!401\.65\\cellcolorblue\!20–26\.56–CA25\.5287\.29\\cellcolororange\!4080\.38\\cellcolorblue\!20120026\.3011\.281\.8732\.25\\cellcolororange\!4048\.55\\cellcolorblue\!20264025\.6616\.970\.8011\.13\\cellcolororange\!4020\.01\\cellcolorblue\!20540025\.5918\.29ESD9\.8334\.86\\cellcolororange\!4050\.28\\cellcolorblue\!2030026\.0216\.2626\.2134\.46\\cellcolororange\!4046\.98\\cellcolorblue\!2096025\.9016\.5418\.8424\.16\\cellcolororange\!4037\.24\\cellcolorblue\!20180025\.9717\.24FMN44\.9066\.43\\cellcolororange\!4060\.24\\cellcolorblue\!2012\.026\.3611\.8435\.8256\.10\\cellcolororange\!4059\.87\\cellcolorblue\!2028\.026\.3711\.6651\.4849\.20\\cellcolororange\!4048\.86\\cellcolorblue\!2053\.026\.4011\.43UCE2\.4590\.89\\cellcolororange\!4094\.10\\cellcolorblue\!201\.525\.8317\.3738\.9088\.31\\cellcolororange\!4072\.23\\cellcolorblue\!201\.826\.1919\.8658\.6776\.79\\cellcolororange\!4053\.74\\cellcolorblue\!202\.125\.2328\.07MACE1\.8696\.72\\cellcolororange\!4097\.42\\cellcolorblue\!2020026\.3113\.873\.8293\.23\\cellcolororange\!4094\.68\\cellcolorblue\!2084025\.4417\.688\.0291\.60\\cellcolororange\!4091\.79\\cellcolorblue\!20180024\.8120\.92RECE0\.3080\.04\\cellcolororange\!4088\.91\\cellcolorblue\!206\.019\.7090\.380\.0036\.19\\cellcolororange\!4053\.15\\cellcolorblue\!2012\.013\.80218\.960\.0021\.78\\cellcolororange\!4035\.77\\cellcolorblue\!2025\.012\.93140\.08SPEED3\.2296\.52\\cellcolororange\!4096\.65\\cellcolorblue\!203\.826\.5510\.964\.0095\.07\\cellcolororange\!4095\.53\\cellcolorblue\!204\.226\.3016\.215\.2192\.67\\cellcolororange\!4093\.72\\cellcolorblue\!205\.026\.2918\.40Ours0\.6196\.92\\cellcolororange\!4098\.14\\cellcolorblue\!203\.226\.5311\.202\.6695\.91\\cellcolororange\!4096\.62\\cellcolorblue\!203\.626\.4316\.033\.4494\.42\\cellcolororange\!4095\.48\\cellcolorblue\!204\.326\.3318\.33

##### Evaluation Setup\.

We evaluate single concept erasure on object erasure and artistic style erasure\.

Forobject erasure, we edit ten models with each model targeted to erase one object class of the CIFAR\-10 dataset\. To evaluateefficiency, we generate 200 images using prompt “a photo of the\{erased class\}” with each model\. All the images are classified using CLIP, where lower classification accuracyAcce\\text\{Acc\}\_\{e\}indicates more effective erasure\. To assessspecificity, we generate 200 images for each of the nine remaining classes using the prompt “a photo of the \{unrelated class\}” with each model, where higher classification accuracyAccs\\text\{Acc\}\_\{s\}indicates better preservation of unrelated concepts\. The harmonic meanHo=2\(1−Acce\)−1\+\(Accs\)−1H\_\{o\}=\\frac\{2\}\{\(1\-\\mathrm\{Acc\}\_\{e\}\)^\{\-1\}\+\(\\mathrm\{Acc\}\_\{s\}\)^\{\-1\}\}is adopted to assess the overall performance of erasure\.

Forartistic style erasure, we use CLIP Score \(CS\) and Fréchet Inception Distance \(FID\) as metrics\. Forefficiency, we generate 100 images using prompt “a painting in the style of \{style\}” for each style\. We compute the average CS between the prompts and generated images, where lower CS indicates more effective erasure\. Forspecificity, we generate images with the first 10K prompts from the MSCOCO\-30k dataset, then compute CLIP scores and Fr’echet Inception Distance \(FID\) against the original images\. Higher CLIP scores and lower FID indicate better preservation\.

##### Analysis and discussion\.

Tab\.[1](https://arxiv.org/html/2605.28902#S5.T1)compares different methods on CIFAR\-10\. Our method achieves the best average performance across the five concept erasure tasks\. Compared to the state\-of\-the\-art method, our method reduces the averageA​c​ceAcc\_\{e\}from 8\.32 to 4\.61, indicating substantially stronger erasure effectiveness\. Meanwhile, it preservesA​c​csAcc\_\{s\}almost perfectly, with only a 0\.01% drop compared to the original model\. As a result, our method achieves the highest average harmonic scoreHoH\_\{o\}\. The results for the remaining five classes are provided in Appendix\.[D\.1](https://arxiv.org/html/2605.28902#A4.SS1)\.

Tab\.[2](https://arxiv.org/html/2605.28902#S5.T2)reports the results on artistic style erasure\. Although ESD exhibits relatively strong erasure capability \(lower CS\), it significantly degrades the model’s generation quality\. RECE and SPEED preserve generation ability well; however, their erasure performance becomes unreliable in some cases\. In contrast, our method consistently achieves effective erasure and preservation of overall generation quality\.

### 6\.2Multi\-Concept Erasure

##### Evaluation Setup

Multi\-concept erasure poses a more challenging setting, as mass concepts are erased simultaneously\. We follow the experiment setup in SPEED\(Liet al\.,[2025c](https://arxiv.org/html/2605.28902#bib.bib21)\)for large\-scale celebrity erasure, conducting experiments that erase 10, 50 and 100 celebrities, respectively\. In addtion, we collect a separate set of 100 celebrities as non\-target concepts to evaluate preservation behavior\. During evaluation, we employ five prompt templates for each celebrity\. For both target and non\-target sets, we generate 500 images in total by adjusting the number of images per celebrity accordingly\. All generated images are then evaluated using the GIPHY Celebrity Detector \(GCD\)\(Hastyet al\.,[2019](https://arxiv.org/html/2605.28902#bib.bib42)\), where we report the top\-1 detection accuracy\.Acce\\mathrm\{Acc\}\_\{e\}measures the detection accuracy on erased target concepts andAccs\\mathrm\{Acc\}\_\{s\}on non\-target concepts\. Meanwhile, to assess the overall performance of erasure, we compute theHoH\_\{o\}and report the CS and FID results on MSCOCO\.

![Refer to caption](https://arxiv.org/html/2605.28902v1/x4.png)Figure 4:Extension to DiT\-based models\. All experiments are conducted on FLUX\.1 dev\. \(a\) Object erasure results for the target concept “Mickey”, together with generations of non\-target concepts \(“Pikachu” and “Snoopy”\)\. \(b\) Artistic style erasure results for “Van Gogh”, together with non\-target styles “Monet” and “Picasso”\. \(c\) Celebrity erasure results for “Elon Musk”\. \(d\) Implicit concept erasure results for NSFW content\. Across all settings, our method achieves effective erasure while preserving non\-target concepts, demonstrating strong transferability to DiT\-based diffusion models\.
##### Analysis and discussion\.

Tab\.[3](https://arxiv.org/html/2605.28902#S6.T3)further demonstrates the superiority of our method in multi\-concept erasure\. Compared to the state\-of\-the\-art methods MACE and SPEED, our method achieves stronger erasure effectiveness while better preserving non\-target concepts\. In contrast, other methods tend to suffer from severe generation collapse as the number of erasure concepts increases\. Notably, our method also offers clear efficiency advantages\. While both SPEED and our approach are significantly faster than MACE, it is important to note that the reported runtime of SPEED does not include the additional three preprocessing required for its preservation sets\. By contrast, our method requires no such preprocessing and can be applied in a single\-step manner, making it substantially more practical for large\-scale multi\-concept erasure\. We also present qualitative results in Fig\.[3](https://arxiv.org/html/2605.28902#S6.F3), which show that our method maintains stable and effective erasure performance as the number of erased concepts increases, with minimal impact on non\-target concepts compared to existing methods\.

### 6\.3Implicit Concept Erasure

##### Evaluation setup\.

We evaluate our method on implicit concept erasure in scenarios where the target concept \(e\.g\., nudity\) is not explicitly appear in the prompt\. Follow the settings in SPEED\(Liet al\.,[2025c](https://arxiv.org/html/2605.28902#bib.bib21)\), we generate images with the Inappropriate Image Prompt \(I2P\) dataset\(Schramowskiet al\.,[2023](https://arxiv.org/html/2605.28902#bib.bib9)\)for evaluation, which consists of a diverse set of implicitly inappropriate prompts covering violence and sexual content\. To further examine adversarial robustness, we evaluate all methods on two popular adversarial attack benchmarks, including Ring\-A\-Bell\(Tsaiet al\.,[2023](https://arxiv.org/html/2605.28902#bib.bib55)\)and MMA\-Diffusion\(Yanget al\.,[2024](https://arxiv.org/html/2605.28902#bib.bib54)\)\. We use Nudenet with a threshold of 0\.6 to detect sexual content and report the Attack Success Rate \(ASR\)\. We further generate images with the first 1K prompts from MSCOCO\-30k, and report CS and FID to measure general preservation\.

##### Analysis and discussion\.

We introduce additional baseline methods, including CPE\(Leeet al\.,[2025](https://arxiv.org/html/2605.28902#bib.bib45)\), AdvUnlearn\(Zhanget al\.,[2024b](https://arxiv.org/html/2605.28902#bib.bib30)\), RACE\(Kimet al\.,[2024](https://arxiv.org/html/2605.28902#bib.bib46)\), Receler\(Huanget al\.,[2024](https://arxiv.org/html/2605.28902#bib.bib29)\), STEREO\(Srivatsanet al\.,[2025](https://arxiv.org/html/2605.28902#bib.bib31)\)and EraseFlow\(Kusumbaet al\.,[2026](https://arxiv.org/html/2605.28902#bib.bib57)\), which defend against adversarial attacks with adversarial training objectives\. Following SPEED\(Liet al\.,[2025c](https://arxiv.org/html/2605.28902#bib.bib21)\), we also adapt our method with adversarial editing \(denoted as Ours w/ AT\) following the setting in RECE\(Gonget al\.,[2024](https://arxiv.org/html/2605.28902#bib.bib20)\)for a fair comparison\. As shown in Tab\.[4](https://arxiv.org/html/2605.28902#S6.T4), although some training\-based methods \(e\.g\., CPE\) achieve strong robustness in concept erasure, they substantially degrade the generation quality of non\-target concepts on COCO and incur prohibitive computational costs\. In contrast, editing\-based methods are generally more efficient, but often fail to provide sufficiently reliable erasure under such adversarial settings\. Compared to these methods, our method can be seamlessly integrated with adversarial editing strategies, enabling robust concept erasure while effectively preserving the generation quality of unrelated concepts\.

Table 4:Comparison under implicit concept erasure and adversarial attacks\. Lower is better for I2P, MMA, Ring\-A\-Bell, and FID; higher is better for CS\.MethodI2P↓\\downarrowMMA↓\\downarrowRing\-A\-Bell↓\\downarrowMSCOCOCS↑\\uparrowFID↓\\downarrowSD v1\.40\.520\.630\.9826\.32\-MACE0\.210\.040\.0524\.0652\.78CPE0\.070\.010\.0026\.3248\.23AdvUnlearn0\.040\.000\.0024\.0557\.22UCE0\.240\.380\.3926\.2438\.60RECE0\.140\.200\.1825\.9840\.37RACE0\.230\.290\.2125\.5442\.73Receler0\.130\.070\.0125\.9340\.29SPEED0\.200\.240\.2026\.2937\.82SPEED w/ AT0\.100\.010\.0026\.0339\.51STEREO0\.010\.030\.1225\.3347\.10EraseFlow0\.040\.020\.1025\.1142\.33Ours w/o AT0\.110\.120\.0526\.3338\.31Ours w/ AT0\.050\.010\.0026\.1039\.73

### 6\.4Further Analysis

##### Extension to DiT\-based Models\.

To evaluate the transferability of our method, we further conduct experiments on the DiT\-based model222[https://github\.com/black\-forest\-labs/flux](https://github.com/black-forest-labs/flux)FLUX\.1 dev, covering object erasure, artistic style erasure, celebrity erasure, and implicit concept erasure\. Unlike U\-Net\-based diffusion models, DiT\-based architectures do not contain explicit cross\-attention layers\. Therefore, following333[https://github\.com/rohitgandikota/unified\-concept\-editing](https://github.com/rohitgandikota/unified-concept-editing)UCE, we choose to edit specific embedding layers in MMDiT\.

As shown in Fig\.[4](https://arxiv.org/html/2605.28902#S6.F4), our method consistently achieves strong erasure performance on FLUX while effectively preserving non\-target concepts, demonstrating its effectiveness across different diffusion architectures\. Additional quantitative results are provided in the Appendix\.[D\.2](https://arxiv.org/html/2605.28902#A4.SS2)\.

##### Ablation on Objective Designs\.

We conduct an ablation study to analyze the effect of different objective formulations under the multi\-concept erasure setting\. Specifically, we compare three variants: \(1\) vector\-wise erasure with vector\-wise preservation, \(2\) subspace\-level erasure with subspace\-level preservation, and \(3\) subspace\-level erasure with vector\-wise preservation \(OCE\)\. The results in Tab\.[5](https://arxiv.org/html/2605.28902#S6.T5)show that our asymmetric design achieves the best overall erasure performance\. Specifically, subspace\-level erasure mitigates interference among multiple erased concepts, while vector\-wise preservation provides finer\-grained control for preservation of non\-target concepts\. These results further validate the effectiveness of our asymmetric design\.

##### Ablation on global preservation priorK0K\_\{0\}\.

We study the effect of the size of the generic concept set used to constructK0K\_\{0\}\. Specifically, we evaluate reduced settings using 1/3 and 2/3 of the original COCO\-30K tokens, as well as a variant withoutK0K\_\{0\}in the multi\-concept erasure setting\. In the latter case, we replace the original preservation term‖P​W​C0−W​C0‖F2\\\|PWC\_\{0\}\-WC\_\{0\}\\\|\_\{F\}^\{2\}with‖P​W​Cn−W​Cn‖F2\+‖P​W−W‖F2\\\|PWC\_\{n\}\-WC\_\{n\}\\\|\_\{F\}^\{2\}\+\\\|PW\-W\\\|\_\{F\}^\{2\}, which changes the closed\-form solution’s preservation component toW​\(I\+Cn​Cn⊤\)​W⊤W\(I\+C\_\{n\}C\_\{n\}^\{\\top\}\)W^\{\\top\}\. As shown in Tab\.[6](https://arxiv.org/html/2605.28902#S6.T6), increasing the number of generic concepts consistently improves the trade\-off between erasure and preservation performance, demonstrating the effectiveness of the global preservation prior\. The computation ofK0K\_\{0\}is efficient \( 3s on A100 GPU\) and performed offline\.

Table 5:Ablation study on different objective formulations under the multi\-concept erasure setting\.EandPdenote the erasure and preservation objective, respectively\.AblationSubspaceErase 100 CelebritiesMSCOCOEPAcce↓\\text\{Acc\}\_\{e\}\\downarrowAccs↑\\text\{Acc\}\_\{s\}\\uparrow\\cellcolororange\!40Ho↑H\_\{o\}\\uparrowCS↑\\uparrowFID↓\\downarrow1×\\times×\\times7\.5991\.37\\cellcolororange\!4091\.7025\.8320\.792✓✓4\.5493\.01\\cellcolororange\!4094\.2226\.1718\.10Ours✓×\\times3\.4494\.42\\cellcolororange\!4095\.4826\.3318\.33SD v1\.4––99\.1797\.51\\cellcolororange\!401\.6526\.56–

Table 6:Ablation study on the effect of the global preservation prior under the multi\-concept erasure setting\.\|Cg\|\|C\_\{g\}\|ratioErase 100 CelebritiesMSCOCOAcce↓\\text\{Acc\}\_\{e\}\\downarrowAccs↑\\text\{Acc\}\_\{s\}\\uparrow\\cellcolororange\!40Ho↑H\_\{o\}\\uparrowCS↑\\uparrowFID↓\\downarrowNone6\.7294\.32\\cellcolororange\!4093\.8026\.1422\.761/34\.4793\.44\\cellcolororange\!4094\.4726\.3119\.312/33\.8593\.63\\cellcolororange\!4094\.8726\.3518\.60Full3\.4494\.42\\cellcolororange\!4095\.4826\.3318\.33SD v1\.499\.1797\.51\\cellcolororange\!401\.6526\.56–

## 7Limitations\.

Although OCE achieves effective erasure, it still has several limitations\. First, the SVD\-based subspace erasure may introduce additional computational overhead when scaling to larger models\. Second, since OCE enforces subspace\-level rather than strict vector\-wise constraints, the generated outputs may fall into intermediate semantic regions instead of precisely matching fine\-grained anchor concepts, which can limit its applicability in editing tasks\. Finally, further exploration is needed for erasure of more implicit concepts such as relational, compositional understanding, or watermarks\.

## 8Conclusion

In this work, we proposeOrthogonal Concept Erasure \(OCE\), a geometry\-driven editing\-based method for concept erasure in text\-to\-image diffusion models\. By revisiting concept erasure from a geometric perspective, OCE formulates concept erasure as a multiplicative orthogonal transformation applied to the parameters\. This design enables direct control over neuron directions while preserving their intrinsic angular geometry\. Moreover, by extending vector\-wise erasure to a structured subspace\-level projection objective with a closed\-form solution, OCE achieves precise and stable erasure of target concepts while effectively retaining non\-target concepts\. This formulation naturally scales to multi\-concept erasure and transfers to DiT\-based models\.

## Acknowledgments

This work is supported by the National Nature Science Foundation of China \(62425114, 62121002, U23B2028, 62472395\)\.

## Impact Statement

Our work proposes Orthogonal Concept Erasure \(OCE\), a method for efficiently and precisely removing sensitive or undesired concepts from text\-to\-image diffusion models while preserving the overall generative capabilities\. OCE supports both single\- and multi\-concept erasure with minimal interference on unrelated concepts, providing a controllable mechanism to improve the safety and reliability of generative AI models\. By reducing the risk of generating harmful, offensive, or privacy\-violating content, OCE promotes safer AI deployment in content creation, education, and public platforms\. This approach offers a practical and scalable solution for enhancing model controllability and aligning generative outputs with ethical and social standards\.

## References

- M\. A\. Bennani, T\. Doan, and M\. Sugiyama \(2020\)Generalisation guarantees for continual learning with orthogonal gradient descent\.arXiv preprint arXiv:2006\.11942\.Cited by:[§2\.2](https://arxiv.org/html/2605.28902#S2.SS2.p1.1)\.
- S\. D\. Biswas, A\. Roy, and K\. Roy \(2025\)CURE: concept unlearning via orthogonal representation editing in diffusion models\.InThe Thirty\-ninth Annual Conference on Neural Information Processing Systems,Cited by:[Appendix B](https://arxiv.org/html/2605.28902#A2.p1.1),[Appendix B](https://arxiv.org/html/2605.28902#A2.p2.1)\.
- A\. Bui, T\. Vu, L\. Vuong, T\. Le, P\. Montague, T\. Abraham, J\. Kim, and D\. Phung \(2025\)Fantastic targets for concept erasure in diffusion models and where to find them\.arXiv preprint arXiv:2501\.18950\.Cited by:[Appendix B](https://arxiv.org/html/2605.28902#A2.p1.1)\.
- N\. Carlini, J\. Hayes, M\. Nasr, M\. Jagielski, V\. Sehwag, F\. Tramer, B\. Balle, D\. Ippolito, and E\. Wallace \(2023\)Extracting training data from diffusion models\.In32nd USENIX security symposium \(USENIX Security 23\),pp\. 5253–5270\.Cited by:[§1](https://arxiv.org/html/2605.28902#S1.p1.1)\.
- B\. Chen, W\. Liu, Z\. Yu, J\. Kautz, A\. Shrivastava, A\. Garg, and A\. Anandkumar \(2020\)Angular visual hardness\.InInternational conference on machine learning,pp\. 1637–1648\.Cited by:[§1](https://arxiv.org/html/2605.28902#S1.p3.1),[§3\.1](https://arxiv.org/html/2605.28902#S3.SS1.p5.1)\.
- C\. Fan, J\. Liu, Y\. Zhang, E\. Wong, D\. Wei, and S\. Liu \(2023\)Salun: empowering machine unlearning via gradient\-based weight saliency in both image classification and generation\.arXiv preprint arXiv:2310\.12508\.Cited by:[Appendix B](https://arxiv.org/html/2605.28902#A2.p1.1)\.
- M\. Farajtabar, N\. Azizan, A\. Mott, and A\. Li \(2020\)Orthogonal gradient descent for continual learning\.InInternational conference on artificial intelligence and statistics,pp\. 3762–3773\.Cited by:[§2\.2](https://arxiv.org/html/2605.28902#S2.SS2.p1.1)\.
- R\. Gandikota, J\. Materzynska, J\. Fiotto\-Kaufman, and D\. Bau \(2023\)Erasing concepts from diffusion models\.InProceedings of the IEEE/CVF international conference on computer vision,pp\. 2426–2436\.Cited by:[§C\.1](https://arxiv.org/html/2605.28902#A3.SS1.SSS0.Px3.p1.1),[§1](https://arxiv.org/html/2605.28902#S1.p2.1),[§2\.1](https://arxiv.org/html/2605.28902#S2.SS1.p1.1),[§6](https://arxiv.org/html/2605.28902#S6.p1.1)\.
- R\. Gandikota, H\. Orgad, Y\. Belinkov, J\. Materzyńska, and D\. Bau \(2024\)Unified concept editing in diffusion models\.InProceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision,pp\. 5111–5120\.Cited by:[§C\.1](https://arxiv.org/html/2605.28902#A3.SS1.SSS0.Px3.p1.1),[§1](https://arxiv.org/html/2605.28902#S1.p2.1),[§1](https://arxiv.org/html/2605.28902#S1.p3.1),[§2\.1](https://arxiv.org/html/2605.28902#S2.SS1.p1.1),[§6](https://arxiv.org/html/2605.28902#S6.p1.1)\.
- D\. Gao, S\. Lu, W\. Zhou, J\. Chu, J\. Zhang, M\. Jia, B\. Zhang, Z\. Fan, and W\. Zhang \(2025\)Eraseanything: enabling concept erasure in rectified flow transformers\.InForty\-second International Conference on Machine Learning,Cited by:[Appendix B](https://arxiv.org/html/2605.28902#A2.p1.1)\.
- C\. Gong, K\. Chen, Z\. Wei, J\. Chen, and Y\. Jiang \(2024\)Reliable and efficient concept erasure of text\-to\-image diffusion models\.InEuropean Conference on Computer Vision,pp\. 73–88\.Cited by:[§C\.1](https://arxiv.org/html/2605.28902#A3.SS1.SSS0.Px3.p1.1),[§1](https://arxiv.org/html/2605.28902#S1.p2.1),[§1](https://arxiv.org/html/2605.28902#S1.p3.1),[§2\.1](https://arxiv.org/html/2605.28902#S2.SS1.p1.1),[§6\.3](https://arxiv.org/html/2605.28902#S6.SS3.SSS0.Px2.p1.1),[§6](https://arxiv.org/html/2605.28902#S6.p1.1)\.
- N\. Hasty, I\. Kroosh, D\. Voitekh, and Dmytro\. Korduban \(2019\)Giphy celebrity detector\.External Links:[Link](https://github.com/Giphy/celeb-detection-oss)Cited by:[§C\.1](https://arxiv.org/html/2605.28902#A3.SS1.SSS0.Px2.p1.9),[§6\.2](https://arxiv.org/html/2605.28902#S6.SS2.SSS0.Px1.p1.3)\.
- J\. Ho, A\. Jain, and P\. Abbeel \(2020\)Denoising diffusion probabilistic models\.Advances in neural information processing systems33,pp\. 6840–6851\.Cited by:[§1](https://arxiv.org/html/2605.28902#S1.p1.1)\.
- J\. Ho and T\. Salimans \(2022\)Classifier\-free diffusion guidance\.arXiv preprint arXiv:2207\.12598\.Cited by:[§1](https://arxiv.org/html/2605.28902#S1.p1.1)\.
- C\. Huang, K\. Chang, C\. Tsai, Y\. Lai, F\. Yang, and Y\. F\. Wang \(2024\)Receler: reliable concept erasing of text\-to\-image diffusion models via lightweight erasers\.InEuropean Conference on Computer Vision,pp\. 360–376\.Cited by:[Appendix B](https://arxiv.org/html/2605.28902#A2.p1.1),[§6\.3](https://arxiv.org/html/2605.28902#S6.SS3.SSS0.Px2.p1.1)\.
- A\. Jain, Y\. Kobayashi, T\. Shibuya, Y\. Takida, N\. Memon, J\. Togelius, and Y\. Mitsufuji \(2024\)Trasce: trajectory steering for concept erasure\.arXiv preprint arXiv:2412\.07658\.Cited by:[§2\.1](https://arxiv.org/html/2605.28902#S2.SS1.p1.1)\.
- H\. H\. Jiang, L\. Brown, J\. Cheng, M\. Khan, A\. Gupta, D\. Workman, A\. Hanna, J\. Flowers, and T\. Gebru \(2023\)AI art and its impact on artists\.InProceedings of the 2023 AAAI/ACM Conference on AI, Ethics, and Society,pp\. 363–374\.Cited by:[§1](https://arxiv.org/html/2605.28902#S1.p1.1)\.
- C\. Kim, K\. Min, and Y\. Yang \(2024\)Race: robust adversarial concept erasure for secure text\-to\-image diffusion model\.InEuropean Conference on Computer Vision,pp\. 461–478\.Cited by:[Appendix B](https://arxiv.org/html/2605.28902#A2.p1.1),[§6\.3](https://arxiv.org/html/2605.28902#S6.SS3.SSS0.Px2.p1.1)\.
- N\. Kumari, B\. Zhang, S\. Wang, E\. Shechtman, R\. Zhang, and J\. Zhu \(2023\)Ablating concepts in text\-to\-image diffusion models\.InProceedings of the IEEE/CVF International Conference on Computer Vision,pp\. 22691–22702\.Cited by:[§C\.1](https://arxiv.org/html/2605.28902#A3.SS1.SSS0.Px3.p1.1),[§1](https://arxiv.org/html/2605.28902#S1.p2.1),[§2\.1](https://arxiv.org/html/2605.28902#S2.SS1.p1.1),[§6](https://arxiv.org/html/2605.28902#S6.p1.1)\.
- N\. S\. A\. Kusumba, M\. Patel, K\. Min, C\. Kim, C\. Baral, and Y\. Yang \(2026\)EraseFlow: learning concept erasure policies via gflownet\-driven alignment\.Advances in Neural Information Processing Systems38,pp\. 70876–70911\.Cited by:[§6\.3](https://arxiv.org/html/2605.28902#S6.SS3.SSS0.Px2.p1.1)\.
- B\. H\. Lee, S\. Lim, S\. Lee, D\. U\. Kang, and S\. Y\. Chun \(2025\)Concept pinpoint eraser for text\-to\-image diffusion models via residual attention gate\.arXiv preprint arXiv:2506\.22806\.Cited by:[Appendix B](https://arxiv.org/html/2605.28902#A2.p1.1),[§6\.3](https://arxiv.org/html/2605.28902#S6.SS3.SSS0.Px2.p1.1)\.
- F\. Li, Q\. Xu, S\. Bao, Z\. Yang, X\. Cao, and Q\. Huang \(2025a\)One image is worth a thousand words: a usability preservable text\-image collaborative erasing framework\.arXiv preprint arXiv:2505\.11131\.Cited by:[Appendix B](https://arxiv.org/html/2605.28902#A2.p1.1)\.
- L\. Li, S\. Lu, Y\. Ren, and A\. W\. Kong \(2025b\)Set you straight: auto\-steering denoising trajectories to sidestep unwanted concepts\.InProceedings of the 33rd ACM International Conference on Multimedia,pp\. 9257–9266\.Cited by:[Appendix B](https://arxiv.org/html/2605.28902#A2.p1.1)\.
- O\. Li, Y\. Wang, X\. Hu, H\. Jiang, T\. Liang, Y\. Hao, G\. Ma, and F\. Feng \(2025c\)Speed: scalable, precise, and efficient concept erasure for diffusion models\.arXiv preprint arXiv:2503\.07392\.Cited by:[§C\.1](https://arxiv.org/html/2605.28902#A3.SS1.SSS0.Px2.p1.9),[§C\.1](https://arxiv.org/html/2605.28902#A3.SS1.SSS0.Px3.p1.1),[§1](https://arxiv.org/html/2605.28902#S1.p2.1),[§1](https://arxiv.org/html/2605.28902#S1.p3.1),[§2\.1](https://arxiv.org/html/2605.28902#S2.SS1.p1.1),[§6\.2](https://arxiv.org/html/2605.28902#S6.SS2.SSS0.Px1.p1.3),[§6\.3](https://arxiv.org/html/2605.28902#S6.SS3.SSS0.Px1.p1.1),[§6\.3](https://arxiv.org/html/2605.28902#S6.SS3.SSS0.Px2.p1.1),[§6](https://arxiv.org/html/2605.28902#S6.p1.1)\.
- Y\. Lin, N\. Huang, K\. Huang, H\. Liu, Y\. Yan, J\. Guo, T\. Lee, and X\. Li \(2025\)ICE: intercede concept erasure in text\-to\-image diffusion models\.InProceedings of the 33rd ACM International Conference on Multimedia,pp\. 11328–11336\.Cited by:[§2\.1](https://arxiv.org/html/2605.28902#S2.SS1.p1.1)\.
- W\. Liu, Z\. Liu, Z\. Yu, B\. Dai, R\. Lin, Y\. Wang, J\. M\. Rehg, and L\. Song \(2018\)Decoupled networks\.InProceedings of the IEEE Conference on Computer Vision and Pattern Recognition,pp\. 2771–2779\.Cited by:[§1](https://arxiv.org/html/2605.28902#S1.p3.1),[§3\.1](https://arxiv.org/html/2605.28902#S3.SS1.p5.1)\.
- W\. Liu, Z\. Qiu, Y\. Feng, Y\. Xiu, Y\. Xue, L\. Yu, H\. Feng, Z\. Liu, J\. Heo, S\. Peng,et al\.\(2023\)Parameter\-efficient orthogonal finetuning via butterfly factorization\.arXiv preprint arXiv:2311\.06243\.Cited by:[§2\.2](https://arxiv.org/html/2605.28902#S2.SS2.p1.1)\.
- W\. Liu, Y\. Zhang, X\. Li, Z\. Yu, B\. Dai, T\. Zhao, and L\. Song \(2017\)Deep hyperspherical learning\.Advances in neural information processing systems30\.Cited by:[§1](https://arxiv.org/html/2605.28902#S1.p3.1),[§3\.1](https://arxiv.org/html/2605.28902#S3.SS1.p5.1)\.
- Y\. Liu, J\. An, W\. Zhang, M\. Li, D\. Wu, J\. Gu, Z\. Lin, and W\. Wang \(2024\)Realera: semantic\-level concept erasure via neighbor\-concept mining\.arXiv preprint arXiv:2410\.09140\.Cited by:[Appendix B](https://arxiv.org/html/2605.28902#A2.p1.1)\.
- S\. Lu, Z\. Wang, L\. Li, Y\. Liu, and A\. W\. Kong \(2024\)Mace: mass concept erasure in diffusion models\.InProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition,pp\. 6430–6440\.Cited by:[§C\.1](https://arxiv.org/html/2605.28902#A3.SS1.SSS0.Px3.p1.1),[§1](https://arxiv.org/html/2605.28902#S1.p2.1),[§2\.1](https://arxiv.org/html/2605.28902#S2.SS1.p1.1),[§6](https://arxiv.org/html/2605.28902#S6.p1.1)\.
- M\. Lyu, Y\. Yang, H\. Hong, H\. Chen, X\. Jin, Y\. He, H\. Xue, J\. Han, and G\. Ding \(2024\)One\-dimensional adapter to rule them all: concepts diffusion models and erasing applications\.InProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition,pp\. 7559–7568\.Cited by:[§1](https://arxiv.org/html/2605.28902#S1.p2.1),[§2\.1](https://arxiv.org/html/2605.28902#S2.SS1.p1.1)\.
- X\. Ma, X\. Chu, Z\. Yang, Y\. Lin, X\. Gao, and J\. Zhao \(2024\)Parameter efficient quasi\-orthogonal fine\-tuning via givens rotation\.arXiv preprint arXiv:2404\.04316\.Cited by:[§2\.2](https://arxiv.org/html/2605.28902#S2.SS2.p1.1)\.
- Y\. Mirsky and W\. Lee \(2021\)The creation and detection of deepfakes: a survey\.ACM computing surveys \(CSUR\)54\(1\),pp\. 1–41\.Cited by:[§1](https://arxiv.org/html/2605.28902#S1.p1.1)\.
- K\. Nguyen, A\. Tran, and C\. Pham \(2025\)SuMa: a subspace mapping approach for robust and effective concept erasure in text\-to\-image diffusion models\.InProceedings of the IEEE/CVF International Conference on Computer Vision,pp\. 19587–19596\.Cited by:[§2\.1](https://arxiv.org/html/2605.28902#S2.SS1.p1.1)\.
- A\. Nichol, P\. Dhariwal, A\. Ramesh, P\. Shyam, P\. Mishkin, B\. McGrew, I\. Sutskever, and M\. Chen \(2021\)Glide: towards photorealistic image generation and editing with text\-guided diffusion models\.arXiv preprint arXiv:2112\.10741\.Cited by:[§1](https://arxiv.org/html/2605.28902#S1.p1.1)\.
- H\. Orgad, B\. Kawar, and Y\. Belinkov \(2023\)Editing implicit assumptions in text\-to\-image diffusion models\.InProceedings of the IEEE/CVF International Conference on Computer Vision,pp\. 7053–7061\.Cited by:[§2\.1](https://arxiv.org/html/2605.28902#S2.SS1.p1.1)\.
- Z\. Qiu, W\. Liu, H\. Feng, Y\. Xue, Y\. Feng, Z\. Liu, D\. Zhang, A\. Weller, and B\. Schölkopf \(2023\)Controlling text\-to\-image diffusion by orthogonal finetuning\.Advances in Neural Information Processing Systems36,pp\. 79320–79362\.Cited by:[§1](https://arxiv.org/html/2605.28902#S1.p3.1),[§2\.2](https://arxiv.org/html/2605.28902#S2.SS2.p1.1),[§3\.1](https://arxiv.org/html/2605.28902#S3.SS1.p5.1)\.
- Z\. Qiu, W\. Liu, A\. Weller, and B\. Schölkopf \(2025\)Orthogonal finetuning made scalable\.arXiv preprint arXiv:2506\.19847\.Cited by:[§2\.2](https://arxiv.org/html/2605.28902#S2.SS2.p1.1)\.
- A\. Ramesh, P\. Dhariwal, A\. Nichol, C\. Chu, and M\. Chen \(2022\)Hierarchical text\-conditional image generation with clip latents\.arXiv preprint arXiv:2204\.061251\(2\),pp\. 3\.Cited by:[§1](https://arxiv.org/html/2605.28902#S1.p1.1)\.
- J\. Rando, D\. Paleka, D\. Lindner, L\. Heim, and F\. Tramèr \(2022\)Red\-teaming the stable diffusion safety filter\.arXiv preprint arXiv:2210\.04610\.Cited by:[§2\.1](https://arxiv.org/html/2605.28902#S2.SS1.p1.1)\.
- R\. Rombach, A\. Blattmann, D\. Lorenz, P\. Esser, and B\. Ommer \(2022\)High\-resolution image synthesis with latent diffusion models\.InProceedings of the IEEE/CVF conference on computer vision and pattern recognition,pp\. 10684–10695\.Cited by:[§1](https://arxiv.org/html/2605.28902#S1.p1.1)\.
- R\. Rombach \(2022\)Stable diffusion 2\.0 release\.Cited by:[§2\.1](https://arxiv.org/html/2605.28902#S2.SS1.p1.1)\.
- P\. H\. Schönemann \(1966\)A generalized solution of the orthogonal procrustes problem\.Psychometrika31\(1\),pp\. 1–10\.Cited by:[§5\.1](https://arxiv.org/html/2605.28902#S5.SS1.SSS0.Px1.p2.1)\.
- P\. Schramowski, M\. Brack, B\. Deiseroth, and K\. Kersting \(2023\)Safe latent diffusion: mitigating inappropriate degeneration in diffusion models\.InProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition,pp\. 22522–22531\.Cited by:[§1](https://arxiv.org/html/2605.28902#S1.p1.1),[§1](https://arxiv.org/html/2605.28902#S1.p2.1),[§2\.1](https://arxiv.org/html/2605.28902#S2.SS1.p1.1),[§6\.3](https://arxiv.org/html/2605.28902#S6.SS3.SSS0.Px1.p1.1)\.
- J\. Song, C\. Meng, and S\. Ermon \(2020a\)Denoising diffusion implicit models\.arXiv preprint arXiv:2010\.02502\.Cited by:[§1](https://arxiv.org/html/2605.28902#S1.p1.1)\.
- Y\. Song, J\. Sohl\-Dickstein, D\. P\. Kingma, A\. Kumar, S\. Ermon, and B\. Poole \(2020b\)Score\-based generative modeling through stochastic differential equations\.arXiv preprint arXiv:2011\.13456\.Cited by:[§1](https://arxiv.org/html/2605.28902#S1.p1.1)\.
- K\. Srivatsan, F\. Shamshad, M\. Naseer, V\. M\. Patel, and K\. Nandakumar \(2025\)Stereo: a two\-stage framework for adversarially robust concept erasing from text\-to\-image diffusion models\.InProceedings of the Computer Vision and Pattern Recognition Conference,pp\. 23765–23774\.Cited by:[§2\.1](https://arxiv.org/html/2605.28902#S2.SS1.p1.1),[§6\.3](https://arxiv.org/html/2605.28902#S6.SS3.SSS0.Px2.p1.1)\.
- Y\. Sun, Y\. Zhang, G\. Liu, H\. Xie, and S\. Liu \(2025\)Invisible watermarks, visible gains: steering machine unlearning with bi\-level watermarking design\.InProceedings of the IEEE/CVF International Conference on Computer Vision,pp\. 2417–2428\.Cited by:[§2\.1](https://arxiv.org/html/2605.28902#S2.SS1.p1.1)\.
- Y\. Tsai, C\. Hsu, C\. Xie, C\. Lin, J\. Chen, B\. Li, P\. Chen, C\. Yu, and C\. Huang \(2023\)Ring\-a\-bell\! how reliable are concept removal methods for diffusion models?\.arXiv preprint arXiv:2310\.10012\.Cited by:[§6\.3](https://arxiv.org/html/2605.28902#S6.SS3.SSS0.Px1.p1.1)\.
- Y\. Wang, O\. Li, T\. Mu, Y\. Hao, K\. Liu, X\. Wang, and X\. He \(2025\)Precise, fast, and low\-cost concept erasure in value space: orthogonal complement matters\.In2025 IEEE/CVF Conference on Computer Vision and Pattern Recognition \(CVPR\),pp\. 28759–28768\.Cited by:[Appendix B](https://arxiv.org/html/2605.28902#A2.p1.1),[Appendix B](https://arxiv.org/html/2605.28902#A2.p2.1),[§1](https://arxiv.org/html/2605.28902#S1.p2.1),[§2\.1](https://arxiv.org/html/2605.28902#S2.SS1.p1.1)\.
- Y\. Yang, R\. Gao, X\. Wang, T\. Ho, N\. Xu, and Q\. Xu \(2024\)Mma\-diffusion: multimodal attack on diffusion models\.InProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition,pp\. 7737–7746\.Cited by:[§6\.3](https://arxiv.org/html/2605.28902#S6.SS3.SSS0.Px1.p1.1)\.
- J\. Yoon, S\. Yu, V\. Patil, H\. Yao, and M\. Bansal \(2024\)Safree: training\-free and adaptive guard for safe text\-to\-image and video generation\.arXiv preprint arXiv:2410\.12761\.Cited by:[§1](https://arxiv.org/html/2605.28902#S1.p2.1),[§2\.1](https://arxiv.org/html/2605.28902#S2.SS1.p1.1)\.
- G\. Zhang, K\. Wang, X\. Xu, Z\. Wang, and H\. Shi \(2024a\)Forget\-me\-not: learning to forget in text\-to\-image diffusion models\.InProceedings of the IEEE/CVF conference on computer vision and pattern recognition,pp\. 1755–1764\.Cited by:[§C\.1](https://arxiv.org/html/2605.28902#A3.SS1.SSS0.Px3.p1.1),[§2\.1](https://arxiv.org/html/2605.28902#S2.SS1.p1.1),[§6](https://arxiv.org/html/2605.28902#S6.p1.1)\.
- Y\. Zhang, E\. Jin, Y\. Dong, Y\. Wu, P\. Torr, A\. Khakzar, J\. Stegmaier, and K\. Kawaguchi \(2025\)Minimalist concept erasure in generative models\.arXiv preprint arXiv:2507\.13386\.Cited by:[Appendix B](https://arxiv.org/html/2605.28902#A2.p1.1)\.
- Y\. Zhang, X\. Chen, J\. Jia, Y\. Zhang, C\. Fan, J\. Liu, M\. Hong, K\. Ding, and S\. Liu \(2024b\)Defensive unlearning with adversarial training for robust concept erasure in diffusion models\.Advances in neural information processing systems37,pp\. 36748–36776\.Cited by:[§2\.1](https://arxiv.org/html/2605.28902#S2.SS1.p1.1),[§6\.3](https://arxiv.org/html/2605.28902#S6.SS3.SSS0.Px2.p1.1)\.
- Y\. Zhang, J\. Jia, X\. Chen, A\. Chen, Y\. Zhang, J\. Liu, K\. Ding, and S\. Liu \(2024c\)To generate or not? safety\-driven unlearned diffusion models are still easy to generate unsafe images… for now\.InEuropean Conference on Computer Vision,pp\. 385–403\.Cited by:[§1](https://arxiv.org/html/2605.28902#S1.p1.1)\.

## Appendix AAdditional Theoretical Analysis

### A\.1Derivation of the Vector\-wise Orthogonal Erasure

We provide a detailed derivation of the closed\-form solution to the vector\-wise orthogonal erasure objective in Eq\. \(9\)\.

Starting from the objective

minP⊤​P=I⁡‖P​A−B‖F2,\\min\_\{P^\{\\top\}P=I\}\\\|PA\-B\\\|\_\{F\}^\{2\},\(21\)we expand the Frobenius norm as

‖P​A−B‖F2\\displaystyle\\\|PA\-B\\\|\_\{F\}^\{2\}=tr​\(\(P​A−B\)⊤​\(P​A−B\)\)\\displaystyle=\\mathrm\{tr\}\(\(PA\-B\)^\{\\top\}\(PA\-B\)\)\(22\)=tr​\(A⊤​A\)\+tr​\(B⊤​B\)−2​tr​\(P⊤​B​A⊤\),\\displaystyle=\\mathrm\{tr\}\(A^\{\\top\}A\)\+\\mathrm\{tr\}\(B^\{\\top\}B\)\-2\\,\\mathrm\{tr\}\(P^\{\\top\}BA^\{\\top\}\),\(23\)where we use the orthogonality constraintP⊤​P=IP^\{\\top\}P=I\.

Sincetr​\(A⊤​A\)\\mathrm\{tr\}\(A^\{\\top\}A\)andtr​\(B⊤​B\)\\mathrm\{tr\}\(B^\{\\top\}B\)are constant with respect toPP, minimizing the original objective is equivalent to maximizing

maxP⊤​P=I⁡tr​\(P⊤​M\),M=B​A⊤\.\\max\_\{P^\{\\top\}P=I\}\\mathrm\{tr\}\(P^\{\\top\}M\),\\qquad M=BA^\{\\top\}\.\(24\)
This is the classical orthogonal Procrustes problem, whose optimal solution is given by

whereU​Σ​V⊤U\\Sigma V^\{\\top\}is the singular value decomposition ofMM\.

### A\.2Derivation of Subspace\-level Orthogonal Concept Erasure \(OCE\)

In this appendix, we provide detailed derivations for the proposed subspace\-level Orthogonal Concept Erasure \(OCE\) objective\. In particular, we clarify the connection between vector\-wise and subspace\-level formulations, and rigorously justify the unified quadratic objective used to obtain the closed\-form orthogonal update\.

#### A\.2\.1Subspace\-level Erasure Objective

For erasure, instead of enforcing point\-wise alignment between target and anchor embeddings, we operate at the subspace level\. Let

G=orth​\(W​C1\),G∗=orth​\(W​C∗\)G=\\mathrm\{orth\}\(WC\_\{1\}\),\\qquad G\_\{\\ast\}=\\mathrm\{orth\}\(WC\_\{\\ast\}\)\(26\)denote orthonormal bases for the target and anchor subspaces, respectively\. Their corresponding projection matrices are

R=G​G⊤,R∗=G∗​G∗⊤\.R=GG^\{\\top\},\\qquad R\_\{\\ast\}=G\_\{\\ast\}G\_\{\\ast\}^\{\\top\}\.\(27\)We aim to suppress components of the transformed target subspace that lie outside the anchor subspace\. This leads to the following erasure objective:

minP⊤​P=I−‖P​R−R∗,⟂‖F2\.\\min\_\{P^\{\\top\}P=I\}\\;\-\\\|PR\-R\_\{\\ast,\\perp\}\\\|\_\{F\}^\{2\}\.\(28\)WhereR∗,⟂=I−R∗,⟂R\_\{\\ast,\\perp\}=I\-R\_\{\\ast,\\perp\}, Expanding the Frobenius norm and discarding constants gives

‖P​R−R∗,⟂‖F2=tr​\(R⊤​P⊤​P​R\)−2​tr​\(R∗,⟂⊤​P​R\)\+const\\displaystyle\\\|PR\-R\_\{\\ast,\\perp\}\\\|\_\{F\}^\{2\}=\\mathrm\{tr\}\\\!\\left\(R^\{\\top\}P^\{\\top\}PR\\right\)\-2\\,\\mathrm\{tr\}\\\!\\left\(R\_\{\\ast,\\perp\}^\{\\top\}PR\\right\)\+\\mathrm\{const\}\(29\)Under the orthogonality constraintP⊤​P=IP^\{\\top\}P=I, the first term is constant\. Therefore, minimizing Eq\. \([28](https://arxiv.org/html/2605.28902#A1.E28)\) is equivalent to the following linear trace maximization:

maxP⊤​P=I⁡tr​\(P⊤​Me\),Me=−\(I−R∗\)​R\.\\max\_\{P^\{\\top\}P=I\}\\;\\mathrm\{tr\}\\\!\\left\(P^\{\\top\}M\_\{e\}\\right\),\\qquad M\_\{e\}=\-\(I\-R\_\{\\ast\}\)R\.\(30\)

#### A\.2\.2Vector\-wise Preservation Objective

Recall that the vector\-wise preservation objective for the retain setC0C\_\{0\}is defined as

minP⊤​P=I⁡‖P​W​C0−W​C0‖F2\.\\min\_\{P^\{\\top\}P=I\}\\;\\\|PWC\_\{0\}\-WC\_\{0\}\\\|\_\{F\}^\{2\}\.\(31\)Expanding the squared Frobenius norm yields

‖P​W​C0−W​C0‖F2\\displaystyle\\\|PWC\_\{0\}\-WC\_\{0\}\\\|\_\{F\}^\{2\}=tr​\(C0⊤​W⊤​P⊤​P​W​C0\)−2​tr​\(C0⊤​W⊤​P​W​C0\)\+const\.\\displaystyle=\\mathrm\{tr\}\\\!\\left\(C\_\{0\}^\{\\top\}W^\{\\top\}P^\{\\top\}PWC\_\{0\}\\right\)\-2\\,\\mathrm\{tr\}\\\!\\left\(C\_\{0\}^\{\\top\}W^\{\\top\}PWC\_\{0\}\\right\)\+\\mathrm\{const\}\.\(32\)Under the orthogonality constraintP⊤​P=IP^\{\\top\}P=I, the first term is constant\. Therefore, minimizing Eq\. \([31](https://arxiv.org/html/2605.28902#A1.E31)\) is equivalent to the following linear trace maximization:

maxP⊤​P=I⁡tr​\(P⊤​M0\),M0=W​C0​C0⊤​W⊤,\\max\_\{P^\{\\top\}P=I\}\\;\\mathrm\{tr\}\(P^\{\\top\}M\_\{0\}\),\\qquad M\_\{0\}=WC\_\{0\}C\_\{0\}^\{\\top\}W^\{\\top\},\(33\)whereM0M\_\{0\}is symmetric and positive semidefinite\.

Combining Eq\. \([33](https://arxiv.org/html/2605.28902#A1.E33)\) and Eq\. \([30](https://arxiv.org/html/2605.28902#A1.E30)\) leads to the final objective:

maxP⊤​P=I⁡tr​\(P⊤​\(Me\+M0\)\)\.\\max\_\{P^\{\\top\}P=I\}\\;\\;\\mathrm\{tr\}\(P^\{\\top\}\(M\_\{e\}\+M\_\{0\}\)\)\.\(34\)

## Appendix BExtented Related Works

In addition to the concept erasure methods discussed in the main text, there is also a large body of machine unlearning and concept erasure methods, including EA\(Gaoet al\.,[2025](https://arxiv.org/html/2605.28902#bib.bib33)\), MCE\(Zhanget al\.,[2025](https://arxiv.org/html/2605.28902#bib.bib52)\), Co\-Erasing\(Liet al\.,[2025a](https://arxiv.org/html/2605.28902#bib.bib51)\), ANT\(Liet al\.,[2025b](https://arxiv.org/html/2605.28902#bib.bib49)\), AGE\(Buiet al\.,[2025](https://arxiv.org/html/2605.28902#bib.bib50)\), Realera\(Liuet al\.,[2024](https://arxiv.org/html/2605.28902#bib.bib48)\), Salun\(Fanet al\.,[2023](https://arxiv.org/html/2605.28902#bib.bib53)\), AdaVD\(Wanget al\.,[2025](https://arxiv.org/html/2605.28902#bib.bib18)\), CPE\(Leeet al\.,[2025](https://arxiv.org/html/2605.28902#bib.bib45)\), RACE\(Kimet al\.,[2024](https://arxiv.org/html/2605.28902#bib.bib46)\), Receler\(Huanget al\.,[2024](https://arxiv.org/html/2605.28902#bib.bib29)\), CURE\(Biswaset al\.,[2025](https://arxiv.org/html/2605.28902#bib.bib47)\)\. Due to practical constraints on time and computational resources, it is infeasible to include exhaustive comparisons with all existing methods\. Nevertheless, these methods provide valuable insights into efficient erasure, robustness, and controllability of diffusion models under different unlearning settings\.

It is worth noting that the notion of*orthogonality*has also been explored in prior works such as AdaVD\(Wanget al\.,[2025](https://arxiv.org/html/2605.28902#bib.bib18)\)and CURE\(Biswaset al\.,[2025](https://arxiv.org/html/2605.28902#bib.bib47)\)\. However, these methods are fundamentally different from our approach\. Specifically, AdaVD and CURE focus on*feature\-level interventions during inference*, where orthogonal projections are employed as operational tools to suppress or filter target concepts at inference time\. In contrast, our method represents a*paradigm\-level innovation for editing\-based concept erasure*\. We study orthogonal updates from a*parameter\-space geometric perspective*\. We systematically analyze why orthogonal transformations are advantageous for concept erasure, and formulate a*rigorous erasure objective directly in the parameter space*\. This formulation admits a*closed\-form solution*and provides explicit theoretical guarantees, resulting in a more general, principled, and novel framework for concept erasure\.

## Appendix CImplementation Details

Our method introduces three hyperparametersλe\\lambda\_\{e\},λ0\\lambda\_\{0\}, andλr\\lambda\_\{r\}to balance the contributions of erasure and preservation\. TheMtotalM\_\{\\mathrm\{total\}\}is formulated as

Mtotal=−λe​R​\(I−R∗\)\+W​\(λ0​K0\+λr​Cn​Cn⊤\)​W⊤,M\_\{\\mathrm\{total\}\}=\-\\lambda\_\{e\}R\(I\-R\_\{\\ast\}\)\+W\\big\(\\lambda\_\{0\}K\_\{0\}\+\\lambda\_\{r\}C\_\{n\}C\_\{n\}^\{\\top\}\\big\)W^\{\\top\},\(35\)where the first term enforces suppression of the target concept, and the second term preserves generic \(K0K\_\{0\}\) and neighboring \(CnC\_\{n\}\) concepts, weighted respectively byλ0\\lambda\_\{0\}andλr\\lambda\_\{r\}\. For all tasks, we edit theWkW\_\{k\}in the cross\-attention modules for erasure\.

Table 7:Object classes and corresponding anchors used in object erasure\.Object ClassesAirplaneAutomobileBirdCatDeerDogFrogHorseShipTruckAnchorsSkyTruckCatDogHorseCatBirdDeerAirplaneShip

Table 8:Evaluation of Object Erasure on CIFAR\-10\. We reportA​c​ceAcc\_\{e\}\(lower indicates more successful erasure\),A​c​csAcc\_\{s\}\(higher indicates better preservation of non\-target concepts\) andHoH\_\{o\}\(higher indicates better overall performance\) across the first five categories\. All values are expressed as percentages \(%\)\. The original model’s performance is included for reference\.ConceptDogFrogHorseShipTruckAverageAcce↓\\text\{Acc\}\_\{e\}\\downarrowAccs↑\\text\{Acc\}\_\{s\}\\uparrow\\cellcolororange\!40Ho↑H\_\{o\}\\uparrowAcce↓\\text\{Acc\}\_\{e\}\\downarrowAccs↑\\text\{Acc\}\_\{s\}\\uparrow\\cellcolororange\!40Ho↑H\_\{o\}\\uparrowAcce↓\\text\{Acc\}\_\{e\}\\downarrowAccs↑\\text\{Acc\}\_\{s\}\\uparrow\\cellcolororange\!40Ho↑H\_\{o\}\\uparrowAcce↓\\text\{Acc\}\_\{e\}\\downarrowAccs↑\\text\{Acc\}\_\{s\}\\uparrow\\cellcolororange\!40Ho↑H\_\{o\}\\uparrowAcce↓\\text\{Acc\}\_\{e\}\\downarrowAccs↑\\text\{Acc\}\_\{s\}\\uparrow\\cellcolororange\!40Ho↑H\_\{o\}\\uparrowAcce↓\\text\{Acc\}\_\{e\}\\downarrowAccs↑\\text\{Acc\}\_\{s\}\\uparrow\\cellcolororange\!40Ho↑H\_\{o\}\\uparrowSD v1\.498\.7498\.62\\cellcolororange\!402\.4999\.9398\.49\\cellcolororange\!400\.1499\.7898\.50\\cellcolororange\!400\.4498\.6498\.63\\cellcolororange\!402\.6898\.8998\.60\\cellcolororange\!402\.2099\.2098\.57\\cellcolororange\!401\.59CA98\.5098\.57\\cellcolororange\!402\.9699\.9298\.62\\cellcolororange\!400\.1699\.7498\.63\\cellcolororange\!400\.5298\.1898\.50\\cellcolororange\!403\.5798\.5098\.61\\cellcolororange\!402\.9698\.9798\.59\\cellcolororange\!402\.04ESD27\.0389\.75\\cellcolororange\!4080\.4912\.3288\.05\\cellcolororange\!4087\.8617\.6982\.23\\cellcolororange\!4082\.2718\.3894\.32\\cellcolororange\!4087\.5126\.1185\.35\\cellcolororange\!4079\.2120\.3187\.94\\cellcolororange\!4083\.61FMN97\.6498\.12\\cellcolororange\!404\.6191\.6094\.59\\cellcolororange\!4015\.4399\.6393\.14\\cellcolororange\!400\.7497\.9798\.21\\cellcolororange\!403\.9897\.6497\.86\\cellcolororange\!404\.6196\.9096\.38\\cellcolororange\!406\.01UCE13\.2298\.69\\cellcolororange\!4092\.3520\.8698\.32\\cellcolororange\!4087\.694\.6698\.32\\cellcolororange\!4096\.916\.1398\.41\\cellcolororange\!4096\.0920\.5898\.16\\cellcolororange\!4087\.8013\.0998\.38\\cellcolororange\!4089\.24MACE11\.0796\.77\\cellcolororange\!4092\.6811\.4597\.75\\cellcolororange\!4092\.924\.8997\.48\\cellcolororange\!4096\.288\.5898\.56\\cellcolororange\!4094\.867\.2998\.38\\cellcolororange\!4095\.468\.6697\.79\\cellcolororange\!4094\.46RECE17\.7298\.51\\cellcolororange\!4089\.6711\.6998\.43\\cellcolororange\!4093\.109\.9898\.10\\cellcolororange\!4093\.8921\.9298\.41\\cellcolororange\!4087\.0730\.4298\.43\\cellcolororange\!4081\.5318\.3598\.38\\cellcolororange\!4089\.24SPEED1\.3298\.43\\cellcolororange\!4098\.555\.6398\.18\\cellcolororange\!4096\.243\.8497\.83\\cellcolororange\!4096\.990\.7898\.44\\cellcolororange\!4098\.837\.4998\.55\\cellcolororange\!4095\.433\.8198\.29\\cellcolororange\!4097\.23Ours0\.7498\.47\\cellcolororange\!4098\.861\.2798\.45\\cellcolororange\!4098\.590\.3598\.39\\cellcolororange\!4099\.020\.1598\.64\\cellcolororange\!4099\.241\.3898\.53\\cellcolororange\!4098\.570\.7898\.50\\cellcolororange\!4098\.86

Table 9:Quantitative experiments on implicit concept erasure with FLUX\.1 dev\. Lower is better for I2P, Ring\-A\-Bell, and FID; higher is better for CS\.MethodI2P↓\\downarrowRing\-A\-Bell↓\\downarrowMSCOCOCS↑\\uparrowFID↓\\downarrowFLUX0\.440\.9325\.64–UCE0\.300\.5625\.6330\.41ESD0\.230\.6124\.9755\.43EraseFlow0\.210\.4424\.7240\.64Ours0\.230\.4325\.5526\.29

Table 10:Ablation study on selection of anchor concepts on CIFAR\-10\.ConceptAirplaneAutomobileBirdCatDeerAverageAcce↓\\text\{Acc\}\_\{e\}\\downarrowAccs↑\\text\{Acc\}\_\{s\}\\uparrow\\cellcolororange\!40Ho↑H\_\{o\}\\uparrowAcce↓\\text\{Acc\}\_\{e\}\\downarrowAccs↑\\text\{Acc\}\_\{s\}\\uparrow\\cellcolororange\!40Ho↑H\_\{o\}\\uparrowAcce↓\\text\{Acc\}\_\{e\}\\downarrowAccs↑\\text\{Acc\}\_\{s\}\\uparrow\\cellcolororange\!40Ho↑H\_\{o\}\\uparrowAcce↓\\text\{Acc\}\_\{e\}\\downarrowAccs↑\\text\{Acc\}\_\{s\}\\uparrow\\cellcolororange\!40Ho↑H\_\{o\}\\uparrowAcce↓\\text\{Acc\}\_\{e\}\\downarrowAccs↑\\text\{Acc\}\_\{s\}\\uparrow\\cellcolororange\!40Ho↑H\_\{o\}\\uparrowAcce↓\\text\{Acc\}\_\{e\}\\downarrowAccs↑\\text\{Acc\}\_\{s\}\\uparrow\\cellcolororange\!40Ho↑H\_\{o\}\\uparrowEmpty9\.2297\.63\\cellcolororange\!4094\.0812\.6197\.13\\cellcolororange\!4092\.003\.1297\.90\\cellcolororange\!4097\.385\.0398\.41\\cellcolororange\!4096\.666\.9098\.47\\cellcolororange\!4095\.717\.3897\.91\\cellcolororange\!4095\.17Random7\.7198\.62\\cellcolororange\!4095\.3510\.9198\.56\\cellcolororange\!4093\.595\.6297\.36\\cellcolororange\!4095\.852\.4998\.55\\cellcolororange\!4098\.024\.2898\.32\\cellcolororange\!4097\.006\.2098\.28\\cellcolororange\!4095\.96Ours6\.8998\.85\\cellcolororange\!4095\.898\.7999\.07\\cellcolororange\!4094\.983\.4098\.30\\cellcolororange\!4097\.440\.1598\.71\\cellcolororange\!4099\.283\.8198\.45\\cellcolororange\!4097\.314\.6198\.68\\cellcolororange\!4096\.98

Table 11:Ablation study on selection of anchor concepts in multi\-concept erasure scenarios\.AnchorErase 100 CelebritiesMSCOCOAcce↓\\text\{Acc\}\_\{e\}\\downarrowAccs↑\\text\{Acc\}\_\{s\}\\uparrow\\cellcolororange\!40Ho↑H\_\{o\}\\uparrowCS↑\\uparrowFID↓\\downarrowEmpty4\.2891\.58\\cellcolororange\!4093\.6026\.0418\.64Car4\.4894\.68\\cellcolororange\!4095\.1025\.8518\.18Cat4\.2492\.20\\cellcolororange\!4093\.9525\.9218\.45Person3\.8594\.50\\cellcolororange\!4095\.3226\.4018\.29Ours3\.4494\.42\\cellcolororange\!4095\.4826\.3318\.33

Table 12:Ablation study on hyperparameters in multi\-concept erasure scenarios\.λe\\lambda\_\{e\}λ0\\lambda\_\{0\}λr\\lambda\_\{r\}Erase 100 CelebritiesMSCOCOAcce↓\\text\{Acc\}\_\{e\}\\downarrowAccs↑\\text\{Acc\}\_\{s\}\\uparrow\\cellcolororange\!40Ho↑H\_\{o\}\\uparrowCS↑\\uparrowFID↓\\downarrow9005033\.4494\.42\\cellcolororange\!4095\.4826\.3318\.3310005032\.8593\.25\\cellcolororange\!4095\.1626\.2919\.0112005032\.0389\.73\\cellcolororange\!4093\.6726\.2419\.696005037\.7093\.48\\cellcolororange\!4092\.8926\.4117\.619002033\.0594\.01\\cellcolororange\!4095\.4626\.1221\.059008035\.6793\.85\\cellcolororange\!4094\.0926\.4417\.559005011\.8489\.57\\cellcolororange\!4093\.6726\.3418\.639005056\.5294\.49\\cellcolororange\!4093\.9826\.3718\.34

### C\.1Experimental Setup Details

##### Single\-concept erasure\.

For object erasure, we conduct experiments on the ten objects classes of the CIFAR\-10 dataset, as detailed in Tab\.[7](https://arxiv.org/html/2605.28902#A3.T7)\. For erasure of each class, we edit the original model withλe=1000\\lambda\_\{e\}=1000,λ0=50\\lambda\_\{0\}=50, andλr=1\\lambda\_\{r\}=1\. In evaluation, 200 images are generated using the prompt “a photo of theclass” for the target concept and the nine remaining concepts\. For artistic style erasure, we conduct experiments on three style: Van Gogh, Monet and Picasso\. we edit the original model withλe=500\\lambda\_\{e\}=500,λ0=100\\lambda\_\{0\}=100, andλr=1\\lambda\_\{r\}=1\. In evaluation, 100 images are generated with the prompt “a painting in the style ofstyle” for the target style\.

##### Multi\-concept erasure\.

Following the experiment setup of SPEED\(Liet al\.,[2025c](https://arxiv.org/html/2605.28902#bib.bib21)\), we introduce a dataset of 200 celebrities, whose portraits generated by SD v1\.4 can be reliably recognized with high accuracy by the GIPHY Celebrity Detector \(GCD\)\(Hastyet al\.,[2019](https://arxiv.org/html/2605.28902#bib.bib42)\)\. We divide this dataset into an erasure set containing 100 celebrities, and a retain set of 100 other celebrities\. We provide the full lists for both sets in Tab\.[13](https://arxiv.org/html/2605.28902#A4.T13)\. We conduct experiments erasing 10, 50, and 100 celebrities from the predefined erasure set, while including the full retain set\. For erasing 10 celebrities, we set the hyperparameters asλe=1500\\lambda\_\{e\}=1500,λ0=50\\lambda\_\{0\}=50, andλr=3\\lambda\_\{r\}=3\. For erasing 50 celebrities, we setλe=1200\\lambda\_\{e\}=1200,λ0=50\\lambda\_\{0\}=50, andλr=3\\lambda\_\{r\}=3For the 100\-celebrity erasure scenario, we setλe=900\\lambda\_\{e\}=900,λ0=50\\lambda\_\{0\}=50, andλr=3\\lambda\_\{r\}=3\. In evaluation, we use five celebrity templates: “a portrait ofcelebrity”, “a sketch ofcelebrity”, “an oil painting ofcelebrity”, “celebrityin an official photo”, and “an image capturingcelebrityat a public event”\. We generate 500 images for each set\. For non\-target concepts, we generate 1 image per template for each of the 100 concepts\. For target concepts, the number of images per template is adjusted to maintain 500 images overall\.

##### Implementation of baselines\.

We compare OCE with seven baselines:444[https://github\.com/nupurkmr9/concept\-ablation](https://github.com/nupurkmr9/concept-ablation)CA\(Kumariet al\.,[2023](https://arxiv.org/html/2605.28902#bib.bib13)\),555[https://github\.com/rohitgandikota/erasing](https://github.com/rohitgandikota/erasing)ESD\(Gandikotaet al\.,[2023](https://arxiv.org/html/2605.28902#bib.bib14)\),666[https://github\.com/SHI\-Labs/Forget\-Me\-Not](https://github.com/SHI-Labs/Forget-Me-Not)FMN\(Zhanget al\.,[2024a](https://arxiv.org/html/2605.28902#bib.bib28)\),777[https://github\.com/rohitgandikota/unified\-concept\-editing](https://github.com/rohitgandikota/unified-concept-editing)UCE\(Gandikotaet al\.,[2024](https://arxiv.org/html/2605.28902#bib.bib19)\),888[https://github\.com/Shilin\-LU/MACE](https://github.com/Shilin-LU/MACE)MACE\(Luet al\.,[2024](https://arxiv.org/html/2605.28902#bib.bib16)\),999[https://github\.com/CharlesGong12/RECE](https://github.com/CharlesGong12/RECE)RECE\(Gonget al\.,[2024](https://arxiv.org/html/2605.28902#bib.bib20)\),101010[https://github\.com/ouxiang\-li/speed](https://github.com/ouxiang-li/speed)SPEED\(Liet al\.,[2025c](https://arxiv.org/html/2605.28902#bib.bib21)\)\. All the baselines are implemented using the default configurations from their official repositories\.

## Appendix DAdditional Experiments

### D\.1Additional Evaluation Results of Object Erasure on CIFAR\-10

We further present the results of erasing the remaining five object classes of the CIFAR\-10 dataset in Tab\.[8](https://arxiv.org/html/2605.28902#A3.T8)\. OCE achieves the highestHoH\_\{o\}in all erasure of these five class of objects\. This demonstrates that OCE consistently maintains a superior balance between effective concept erasure and preservation of non\-target generation capacity\.

### D\.2Additional Experiments on FLUX

We conducted additional quantitative experiments on implicit concept erasure with FLUX\.1 dev\. We generate images with the I2P and Ring\-A\-Bell dataset for evaluation\. The results are summarized in Tab\.[9](https://arxiv.org/html/2605.28902#A3.T9)\. Compared with baseline methods, OCE achieves a better balance between concept erasure and prior preservation, demonstrating its strong transferability to different model architectures\. For all baseline methods, we use the official implementations with default settings for fair comparison\.

### D\.3Ablation on Selection of Anchor Concepts

For anchor selection, we adopt a heuristic strategy: anchor concepts are chosen to belong to the same high\-level category as the target concept, sharing some similarity but also exhibiting noticeable differences\. This ensures smooth and stable concept erasure\. To further investigate the impact of anchor choice, we conducted experiments on CIFAR\-10 \(in Tab\.[10](https://arxiv.org/html/2605.28902#A3.T10)\) and multi\-concept erasure scenarios \(in Tab\.[11](https://arxiv.org/html/2605.28902#A3.T11)\)\. We compared our heuristic selections with random anchors and the empty anchor\. The performance varies under different anchor settings, and our heuristic strategy consistently improves erasure effectiveness\.

### D\.4Ablation on the Hyperparameters

We conduct an ablation study on the trade\-off parametersλe\\lambda\_\{e\},λ0\\lambda\_\{0\}, andλr\\lambda\_\{r\}in the multi\-concept erasure scenario\. The results are summarized in the Tab\.[12](https://arxiv.org/html/2605.28902#A3.T12)\. From the table, we can observe:λe\\lambda\_\{e\}mainly affects the erasure effectiveness \(Acce\\text\{Acc\}\_\{e\}\),λ0\\lambda\_\{0\}mainly affects the overall generation quality \(FID\(COCO\)\),andλp\\lambda\_\{p\}mainly affects the preservation of neighboring concepts \(Accr\\text\{Acc\}\_\{r\}\)\.These findings are consistent with our design\. In our experiments, we chooseλe=900\\lambda\_\{e\}=900,λ0=50\\lambda\_\{0\}=50, andλr=3\\lambda\_\{r\}=3to achieve a balanced trade\-off between erasure and preservation\.

### D\.5Additional Qualitative Results

In this section, we provide more comprehensive qualitative results across a variety of concept erasure tasks, including object erasure \(Fig\.[5](https://arxiv.org/html/2605.28902#A4.F5)\), style erasure \(Fig\.[6](https://arxiv.org/html/2605.28902#A4.F6)\), celebrity erasure \(Fig\.[7](https://arxiv.org/html/2605.28902#A4.F7)\) and implicit concept erasure \(Fig\.[8](https://arxiv.org/html/2605.28902#A4.F8)\)\.

Table 13:Evaluation setup for multi\-concept erasure\. The dataset contains an Erasure Set and a Retain Set of celebrities\.GroupNumberAnchorConceptCelebrityErasureSet10‘celebrity’‘Adam Driver’, ‘Adriana Lima’, ‘Amber Heard’, ‘Amy Adams’, ‘Andrew Garfield’, ‘Angelina Jolie’, ‘Anjelica Huston’, ‘Anna Faris’, ‘Anna Kendrick’, ‘Anne Hathaway’50‘celebrity’‘Adam Driver’, ‘Adriana Lima’, ‘Amber Heard’, ‘Amy Adams’, ‘Andrew Garfield’, ‘Angelina Jolie’, ‘Anjelica Huston’, ‘Anna Faris’, ‘Anna Kendrick’, ‘Anne Hathaway’, ‘Arnold Schwarzenegger’, ‘Barack Obama’, ‘Beth Behrs’, ‘Bill Clinton’, ‘Bob Dylan’, ‘Bob Marley’, ‘Bradley Cooper’, ‘Bruce Willis’, ‘Bryan Cranston’, ‘Cameron Diaz’, ‘Channing Tatum’, ‘Charlie Sheen’, ‘Charlize Theron’, ‘Chris Evans’, ‘Chris Hemsworth’, ‘Chris Pine’, ‘Chuck Norris’, ‘Courteney Cox’, ‘Demi Lovato’, ‘Drake’, ‘Drew Barrymore’, ‘Dwayne Johnson’, ‘Ed Sheeran’, ‘Elon Musk’, ‘Elvis Presley’, ‘Emma Stone’, ‘Frida Kahlo’, ‘George Clooney’, ‘Glenn Close’, ‘Gwyneth Paltrow’, ‘Harrison Ford’, ‘Hillary Clinton’, ‘Hugh Jackman’, ‘Idris Elba’, ‘Jake Gyllenhaal’, ‘James Franco’, ‘Jared Leto’, ‘Jason Momoa’, ‘Jennifer Aniston’, ‘Jennifer Lawrence’100‘celebrity’‘Adam Driver’, ‘Adriana Lima’, ‘Amber Heard’, ‘Amy Adams’, ‘Andrew Garfield’, ‘Angelina Jolie’, ‘Anjelica Huston’, ‘Anna Faris’, ‘Anna Kendrick’, ‘Anne Hathaway’, ‘Arnold Schwarzenegger’, ‘Barack Obama’, ‘Beth Behrs’, ‘Bill Clinton’, ‘Bob Dylan’, ‘Bob Marley’, ‘Bradley Cooper’, ‘Bruce Willis’, ‘Bryan Cranston’, ‘Cameron Diaz’, ‘Channing Tatum’, ‘Charlie Sheen’, ‘Charlize Theron’, ‘Chris Evans’, ‘Chris Hemsworth’, ‘Chris Pine’, ‘Chuck Norris’, ‘Courteney Cox’, ‘Demi Lovato’, ‘Drake’, ‘Drew Barrymore’, ‘Dwayne Johnson’, ‘Ed Sheeran’, ‘Elon Musk’, ‘Elvis Presley’, ‘Emma Stone’, ‘Frida Kahlo’, ‘George Clooney’, ‘Glenn Close’, ‘Gwyneth Paltrow’, ‘Harrison Ford’, ‘Hillary Clinton’, ‘Hugh Jackman’, ‘Idris Elba’, ‘Jake Gyllenhaal’, ‘James Franco’, ‘Jared Leto’, ‘Jason Momoa’, ‘Jennifer Aniston’, ‘Jennifer Lawrence’, ‘Jennifer Lopez’, ‘Jeremy Renner’, ‘Jessica Biel’, ‘Jessica Chastain’, ‘John Oliver’, ‘John Wayne’, ‘Johnny Depp’, ‘Julianne Hough’, ‘Justin Timberlake’, ‘Kate Bosworth’, ‘Kate Winslet’, ‘Leonardo Dicaprio’, ‘Margot Robbie’, ‘Mariah Carey’, ‘Melania Trump’, ‘Meryl Streep’, ‘Mick Jagger’, ‘Mila Kunis’, ‘Milla Jovovich’, ‘Morgan Freeman’, ‘Nick Jonas’, ‘Nicolas Cage’, ‘Nicole Kidman’, ‘Octavia Spencer’, ‘Olivia Wilde’, ‘Oprah Winfrey’, ‘Paul Mccartney’, ‘Paul Walker’, ‘Peter Dinklage’, ‘Philip Seymour Hoffman’, ‘Reese Witherspoon’, ‘Richard Gere’, ‘Ricky Gervais’, ‘Rihanna’, ‘Robin Williams’, ‘Ronald Reagan’, ‘Ryan Gosling’, ‘Ryan Reynolds’, ‘Shia Labeouf’, ‘Shirley Temple’, ‘Spike Lee’, ‘Stan Lee’, ‘Theresa May’, ‘Tom Cruise’, ‘Tom Hanks’, ‘Tom Hardy’, ‘Tom Hiddleston’, ‘Whoopi Goldberg’, ‘Zac Efron’, ‘Zayn Malik’RetainSet10, 50, and 100\-‘Aaron Paul’, ‘Alec Baldwin’, ‘Amanda Seyfried’, ‘Amy Poehler’, ‘Amy Schumer’, ‘Amy Winehouse’, ‘Andy Samberg’, ‘Aretha Franklin’, ‘Avril Lavigne’, ‘Aziz Ansari’, ‘Barry Manilow’, ‘Ben Affleck’, ‘Ben Stiller’, ‘Benicio Del Toro’, ‘Bette Midler’, ‘Betty White’, ‘Bill Murray’, ‘Bill Nye’, ‘Britney Spears’, ‘Brittany Snow’, ‘Bruce Lee’, ‘Burt Reynolds’, ‘Charles Manson’, ‘Christie Brinkley’, ‘Christina Hendricks’, ‘Clint Eastwood’, ‘Countess Vaughn’, ‘Dakota Johnson’, ‘Dane Dehaan’, ‘David Bowie’, ‘David Tennant’, ‘Denise Richards’, ‘Doris Day’, ‘Dr Dre’, ‘Elizabeth Taylor’, ‘Emma Roberts’, ‘Fred Rogers’, ‘Gal Gadot’, ‘George Bush’, ‘George Takei’, ‘Gillian Anderson’, ‘Gordon Ramsey’, ‘Halle Berry’, ‘Harry Dean Stanton’, ‘Harry Styles’, ‘Hayley Atwell’, ‘Heath Ledger’, ‘Henry Cavill’, ‘Jackie Chan’, ‘Jada Pinkett Smith’, ‘James Garner’, ‘Jason Statham’, ‘Jeff Bridges’, ‘Jennifer Connelly’, ‘Jensen Ackles’, ‘Jim Morrison’, ‘Jimmy Carter’, ‘Joan Rivers’, ‘John Lennon’, ‘Johnny Cash’, ‘Jon Hamm’, ‘Judy Garland’, ‘Julianne Moore’, ‘Justin Bieber’, ‘Kaley Cuoco’, ‘Kate Upton’, ‘Keanu Reeves’, ‘Kim Jong Un’, ‘Kirsten Dunst’, ‘Kristen Stewart’, ‘Krysten Ritter’, ‘Lana Del Rey’, ‘Leslie Jones’, ‘Lily Collins’, ‘Lindsay Lohan’, ‘Liv Tyler’, ‘Lizzy Caplan’, ‘Maggie Gyllenhaal’, ‘Matt Damon’, ‘Matt Smith’, ‘Matthew Mcconaughey’, ‘Maya Angelou’, ‘Megan Fox’, ‘Mel Gibson’, ‘Melanie Griffith’, ‘Michael Cera’, ‘Michael Ealy’, ‘Natalie Portman’, ‘Neil Degrasse Tyson’, ‘Niall Horan’, ‘Patrick Stewart’, ‘Paul Rudd’, ‘Paul Wesley’, ‘Pierce Brosnan’, ‘Prince’, ‘Queen Elizabeth’, ‘Rachel Dratch’, ‘Rachel Mcadams’, ‘Reba Mcentire’, ‘Robert De Niro’

![Refer to caption](https://arxiv.org/html/2605.28902v1/x5.png)Figure 5:Additional qualitative results on Object Erasure\.![Refer to caption](https://arxiv.org/html/2605.28902v1/x6.png)Figure 6:Additional qualitative results on Artistic Style Erasure\.![Refer to caption](https://arxiv.org/html/2605.28902v1/x7.png)Figure 7:Additional qualitative results on Celebrity Erasure\.![Refer to caption](https://arxiv.org/html/2605.28902v1/x8.png)Figure 8:Additional qualitative results on Implicit Concept Erasure on I2P\.

Similar Articles

Hoeffding Concept Bottleneck Models with Applications to Overhead Images

arXiv cs.LG

Introduces Hoeffding Concept Bottleneck Models (HCBM), a nonlinear and sparse aggregation of concept scores using Hoeffding functional decomposition of gradient-boosted trees, for improved explainability and accuracy in classification and object detection tasks, with applications to overhead images.

Drifting Objectives for Refining Discrete Diffusion Language Models

arXiv cs.CL

This paper introduces TokenDrift, a drifting objective that refines discrete diffusion language models by lifting categorical predictions to a continuous semantic space for anti-symmetric drifting, significantly improving generation quality under a fixed number of denoising steps.