eCNNTO: A Highly Generalizable ConvNet for Accelerating Topology Optimization

arXiv cs.AI 06/20/26, 04:00 AM Papers
cnn topology-optimization acceleration generalization deep-learning engineering-design small-data
Summary
This paper proposes eCNNTO, a CNN with residual connections to accelerate density-based topology optimization by predicting near-optimal densities from early iteration histories, achieving up to 97% reduction in iterations and strong generalization across different boundary conditions, geometries, and mesh resolutions.
arXiv:2606.19921v1 Announce Type: new Abstract: This work proposes an element-based Convolutional Neural Network (CNN) to accelerate density-based Topology Optimization (TO), termed eCNNTO. TO generally undergoes a large number of iterations, where finite element analysis is performed in every iteration, leading to the efficiency bottleneck especially when dense meshes are used to achieve high-resolution designs. To address this limitation, eCNNTO is proposed to build upon Kallioras et al. (2020), where a Deep Belief Network (DBN) was trained for every element to predict its near-optimal density from its early history, thereby skipping the great majority of iterations and significantly accelerating the TO procedure. However, the method lacks spatial correlations among neighboring elements and may lead to disconnected features in the final structure. The proposed method employs CNN with residual connections to address this issue. On top of it, a novel training strategy is introduced to further enhance the optimization efficiency, where the training dataset consists of the final stage density histories rather than early ones. This change can also help reduce the required training data size. eCNNTO requires only a small dataset to train and yet it can be generalized to problems with largely different boundary conditions, loading cases, design domain geometries, mesh resolutions, as well as non-design domains. In the end, the generalization capabilities and efficiency of eCNNTO are demonstrated through a variety of examples in two and three dimensions, achieving up to 90% and 97% reduction of iterations, respectively.
Cached at: 06/20/26, 02:34 PM
# eCNNTO: A Highly Generalizable ConvNet for Accelerating Topology Optimization
Source: [https://arxiv.org/html/2606.19921](https://arxiv.org/html/2606.19921)
###### Abstract

This work proposes an element-based Convolutional Neural Network (CNN) to accelerate density-based Topology Optimization (TO), termed eCNNTO. TO generally undergoes a large number of iterations, where finite element analysis is performed in every iteration, leading to the efficiency bottleneck especially when dense meshes are used to achieve high-resolution designs. To address this limitation, eCNNTO is proposed to build upon Kallioras et al. ([2020](https://arxiv.org/html/2606.19921#bib.bib14)), where a Deep Belief Network (DBN) was trained for every element to predict its near-optimal density from its early history, thereby skipping the great majority of iterations and significantly accelerating the TO procedure. However, the method lacks spatial correlations among neighboring elements and may lead to disconnected features in the final structure. The proposed method employs CNN with residual connections to address this issue. On top of it, a novel training strategy is introduced to further enhance the optimization efficiency, where the training dataset consists of the final stage density histories rather than early ones. This change can also help reduce the required training data size. eCNNTO requires only a small dataset to train and yet it can be generalized to problems with largely different boundary conditions, loading cases, design domain geometries, mesh resolutions, as well as non-design domains. In the end, the generalization capabilities and efficiency of eCNNTO are demonstrated through a variety of examples in two and three dimensions, achieving up to 90% and 97% reduction of iterations, respectively.

###### keywords:

Strong generalization , Topology optimization acceleration , CNN , Small data

Affiliation: Global College, Shanghai Jiao Tong University, 800 Dongchuan Road, Shanghai 200240, China

## 1 Introduction

Topology Optimization (TO) is a structural design methodology that determines the optimal material distribution within a prescribed design domain to achieve the best structural performance under given boundary conditions and constraints. Due to its high design flexibility and ability to generate high-performance structures, topology optimization has been widely applied in various fields such as aerospace engineering (Mekki et al., [2021](https://arxiv.org/html/2606.19921#bib.bib18)), bioengineering (Xue et al., [2020](https://arxiv.org/html/2606.19921#bib.bib37); Ahadi et al., [2024](https://arxiv.org/html/2606.19921#bib.bib2)), and automotive design (Zhang et al., [2021](https://arxiv.org/html/2606.19921#bib.bib38); Su et al., [2022](https://arxiv.org/html/2606.19921#bib.bib28)).

A variety of numerical methods of TO have been developed, such as Solid Isotropic Material with Penalization (SIMP) (Bendsøe, [1989](https://arxiv.org/html/2606.19921#bib.bib6); Andreassen et al., [2011](https://arxiv.org/html/2606.19921#bib.bib3); Wang et al., [2025](https://arxiv.org/html/2606.19921#bib.bib30)), Evolutionary Structural Optimization (ESO) (Xie and Steven, [1993](https://arxiv.org/html/2606.19921#bib.bib35)), Bi-directional Evolutionary Structural Optimization (BESO) (Xia et al., [2018](https://arxiv.org/html/2606.19921#bib.bib34)), Level-Set Methods (LSM) (Wang et al., [2003](https://arxiv.org/html/2606.19921#bib.bib31)), Moving Morphable Components (MMC) (Guo et al., [2014](https://arxiv.org/html/2606.19921#bib.bib10)) and phase-field-based methods (Wang et al., [2024](https://arxiv.org/html/2606.19921#bib.bib32); Sheng and Wei, [2025](https://arxiv.org/html/2606.19921#bib.bib22)). Among these approaches, SIMP has been extensively studied and widely adopted because of its mathematical tractability, implementation simplicity, and practical robustness. However, SIMP is computationally expensive because it requires dense meshes to achieve high-resolution designs, which significantly increase computational cost due to repeated finite element analysis throughout the optimization. It becomes particularly prohibitive for large-scale problems (Aage et al., [2017](https://arxiv.org/html/2606.19921#bib.bib1)).

With the rapid development of artificial intelligence, neural network-based approaches have been increasingly explored to accelerate topology optimization. Existing learning-based methods can be broadly classified into global methods and local methods. Global methods aim to directly generate final topologies from problem settings (e.g., boundary conditions and loading cases), essentially treating the problem as an image-to-image mapping task. Early work by Sosnovik and Oseledets ([2019](https://arxiv.org/html/2606.19921#bib.bib27)) pioneered this approach using 2D Convolutional Neural Networks (CNNs), where a convolutional encoder-decoder architecture was used to map intermediate densities and their gradients to final structures. Nie et al. ([2021](https://arxiv.org/html/2606.19921#bib.bib19)) adopted Generative Adversarial Networks (GAN) to generate final structures using loading and boundary conditions as the input. Behzadi and Ilieş ([2021](https://arxiv.org/html/2606.19921#bib.bib5)) replaced the network with the conditional GAN (cGAN) to improve the generalization capability to unseen boundary and external loading conditions. Rather than directly generating optimized structures, Qian and Ye ([2021](https://arxiv.org/html/2606.19921#bib.bib21)) proposed a dual-network surrogate for forward and sensitivity analyses to accelerate the optimization process. Xing and Tong ([2023](https://arxiv.org/html/2606.19921#bib.bib36)) introduced an autonomous online learning strategy, where a neural network is trained on-the-fly using the data collected from the executed SIMP iterations, and then the trained surrogate replaces conventional sensitivity analysis to accelerate the optimization.
Though these approaches can significantly reduce the optimization time, they often require large datasets and exhibit limited robustness (Woldseth et al., [2022](https://arxiv.org/html/2606.19921#bib.bib33); Banga et al., [2018](https://arxiv.org/html/2606.19921#bib.bib4); Sosnovik and Oseledets, [2019](https://arxiv.org/html/2606.19921#bib.bib27)), where small variations in input conditions can lead to disconnected structures or violations of physical constraints.

To address the robustness issue, several neural nets have been proposed to predict a near-optimal initial design, followed by a correction stage via the conventional SIMP. Padhi et al. ([2024](https://arxiv.org/html/2606.19921#bib.bib20)) proposed a conditional invertible neural network that predicts near-optimal structures with a few SIMP iterations as the input, which can reduce the number of iterations by 40%. Lim et al. ([2024](https://arxiv.org/html/2606.19921#bib.bib17)) proposed a CNN to accelerate SIMP-based optimization and generate high-resolution near-optimal structures. Joo et al. ([2024](https://arxiv.org/html/2606.19921#bib.bib12)) proposed a dynamic graph-based neural network to accelerate the convergence of topology optimization for unstructured meshes. Despite their demonstrated efficiency, the practical application of these global methods is hindered by the prohibitive cost of preparing training datasets and their limited generalization capabilities (Woldseth et al., [2022](https://arxiv.org/html/2606.19921#bib.bib33)).

In contrast, local methods focus on learning the evolution of element-wise densities. As structural variations follow relatively regular patterns at the local level, these methods typically require fewer training samples and demonstrate improved generalization capabilities. Kallioras et al. ([2020](https://arxiv.org/html/2606.19921#bib.bib14)) proposed the Deep Learning Assisted Topology OPtimization (DLTOP), which uses the early history of element density to predict the near-optimal density, followed by a few SIMP iterations to ensure structural connectivity and physical validity. While DLTOP demonstrated a significant acceleration (more than 50% reduction of iterations), it predicts each element independently and neglects the spatial correlations among neighboring elements. This lack of spatial context may lead to physically defective features, such as corner contacts and disconnections. To address this issue, Joo et al. ([2021](https://arxiv.org/html/2606.19921#bib.bib13)) incorporated the spatio-temporal information using Convolutional Long Short-Term Memory (ConvLSTM) networks through a so-called unit module, which is a patch of $64 \times 64$ elements. Their training strategy relied on optimizing unit modules under varying loading conditions, which requires high computational cost to prepare the training datasets and increases the model complexity.

To address the limitations of existing local approaches, this work, termed eCNNTO, builds on top of DLTOP (Kallioras et al., [2020](https://arxiv.org/html/2606.19921#bib.bib14)) and proposes a CNN-based framework for accelerating SIMP-based topology optimization. CNNs are used to explicitly account for spatial correlations among neighboring elements. They are naturally suited for this task as their convolution operations restore the continuum assumption of structures and the filtering scheme used in SIMP. By exploiting spatial correlations, eCNNTO can suppress the occurrence of defective features, such as isolated pieces, corner-contact parts and intermediate-density regions. Furthermore, a novel training strategy is proposed, where the training datasets are prepared using element densities from the final stage, rather than those from the early stage in DLTOP. This way, further speedup can be achieved and the size of required training data can be reduced. Moreover, eCNNTO shows strong generalization capabilities. It can accommodate varying boundary conditions, loading cases, design domain geometries, mesh resolutions, and non-design domains.

The rest of the paper is organized as follows: Topology optimization based on SIMP is introduced in Section [2](https://arxiv.org/html/2606.19921#S2). eCNNTO is presented in [Section 3](https://arxiv.org/html/2606.19921#S3) in detail, including the network architecture, dataset construction and the novel training strategy. Section [4](https://arxiv.org/html/2606.19921#S4) presents a variety of numerical examples to demonstrate the advantages of eCNNTO. Finally, Section [5](https://arxiv.org/html/2606.19921#S5) draws conclusions and suggests directions for future research.

## 2 Topology Optimization and Deep Learning Assisted Acceleration

### 2.1 Solid Isotropic Material with Penalization (SIMP)

In a typical Topology Optimization \(TO\) problem, an initial material distribution is assumed and then iteratively updated via a certain gradient\-based optimization method, which relies on the state of the problem\. Such state is obtained by solving certain governing equations through Finite Element Analysis \(FEA\)\. An optimized structure is obtained once a certain convergence criterion is satisfied\.

Solid Isotropic Material with Penalization (SIMP) has been widely adopted in TO. It adopts a continuous density between 0 (indicating void) and 1 (pure material) to represent the material distribution, where intermediate values are penalized towards the two extremes (i.e., 0 and 1). The prescribed domain where material can be possibly distributed is called a design domain. It is discretized into a mesh, where every element is assigned a density value $\rho_e \in [0,1]$. Central to SIMP is the rescaling of the key material parameters (e.g., Young's modulus $E_e$) using $\rho_e$. Specifically, $E_e$ is defined through the penalization strategy:

$$E_e(\rho_e) = E_{\text{min}} + \rho_e^{p}\,(E_{\text{max}} - E_{\text{min}}), \tag{1}$$

where $E_{\text{max}}$ is the Young's modulus of pure material, $E_{\text{min}}$ is a threshold Young's modulus to avoid singular stiffness matrices, and $p$ is a penalization factor (usually $p=3$) used to penalize the behavior of intermediate density values.
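The interpolation in Equation 1 can be sketched in a few lines of Python. This is only an illustration: the function name and the default values of `e_max` and `e_min` are assumptions chosen for the example, not settings taken from the paper.

```python
def simp_modulus(rho, e_max=1.0, e_min=1e-9, p=3.0):
    """Penalized Young's modulus of one element, following Equation 1.

    The defaults for e_max and e_min are illustrative assumptions.
    """
    if not 0.0 <= rho <= 1.0:
        raise ValueError("density must lie in [0, 1]")
    return e_min + rho ** p * (e_max - e_min)


# With p = 3, an element at half density keeps only about 1/8 of the stiffness,
# which is what pushes the optimizer towards 0 or 1.
for rho in (0.0, 0.5, 1.0):
    print(rho, simp_modulus(rho))
```

Because the stiffness of an intermediate density is much lower than its share of material, such densities are uneconomical under the volume constraint.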

In this work, we use the classical minimum compliance as the model problem to introduce the proposed method\. It is formulated as follows:

$$
\begin{aligned}
\min_{\boldsymbol{\rho}}:\quad & C(\boldsymbol{\rho}) = \mathbf{U}(\boldsymbol{\rho})^{\mathrm{T}}\,\mathbf{K}(\boldsymbol{\rho})\,\mathbf{U}(\boldsymbol{\rho}), \\
\text{s.t.}:\quad & \mathbf{K}(\boldsymbol{\rho})\,\mathbf{U}(\boldsymbol{\rho}) = \mathbf{F}, \\
& \frac{V(\boldsymbol{\rho})}{V_0} = V_f, \\
& 0 \leq \rho_e \leq 1, \quad e = 1, \ldots, N,
\end{aligned}
\tag{2}
$$

where $C$ denotes the structural compliance, $\mathbf{U}$, $\mathbf{K}$ and $\mathbf{F}$ are the global displacement vector, the stiffness matrix, and the global force vector, respectively, $V(\boldsymbol{\rho})$ is the volume of the target structure, $V_0$ is the volume of the design domain, $V_f$ is the prescribed volume fraction, and $N$ is the number of elements. Note that $\mathbf{U}$, $\mathbf{K}$ and $\mathbf{F}$ are a result of applying FEA to solve the linear elasticity problem.

In [Equation 2](https://arxiv.org/html/2606.19921#S2.E2), element densities $\rho_e$ are the design variables. Their final values lead to the optimized structure. The optimization process is typically initialized with a constant density field corresponding to $V_f$. Given the current $\rho_e$, $\mathbf{K}\mathbf{U}=\mathbf{F}$ is solved to find the current state of the problem (i.e., $\mathbf{U}$), with which the compliance and its gradients with respect to $\rho_e$ can be evaluated. The constrained optimization problem is then solved using gradient-based optimization algorithms, such as the Optimality Criteria (OC) (Sigmund, [2001](https://arxiv.org/html/2606.19921#bib.bib23); Bendsøe and Sigmund, [2004](https://arxiv.org/html/2606.19921#bib.bib7)) and the Method of Moving Asymptotes (MMA) (Svanberg, [1987](https://arxiv.org/html/2606.19921#bib.bib29)). The optimization procedure terminates when a stopping criterion is met. For example, the maximum change of element densities reaches a given threshold $\epsilon$,

$$\max_{e \in \{1, \ldots, N\}} |\Delta \rho_e| \leq \epsilon. \tag{3}$$
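The stopping criterion of Equation 3 amounts to a one-line check between two consecutive density vectors. The sketch below assumes densities are stored as flat Python lists; the function name and the default `eps` are illustrative.

```python
def has_converged(rho_old, rho_new, eps=1e-3):
    """Return True when the largest element-wise density change is at most eps."""
    if len(rho_old) != len(rho_new):
        raise ValueError("density vectors must have the same length")
    max_change = max(abs(new - old) for old, new in zip(rho_old, rho_new))
    return max_change <= eps


# One element still moves by 0.1, so the second call reports no convergence.
print(has_converged([0.5, 0.5], [0.5, 0.5005]))
print(has_converged([0.5, 0.5], [0.5, 0.6]))
```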
SIMP-based TO often suffers from numerical instabilities such as the checkerboard phenomenon (Sigmund and Petersson, [1998](https://arxiv.org/html/2606.19921#bib.bib25)), where void and solid elements appear alternately. Various filters have been proposed to resolve this issue, such as the density filter (Sigmund, [2007](https://arxiv.org/html/2606.19921#bib.bib24)), the sensitivity filter (Sigmund and Maute, [2012](https://arxiv.org/html/2606.19921#bib.bib26)) and the PDE filter (Kawamoto et al., [2011](https://arxiv.org/html/2606.19921#bib.bib15)).

In this work, SIMP iterations are performed using the popular MATLAB codes of Andreassen et al. ([2011](https://arxiv.org/html/2606.19921#bib.bib3)) and Wang et al. ([2025](https://arxiv.org/html/2606.19921#bib.bib30)). They are used to prepare our datasets and also to verify our method.

### 2.2 Deep Learning Assisted Topology Optimization (DLTOP)

Looking at the density evolutions of individual elements (e.g., [Figure 1](https://arxiv.org/html/2606.19921#S2.F1)), we find that significant changes mainly occur during the early stage of the optimization, whereas density evolution becomes relatively stable in the later stage. Motivated by this characteristic, an acceleration method named Deep Learning Assisted Topology Optimization (DLTOP) (Kallioras et al., [2020](https://arxiv.org/html/2606.19921#bib.bib14)) was proposed to predict the near-optimal density of an element based on its early history of the density sequence, thereby skipping a large number of intermediate iterations.

The training dataset of DLTOP is prepared at the element level\. Therefore, running a single TO problem can already generate many data samples, and thus data preparation in DLTOP is efficient\. A dataset consists of pairs of element density sequences at the early stage and the optimized element densities\. Specifically, for every element, the density sequence obtained from the first N iterations serves as the input, whereas its final density is the label\.

Once trained, the neural net can be used for new scenarios without retraining\. The input is obtained by performing SIMP for N iterations\. Based on this, the neural net predicts a density value for each element, which collectively yields a near\-optimal structure\. Finally, another few SIMP iterations are performed to improve structural connectivity and satisfy the prescribed volume constraint until the convergence criterion is satisfied\.
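The element-level data preparation described above can be sketched as follows. The layout `history[t][e]` (iteration first, element second) and the function name are assumptions made for this illustration; the numbers are made up.

```python
def build_element_samples(history, n):
    """Build one (input, label) pair per element.

    history[t][e] is the density of element e at SIMP iteration t.
    The input is the element's density over the first n iterations,
    the label is its density at the last recorded iteration.
    """
    if n >= len(history):
        raise ValueError("need more than n recorded iterations")
    num_elements = len(history[0])
    samples = []
    for e in range(num_elements):
        early = [history[t][e] for t in range(n)]
        label = history[-1][e]
        samples.append((early, label))
    return samples


# Toy history: 4 iterations of a 3-element mesh (made-up numbers).
history = [
    [0.5, 0.5, 0.5],
    [0.6, 0.4, 0.5],
    [0.8, 0.2, 0.6],
    [1.0, 0.0, 0.7],
]
samples = build_element_samples(history, n=2)
print(samples[0])
```

A single optimization run therefore yields as many samples as the mesh has elements, which is why data preparation at the element level is cheap.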

![Refer to caption](https://arxiv.org/html/2606.19921v1/x1.png)

Figure 1: Evolution of element densities with respect to the number of iterations. In the early stage (red region), element densities undergo drastic and fluctuating changes, while in the late stage (blue region), density variations tend to be stable.

It is worth mentioning that DLTOP divides the labels into 3 or 12 classes according to the ranges of density values. Classifications are done by

$$
\rho_i =
\begin{cases}
0, & \rho_i \in [0, 0.4], \\
0.5, & \rho_i \in (0.4, 0.7), \\
1, & \rho_i \in [0.7, 1],
\end{cases}
\tag{4}
$$

or

$$
\rho_i =
\begin{cases}
0, & \rho_i = 0, \\
0.05, & \rho_i \in (0, 0.1], \\
\vdots & \vdots \\
0.95, & \rho_i \in (0.9, 1.0), \\
1, & \rho_i = 1,
\end{cases}
\tag{5}
$$

where $\rho_i$ ($i = 1, 2, \ldots$) is a labeled density. Note that only the final densities are classified into discrete values, whereas the inputs remain continuous.
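A minimal sketch of the two labeling rules is given below. The 3-class rule follows Equation 4 directly. For the 12-class rule, the rows elided by the dots in Equation 5 are assumed here to be the bins $(k/10, (k+1)/10]$ mapped to their midpoints; that reading and the function names are assumptions of this example.

```python
def label_3_classes(rho):
    """Equation 4: map a final density to one of {0, 0.5, 1}."""
    if not 0.0 <= rho <= 1.0:
        raise ValueError("density must lie in [0, 1]")
    if rho <= 0.4:
        return 0.0
    if rho < 0.7:
        return 0.5
    return 1.0


def label_12_classes(rho):
    """Equation 5, assuming the elided rows are the bins (k/10, (k+1)/10]."""
    if not 0.0 <= rho <= 1.0:
        raise ValueError("density must lie in [0, 1]")
    if rho == 0.0 or rho == 1.0:
        return rho  # pure void and pure solid have their own classes
    k = 1
    while rho > k / 10:
        k += 1
    return (k - 0.5) / 10  # bin midpoint: 0.05, 0.15, ..., 0.95


print(label_3_classes(0.55), label_12_classes(0.07))
```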

However, DLTOP overlooks the fact that the evolutions of neighboring elements depend on one another, which leads to the need for a relatively large dataset, poor structural connectivity, and even isolated structural features.

![Refer to caption](https://arxiv.org/html/2606.19921v1/x2.png)

Figure 2: Overall workflow of eCNNTO. (a) Preparation of the training dataset, where SIMP is performed under several different settings. (b) eCNNTO is trained on the labeled data at the element level. (c) A design domain is discretized into a mesh with a certain resolution. (d) Preparation of the input data, where SIMP is performed for only a few iterations. (e) eCNNTO predicts a near-optimal structure, skipping a large number of intermediate iterations needed in SIMP. (f) Structural fine-tuning via SIMP.

## 3 Element-Based CNN for Accelerating TO (eCNNTO)

In this section, we introduce the proposed method, namely an element-based Convolutional Neural Network for accelerating Topology Optimization (eCNNTO). [Figure 2](https://arxiv.org/html/2606.19921#S2.F2) illustrates the overall workflow of eCNNTO. It is divided into offline training and online prediction. During offline training, eCNNTO learns the evolution of element densities based on the data obtained from SIMP. At the online stage, the trained eCNNTO predicts the near-optimal structures of new problems according to the early histories of element densities (computed by SIMP). Finally, the predicted structures are further optimized by SIMP until convergence. In what follows, we first introduce the architecture of eCNNTO, where CNN and the residual connections (He et al., [2016](https://arxiv.org/html/2606.19921#bib.bib11)) are used. Next, we discuss the construction of the dataset in detail. In the end, a new training strategy is presented to reduce the size of the training dataset through a particular selection of input features.

### 3.1 Network architecture

![Refer to caption](https://arxiv.org/html/2606.19921v1/x3.png)

Figure 3: Network architecture of eCNNTO. Convolutional Feature Extractor (CFE) contains 6 residual blocks to extract spatial features from the input, which is a density sequence of an element patch. In each block, residual connection is constructed through a shortcut path. Fully Connected Classification Head (FCCH), composed of 3 linear layers, integrates the extracted features from CFE to output the near-optimal density of the target element.

We start with the network architecture of eCNNTO; see [Figure 3](https://arxiv.org/html/2606.19921#S3.F3). In addition to the input and output, it has two main building blocks: a Convolutional Feature Extractor (CFE) and a Fully Connected Classification Head (FCCH). In the input layer, eCNNTO works with the so-called element patch, which consists of the target element and its neighboring elements, together forming a local window of size $H \times W$. The input to the network is a density sequence of an element patch, which is obtained by performing SIMP for $N$ iterations. Thus, the input is an $N \times H \times W$ tensor. In practice, we usually take $H = W = 3$ (3D) or $5$ (2D). Their choice (as well as that of $N$) and influence will be further discussed in [Section 4](https://arxiv.org/html/2606.19921#S4).

The element patch is introduced due to the adoption of Convolutional Neural Networks (CNNs) in the proposed architecture. CNNs are a class of deep learning models that leverage local receptive fields and weight sharing to efficiently extract spatial features from grid-based data (LeCun et al., [2015](https://arxiv.org/html/2606.19921#bib.bib16)). The explicit inductive bias of CNN is necessary for capturing local spatial correlations, making it an ideal candidate to address the poor connectivity issue of DLTOP.

The input is fed into CFE, which is a typical network architecture used to extract local features. CFE mainly stacks a certain number of residual blocks. Each block features a convolutional layer (Conv), followed by a batch normalization layer (BN) and a ReLU activation function. In a convolutional layer, let $\mathbf{X}^{(l-1)} \in \mathbb{R}^{C_{in} \times H \times W}$ and $\mathbf{W}^{(l)} \in \mathbb{R}^{C_{out} \times C_{in} \times K_H \times K_W}$ be the input feature map and the learnable kernel weights of the $l^{th}$ layer, respectively, where $C_{in}$ and $C_{out}$ are the numbers of channels of the input and output, and $K_H$ and $K_W$ are the height and width of the kernel. When $l = 1$, $\mathbf{X}^{(0)}$ is simply the input and thus $C_{in} = N$. We take $K_H = K_W = 3$ in this work unless otherwise stated. The output feature map $\mathbf{Z}^{(l)} \in \mathbb{R}^{C_{out} \times H' \times W'}$ of the $l^{th}$ layer is given by

$$\mathbf{Z}^{(l)} = \mathbf{W}^{(l)} \ast \mathbf{X}^{(l-1)}, \tag{6}$$

where the symbol $\ast$ denotes the discrete convolution operator. It slides the kernel $\mathbf{W}^{(l)}$ across $\mathbf{X}^{(l-1)}$ at a certain stride (1 in this work), performs an entry-wise multiplication, and sums up the products into a scalar. While the dimensions of $\mathbf{X}^{(l-1)}$ and $\mathbf{Z}^{(l)}$ can be different in general, we make $H' = H$ and $W' = W$ by introducing padding around the element patch, which basically enlarges the element patch by certain layers of zeros (1 layer in this work due to the choice of $K_H$ and $K_W$).
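To make the sliding-window operation and the effect of zero padding concrete, here is a minimal single-channel sketch in plain Python. It is a simplification of Equation 6, which additionally sums over all input channels; the function name is hypothetical, and the kernel is applied without flipping, as is customary in deep-learning libraries.

```python
def conv2d_same(x, kernel):
    """Stride-1 2D convolution of one channel with zero padding.

    The output has the same height and width as the input.
    """
    h, w = len(x), len(x[0])
    kh, kw = len(kernel), len(kernel[0])
    ph, pw = kh // 2, kw // 2  # one layer of zeros for a 3x3 kernel
    out = [[0.0] * w for _ in range(h)]
    for i in range(h):
        for j in range(w):
            total = 0.0
            for a in range(kh):
                for b in range(kw):
                    r, c = i + a - ph, j + b - pw
                    if 0 <= r < h and 0 <= c < w:  # outside the patch counts as zero
                        total += kernel[a][b] * x[r][c]
            out[i][j] = total
    return out


patch = [[1.0, 1.0, 1.0],
         [1.0, 1.0, 1.0],
         [1.0, 1.0, 1.0]]
box = [[1.0] * 3 for _ in range(3)]
z = conv2d_same(patch, box)
print(z)
```

With a $3 \times 3$ kernel and one layer of zeros, a $3 \times 3$ patch stays $3 \times 3$: the centre entry sees all nine inputs, whereas corner entries see only four.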

$\mathbf{Z}^{(l)}$ usually goes through the Batch Normalization (BN) layer, which normalizes $\mathbf{Z}^{(l)}$ to have zero mean and unit variance, and thus puts the parameters on a similar scale to enhance the convergence of training. It is defined as

$$\mathbf{F}^{(l)} = \text{BN}\left(\mathbf{Z}^{(l)}\right), \tag{7}$$

where the BN layer contains learnable weights accounting for scaling and shifting. $\mathbf{F}^{(l)}$ is followed by the ReLU activation and then fed into the next convolutional layer.

This unit of Conv, BN and ReLU repeats several times (2 in this work) in a single residual block. Moreover, a residual connection is used right before the last activation to deal with the vanishing gradients in deep networks. When the input and output dimensions of a residual block differ (e.g., $C_{out} \neq C_{in}$), an extra layer of Conv and BN will be added to the residual connection to maintain compatible dimensions. Specifically, it introduces a kernel $\mathbf{W}_b \in \mathbb{R}^{C_{out} \times C_{in} \times 1 \times 1}$ to act on $\mathbf{X}^{(l-1)}$. CFE in this work consists of 6 residual blocks described above, whose output dimensions are 64, 128, 256, 256, 128, and 64, respectively. After residual blocks, a convolutional layer is used to aggregate all the extracted features into a single latent vector.
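The rule for when a shortcut needs the extra $1 \times 1$ Conv and BN can be traced through the six block widths with a short sketch. The function name and the value `in_channels=10` are hypothetical; the paper discusses the actual choice of $N$ in its Section 4.

```python
def shortcut_plan(in_channels, widths=(64, 128, 256, 256, 128, 64)):
    """List (C_in, C_out, needs_projection) for each residual block."""
    plan = []
    c_in = in_channels
    for c_out in widths:
        # A 1x1 Conv + BN is only needed when the channel counts differ.
        plan.append((c_in, c_out, c_in != c_out))
        c_in = c_out
    return plan


# in_channels = 10 is a made-up number of input iterations, for illustration only.
for c_in, c_out, proj in shortcut_plan(in_channels=10):
    kind = "1x1 Conv + BN" if proj else "identity"
    print(f"{c_in:>3} -> {c_out:<3} shortcut: {kind}")
```

Under these widths, only the fourth block (256 to 256) can use a plain identity shortcut, unless the number of input channels happens to equal 64.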

Subsequently, the latent vector is fed into FCCH to predict the element density\. FCCH simply consists of several fully connected dense layers\. In this work, it has 3 dense layers\. Each of the first 2 layers has 100 neurons with a ReLU activation, whereas the number of neurons in the last layer corresponds to the output dimension\. Note that activation functions are not needed in the last layer\.

Following DLTOP, eCNNTO also treats density prediction as a classification task rather than a regression problem; see Equations [4](https://arxiv.org/html/2606.19921#S2.E4) and [5](https://arxiv.org/html/2606.19921#S2.E5). Correspondingly, the cross-entropy loss is used,

$$L = -\sum_{i=1}^{K} \rho_i \ln \hat{\rho}_i, \tag{8}$$

where $K$ ($K = 3$ or $12$) is the total number of classes, and $\hat{\rho}_i$ denotes the predicted probability that the target element falls into the class $\rho_i$. The specific choice of $K$ and its influence will be discussed in [Section 4](https://arxiv.org/html/2606.19921#S4). All the neural nets are trained using Adam with a learning rate of $10^{-3}$ and a weight decay of $10^{-4}$. Each model is trained for 100 epochs.
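The loss in Equation 8 can be evaluated for a single element as sketched below. Two assumptions are made for this illustration: the raw outputs of the last linear layer are turned into probabilities by a softmax, and $\rho_i$ acts as a one-hot indicator of the true class. The logits are made-up numbers.

```python
import math


def softmax(logits):
    """Turn raw outputs of the last linear layer into class probabilities."""
    m = max(logits)  # subtract the maximum for numerical stability
    exps = [math.exp(v - m) for v in logits]
    total = sum(exps)
    return [v / total for v in exps]


def cross_entropy(one_hot_label, probabilities):
    """Equation 8 for one element: -sum_i rho_i * ln(rho_hat_i)."""
    return -sum(
        y * math.log(p)
        for y, p in zip(one_hot_label, probabilities)
        if y > 0  # classes with zero label weight contribute nothing
    )


logits = [2.0, 0.5, -1.0]  # K = 3 classes; made-up numbers
label = [1.0, 0.0, 0.0]    # the true class is the first one
probs = softmax(logits)
loss = cross_entropy(label, probs)
print(probs, loss)
```

With a one-hot label, the sum collapses to the negative log-probability assigned to the true class, so the loss is zero only when that probability equals one.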

###### Remark 1

Compared with DLTOP, eCNNTO replaces the Deep Belief Networks (DBNs) with a ResNet-based CNN to preserve spatial information and thus enhance structural connectivity. Alternative network architectures, such as Transformers and Vision Transformers, were also investigated. However, Transformer architectures lack an explicit inductive bias for local spatial correlations, whereas Vision Transformers require much larger datasets than CNN to fully exploit the modeling capacity (Dosovitskiy et al., [2021](https://arxiv.org/html/2606.19921#bib.bib9)). As a result, these architectures show an inferior performance compared with the proposed method.

### 3.2 Dataset Construction

![Refer to caption](https://arxiv.org/html/2606.19921v1/x4.png) (a) Cantilever beam
![Refer to caption](https://arxiv.org/html/2606.19921v1/x5.png) (b) Simply supported beam

Figure 4: Problem setups for dataset construction in 2D, where $L_x = 2$, $L_y = 1$, and $V_f = 0.5$ is the volume fraction. (a) A cantilever beam with a concentrated force $F = 1$ applied at the midpoint of the right boundary, and (b) a simply supported beam with a uniformly distributed force $q = 1$ applied on the top.

![Refer to caption](https://arxiv.org/html/2606.19921v1/x6.png) (a)
![Refer to caption](https://arxiv.org/html/2606.19921v1/x7.png) (b)

Figure 5: Problem setups for dataset construction in 3D, where $L_x = 2$, $L_y = 1$, $L_z = 1$, $q = 1$, and $V_f = 0.4$ is the volume fraction. (a) A cantilever beam with a distributed force applied at a right area of the lower boundary, and (b) a cantilever beam with a distributed force applied at a center area of the right boundary.

Datasets are prepared by performing SIMP only on a couple of benchmark problems in 2D and 3D; see Figures [4](https://arxiv.org/html/2606.19921#S3.F4) and [5](https://arxiv.org/html/2606.19921#S3.F5), where the problem setups are intended to be limited and simple in terms of geometries, meshes, and loading cases because later we would like to show the generalization capabilities of the proposed method. [Figure 4](https://arxiv.org/html/2606.19921#S3.F4) shows the 2D case, which uses a cantilever beam and a simply supported beam. Two mesh resolutions are considered for a $2 \times 1$ design domain: $110 \times 60$ and $200 \times 100$. The 3D problem setups are shown in [Figure 5](https://arxiv.org/html/2606.19921#S3.F5). A $2 \times 1 \times 1$ cantilever beam is taken as the design domain with a single mesh resolution of $50 \times 100 \times 50$. Two loading cases are considered.

Since the dataset is constructed at the element level, running a single problem can already yield a large number of data samples, so data generation is highly efficient\. Indeed, through the above problems, we can obtain 53,200 samples in 2D and 500,000 in 3D, which are sufficient for training\.
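The quoted sample counts can be reproduced from the mesh sizes given above, under the assumption (made here for illustration) that every element of every solved problem contributes exactly one sample.

```python
# Assumption: one sample per element, per mesh, per problem setup.
meshes_2d = [(110, 60), (200, 100)]
problems_2d = 2  # cantilever beam and simply supported beam
samples_2d = problems_2d * sum(nx * ny for nx, ny in meshes_2d)

mesh_3d = (50, 100, 50)
load_cases_3d = 2
samples_3d = load_cases_3d * mesh_3d[0] * mesh_3d[1] * mesh_3d[2]

print(samples_2d, samples_3d)  # 53200 500000
```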

SIMP iterations are performed using open-source packages, including Andreassen et al. ([2011](https://arxiv.org/html/2606.19921#bib.bib3)) for 2D problems and Wang et al. ([2025](https://arxiv.org/html/2606.19921#bib.bib30)) for 3D. Note that we adopt the convergence criterion $\epsilon = 1 \times 10^{-3}$ for 2D and $\epsilon = 5 \times 10^{-3}$ for 3D, shown in [Equation 3](https://arxiv.org/html/2606.19921#S2.E3). Parameters specific to SIMP are given in [Table 1](https://arxiv.org/html/2606.19921#S3.T1).

Table 1: Parameter settings of SIMP.

###### Remark 2

Recall that the input is given in terms of an element patch\. However, a boundary element does not have neighbors beyond the boundary\. In this case, the element patch is padded with values of \-1 to indicate that the corresponding neighbors do not exist\.
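
The padding rule in this remark can be sketched as follows. This is a minimal illustration for a single 2D density snapshot, not the authors' implementation; the function name `extract_patch` and its arguments are assumptions, while the padding value of -1 is taken from the text.

```python
import numpy as np

def extract_patch(density, i, j, window=5, pad_value=-1.0):
    """Return the window x window patch centred on element (i, j).

    Neighbours that fall outside the mesh are filled with pad_value.
    """
    half = window // 2
    padded = np.pad(density, half, mode="constant", constant_values=pad_value)
    # Element (i, j) sits at (i + half, j + half) in the padded array,
    # so its patch starts at (i, j).
    return padded[i : i + window, j : j + window]

density = np.full((4, 6), 0.5)          # stand-in density field
corner = extract_patch(density, 0, 0, window=3)
print(corner)
```

For the corner element, the first row and first column of the patch are -1, which marks the missing neighbours; interior elements receive a patch without any padding.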

### 3\.3Feature selection for training

![Refer to caption](https://arxiv.org/html/2606.19921v1/x8.png)

Figure 6: Three options to select features for training: Early Stage (green), Middle Stage (blue), and Final Stage (red). A feature is a density sequence of consecutive SIMP iterations. Regardless of the feature choice, the label is always the final element density (purple).

A data sample contains the entire density evolution of a given element patch obtained from SIMP iterations. For training, $N$ consecutive iterations are selected as the input feature, whereas the element density at the final iteration serves as the label. Depending on where these $N$ iterations come from, we investigate three options: the first $N$ iterations (Early Stage), the $N$ iterations in the middle of the evolution history (Middle Stage), and the last $N$ iterations excluding the final iteration (Final Stage); see [Figure 6](https://arxiv.org/html/2606.19921#S3.F6).
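
The three feature options can be expressed as slices of a density history. The sketch below is illustrative only: the function `select_feature` is hypothetical, and the exact placement of the Middle Stage window (centred among the iterations before the final one) is an assumption, since the text does not pin it down.

```python
import numpy as np

def select_feature(history, n, stage):
    """Split a density history of shape (T, ...) into (feature, label).

    The label is always the final entry; the feature is n consecutive
    entries taken from the requested stage.
    """
    total = history.shape[0]
    if total < n + 1:
        raise ValueError("history must contain at least n + 1 iterations")
    if stage == "early":
        start = 0
    elif stage == "middle":
        # Assumption: window centred within the iterations before the final one.
        start = (total - 1 - n) // 2
    elif stage == "final":
        # Last n iterations, excluding the final iteration itself.
        start = total - 1 - n
    else:
        raise ValueError(f"unknown stage: {stage}")
    return history[start : start + n], history[-1]

history = np.arange(10.0)  # stand-in for a 10-iteration density sequence
feature, label = select_feature(history, n=3, stage="final")
print(feature, label)  # [6. 7. 8.] 9.0
```

With a 10-iteration history and `n=3`, the Final Stage feature covers iterations 6 to 8 and the label is iteration 9, so the feature and the label never overlap.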

As will be shown, different choices of the feature lead to greatly varying performance in both training and testing. Note that DLTOP only considered the Early Stage strategy. Indeed, both the Middle and Final Stage strategies appear counter-intuitive. However, the Final Stage proves to outperform the Early Stage in every aspect, including the speedup for TO, structural connectivity, and the required data size.

To identify the most effective strategy, three neural networks with identical architecture and hyperparameters are trained on the same dataset (i.e., the 2D case introduced in [Section 3.2](https://arxiv.org/html/2606.19921#S3.SS2)). The only difference is the choice of the feature. Once trained, the networks are tested on three benchmark problems (see [Appendix A](https://arxiv.org/html/2606.19921#A1) for details) in terms of the speedup and structural connectivity. [Table 2](https://arxiv.org/html/2606.19921#S3.T2) lists the total number of iterations required to achieve an optimized structure. The results of “SIMP” serve as a reference, whereas the results of “Early”, “Middle”, and “Final” are obtained by applying the online prediction procedure of eCNNTO; see [Figure 2](https://arxiv.org/html/2606.19921#S2.F2). “NA” means that the predicted structure is invalid due to the presence of disconnected components.

Table 2: Total number of iterations required to achieve an optimized structure under different training strategies.

We observe that the model trained with the Final Stage features consistently achieves the best acceleration performance and does not produce disconnected structures in any of the tested examples. In contrast, models trained with the other two options exhibit inferior acceleration performance and may generate disconnected structures. Therefore, we adopt the Final Stage features for training in this work.

It is worth mentioning that, once the model is trained and used to accelerate TO, its input is still given by the first $N$ SIMP iterations. Although this introduces a mismatch between training and inference, the observed performance improvement can be explained by the characteristics of the data. The Early Stage data contains a large amount of noise due to the drastic evolution of the initial structure, so a model trained on it has difficulty learning the density evolution pattern. In contrast, density variations in the Final Stage contain subtle yet critical information about the convergence behavior towards the optimized value, which greatly affects the acceleration performance. Accurately capturing this convergence behavior helps reduce the number of SIMP iterations required to fine-tune the near-optimal structure predicted by eCNNTO. On the other hand, the Middle Stage data only contains intermediate density variations that may be very similar across different problems and thus fail to serve as distinguishing features. As a result, the prediction may be poor in a new scenario, which also explains why the model trained with this strategy may perform even worse than plain SIMP without acceleration.

###### Remark 3

DLTOP uses the Early Stage data as the feature for training, whereas eCNNTO uses data from the Final Stage\. As the Early Stage data is much noisier than the Final Stage, DLTOP demands a larger dataset for training than eCNNTO, which will be shown in the next section\.

## 4Numerical examples

In this section, we present a variety of 2D and 3D test examples to demonstrate the efficiency and generalization capabilities of eCNNTO. The models have been trained according to the method introduced in the previous section. Here, they are directly applied to accelerate TO without retraining. We first investigate the effect of the classification criterion on both the acceleration performance and generalization of eCNNTO. Next, we compare eCNNTO with DLTOP, in particular on the structural connectivity and training data size. Finally, we evaluate the generalization capabilities of eCNNTO under various 2D and 3D settings, such as different boundary conditions, loading cases, design domain geometries, mesh resolutions, and non-design domains. 2D models are trained on an Intel® Core™ i7-12700 CPU @ 2.10 GHz with 64.0 GB DDR5 RAM and an NVIDIA RTX A4000, whereas 3D models are trained on an Intel® Xeon® Gold 6248R CPU @ 3.00 GHz with 32.0 GB DDR4 RAM and an NVIDIA RTX A6000.

### 4\.1Classification criterion

This section evaluates the impact of the classification granularity (i.e., the number of density classes) on the acceleration performance of eCNNTO. We compare two cases, where density values are classified into 3 classes or 12 classes; see Equations [4](https://arxiv.org/html/2606.19921#S2.E4) and [5](https://arxiv.org/html/2606.19921#S2.E5). Four examples are evaluated on a $2\times 1$ design domain with a $110\times 60$ mesh. Examples 1 and 2 are in-distribution (ID) samples drawn directly from the training dataset (from [Figure 4](https://arxiv.org/html/2606.19921#S3.F4)), where the model is trained with the Final Stage sequence and is tested here with the Early Stage sequence as the input. Examples 3 and 4 are out-of-distribution (OOD) problems featuring boundary conditions that are not covered during training; see [Figure 7](https://arxiv.org/html/2606.19921#S4.F7). Both of them adopt the sensitivity filter with a radius of 3.

The results are summarized in[Table 3](https://arxiv.org/html/2606.19921#S4.T3)\. The number of iterations in eCNNTO \(also DLTOP\) is the sum of two parts: \(1\) the number of SIMP iterations used to generate the input, and \(2\) the number of SIMP iterations required to fine\-tune the predicted structure by eCNNTO until convergence\. We observe that both classification schemes yield substantial acceleration compared to the baseline SIMP across all the examples\. Meanwhile, the 3\-class scheme outperforms the 12\-class one except for Example 1\.

Table 3: Comparison of two classification schemes in terms of the acceleration performance.

Note: Acceleration = (# SIMP - # eCNNTO) / # SIMP $\times 100\%$
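
The acceleration metric in the note, together with the two-part iteration count described above, can be written as a small helper. The sketch below is an illustration; the function names are hypothetical and the iteration counts are made-up numbers chosen only to exercise the formula, not results from the paper. The second helper converts the percentage into the equivalent iteration ratio # SIMP / # eCNNTO.

```python
def acceleration(simp_iters, input_iters, finetune_iters):
    """Percentage reduction in iterations relative to plain SIMP."""
    # Accelerated count = iterations to build the input + fine-tuning iterations.
    accelerated = input_iters + finetune_iters
    return (simp_iters - accelerated) / simp_iters * 100.0

def speedup_factor(acceleration_percent):
    """Equivalent iteration ratio, # SIMP / # eCNNTO."""
    return 1.0 / (1.0 - acceleration_percent / 100.0)

# Hypothetical counts, not taken from the paper's tables.
ratio = acceleration(simp_iters=400, input_iters=48, finetune_iters=32)
print(ratio, speedup_factor(ratio))
```

In this hypothetical case, 80 accelerated iterations against 400 SIMP iterations give an acceleration of 80%, which corresponds to roughly a fivefold reduction in iterations.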

![Refer to caption](https://arxiv.org/html/2606.19921v1/x9.png)\(a\)Cantilever beam
![Refer to caption](https://arxiv.org/html/2606.19921v1/x10.png)\(b\)Simply supported beam

Figure 7: Two out-of-distribution examples, where $L_x=2$, $L_y=1$, and $V_f=0.5$ is the volume fraction. (a) A cantilever beam subjected to a concentrated force $F=1$ at its lower-right corner, and (b) a simply supported beam under a uniformly distributed force $q=1$.

Intuitively, a finer classification granularity (e.g., 12 classes) narrows the density intervals, allowing the network predictions to more closely approximate the ground-truth labels and thus to accelerate convergence. However, a larger number of classes inherently increases the complexity of the classification task and makes predictions prone to falling into adjacent classes rather than the target class, leading to increased prediction errors on unseen data. This is the reason for the degraded performance of the 12-class scheme on OOD samples. On the other hand, the 3-class scheme shows a better trade-off between robustness and generalization capabilities across different problems, and is thus adopted for all the subsequent examples.
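
The first effect mentioned above, that more classes narrow the density intervals, can be illustrated numerically. The sketch below assumes equal-width bins on $[0, 1]$ with the bin midpoint as the representative density. This is an assumption made purely for illustration: the actual class definitions are given by Equations 4 and 5 and may differ. The sketch says nothing about how hard each classification task is to learn.

```python
import numpy as np

def classify(density, num_classes):
    """Map densities in [0, 1] to class indices of equal-width bins (assumed)."""
    idx = np.floor(np.asarray(density) * num_classes).astype(int)
    return np.clip(idx, 0, num_classes - 1)

def class_midpoint(idx, num_classes):
    """Representative density of each class, taken as the bin midpoint."""
    return (np.asarray(idx) + 0.5) / num_classes

rho = np.linspace(0.0, 1.0, 1001)
# Worst-case gap between a density and the representative value of its class.
max_err = {
    k: float(np.abs(class_midpoint(classify(rho, k), k) - rho).max())
    for k in (3, 12)
}
print(max_err)
```

Under this assumed binning, the worst-case gap is half a bin width, i.e. about 1/6 for 3 classes and 1/24 for 12 classes, so a correct 12-class prediction sits closer to the true density, while a prediction that is off by one class loses that advantage.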

### 4\.2Comparison with DLTOP

This section demonstrates the advantages of eCNNTO in ensuring structural connectivity and validity. We compare its performance with DLTOP on two representative examples in 2D. The first example is shown in [Figure 8](https://arxiv.org/html/2606.19921#S4.F8)(a), where a $50\times 20$ mesh is used for a $50\times 20$ domain. We set the volume fraction and the filter radius to 0.2 and 1.5, respectively, which follows a typical setting when using SIMP for this example. The number of input iterations is 5 for both DLTOP and eCNNTO, as it is sufficient to capture the early drastic change in the density evolution. The predicted structures of DLTOP and eCNNTO are shown in [Figure 8](https://arxiv.org/html/2606.19921#S4.F8)(b, c), respectively. The predicted structure of DLTOP has isolated pieces in the bottom-left and bottom-right regions (highlighted in red boxes). It also has corner-contact parts in the top region (highlighted in the red circle). This is because DLTOP relies solely on individual elements for prediction, making its outputs vulnerable to early evolutionary fluctuations. In contrast, eCNNTO builds in the spatial correlation of the neighboring structural evolution through the CNN. As a result, even if the density evolution of a single element fluctuates drastically, the variation over a group of neighboring elements is smoothed out, which yields much more accurate predictions.
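
The two defects discussed above, isolated pieces and corner-contact parts, can both be detected by counting edge-connected components of a thresholded density field. The sketch below is one possible check, not the validity test used in the paper, which is not specified here; the function name, the binary input, and the choice of 4-connectivity are assumptions.

```python
from collections import deque

def count_components(solid):
    """Count edge-connected groups of solid elements in a 2D grid.

    Only the four edge neighbours are linked, so elements that touch at a
    single corner are counted as separate components.
    """
    rows, cols = len(solid), len(solid[0])
    seen = [[False] * cols for _ in range(rows)]
    components = 0
    for r in range(rows):
        for c in range(cols):
            if not solid[r][c] or seen[r][c]:
                continue
            # New component: flood-fill it with a breadth-first search.
            components += 1
            seen[r][c] = True
            queue = deque([(r, c)])
            while queue:
                y, x = queue.popleft()
                for dy, dx in ((1, 0), (-1, 0), (0, 1), (0, -1)):
                    ny, nx = y + dy, x + dx
                    inside = 0 <= ny < rows and 0 <= nx < cols
                    if inside and solid[ny][nx] and not seen[ny][nx]:
                        seen[ny][nx] = True
                        queue.append((ny, nx))
    return components

connected = [[1, 1, 0],
             [0, 1, 1]]
corner_contact = [[1, 0],
                  [0, 1]]
print(count_components(connected), count_components(corner_contact))  # 1 2
```

A structure with more than one component under this check contains either an isolated piece or a corner-only contact, both of which carry no load across the gap in a standard finite element model with edge-connected elements.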

![Refer to caption](https://arxiv.org/html/2606.19921v1/x11.png)\(a\)Problem setting
![Refer to caption](https://arxiv.org/html/2606.19921v1/x12.png)\(b\)DLTOP
![Refer to caption](https://arxiv.org/html/2606.19921v1/x13.png)\(c\)eCNNTO

Figure 8: Comparison of predicted structures between DLTOP and eCNNTO, where the optimized result should have a two-bar structure. (a) A beam fixed at the bottom with a concentrated force $F=1$ loaded horizontally on the top, where $L_x=50$, $L_y=20$, and $V_f=0.2$ is the volume fraction, (b) predicted structure of DLTOP that has both isolated pieces (bottom-left and bottom-right) and corner-contact parts (top), and (c) predicted structure of eCNNTO, where the desired structural connectivity is well preserved.

![Refer to caption](https://arxiv.org/html/2606.19921v1/x14.png)\(a\)Problem setting
![Refer to caption](https://arxiv.org/html/2606.19921v1/x15.png)\(b\)DLTOP
![Refer to caption](https://arxiv.org/html/2606.19921v1/x16.png)\(c\)eCNNTO

Figure 9: Comparison of predicted structures between DLTOP and eCNNTO. (a) A beam simply supported at four corners with two concentrated forces $F=1$ loaded vertically on the top and bottom, where $L_x=3$, $L_y=1$, and $V_f=0.4$ is the volume fraction, (b) predicted structure of DLTOP that has isolated pieces (top and bottom red circles) and intermediate-density defects (middle red box), and (c) predicted structure of eCNNTO, where the desired structure is obtained without any defects.

Next, we study a more complex example shown in [Figure 9](https://arxiv.org/html/2606.19921#S4.F9)(a), where we adopt a $150\times 50$ mesh for a $3\times 1$ domain, and set the volume fraction to 0.4. The sensitivity and density filters are used in SIMP with a radius of 2. The number of input iterations of both eCNNTO and DLTOP is 36. Results similar to the first example are observed in [Figure 9](https://arxiv.org/html/2606.19921#S4.F9)(b, c), where isolated pieces (highlighted in the red circles) appear in the predicted structure of DLTOP but not in that of eCNNTO. Meanwhile, intermediate-density defects (gray shades in the red box) appear in the DLTOP prediction around the vertical component, where void is expected. This is because elements located at structural boundaries undergo drastic variations in densities, and the absence of spatial correlation leads to fluctuations in predicted densities near boundaries. In contrast, eCNNTO extracts density variation patterns of neighboring elements via the CNN, and thus smooths out sharp fluctuations near structural boundaries. Therefore, it produces more accurate predictions for elements around boundaries.

Finally, eCNNTO achieves superior performance with far fewer training samples than DLTOP. It requires only 11.1% of the training samples needed by DLTOP: 53,200 (eCNNTO) versus 480,000 (DLTOP). Due to the lack of spatial correlations among elements, DLTOP demands higher training costs yet fails to effectively resolve isolated pieces and intermediate-density defects. Moreover, further enlarging the DLTOP dataset would not overcome this issue. In contrast, by adjusting the network architecture and carefully choosing features for training, eCNNTO generates structures with better connectivity and far fewer defects while significantly reducing the required data size. These adjustments also further enhance the speedup of eCNNTO, which will be discussed in detail in [Section 4.3](https://arxiv.org/html/2606.19921#S4.SS3).

### 4\.3eCNNTO in 2D

![Refer to caption](https://arxiv.org/html/2606.19921v1/x17.png)\(a\)Long beam
![Refer to caption](https://arxiv.org/html/2606.19921v1/x18.png)\(b\)Square
![Refer to caption](https://arxiv.org/html/2606.19921v1/x19.png)\(c\)Column
![Refer to caption](https://arxiv.org/html/2606.19921v1/x20.png)\(d\)L\-shaped
![Refer to caption](https://arxiv.org/html/2606.19921v1/x21.png)\(e\)Uniformly distributed loading \(UDL\)

Figure 10: Problem settings of 2D test problems, where $V_f=0.4$ is the volume fraction. (a) A long beam simply supported at four corners with $L_x=8$, $L_y=1.5$, and $F=1$, (b) a square domain simply supported at the left two corners with $L_x=4$, $L_y=4$, and $F=1$, (c) a vertical column partially fixed at the bottom with $L_x=3$, $L_y=5$, and $F=1$, (d) an L-shaped bracket clamped on the top with $L_x=2$, $L_y=2$, and $F=1$, and (e) a beam simply supported at five places on the bottom and subjected to two uniformly distributed forces $q=1$, with $L_x=4$ and $L_y=1$.

This section evaluates the efficiency and the generalizability of eCNNTO using 2D problems. The model has been trained according to the problem settings in [Figure 4](https://arxiv.org/html/2606.19921#S3.F4). Now it is tested directly on the five problems shown in [Figure 10](https://arxiv.org/html/2606.19921#S4.F10) without any kind of retraining. Comparing Figures [4](https://arxiv.org/html/2606.19921#S3.F4) and [10](https://arxiv.org/html/2606.19921#S4.F10), we observe significant differences between the training data and the test cases in terms of boundary conditions, loading cases, and design domain geometries. In fact, different mesh resolutions will also be tested; see [Table 4](https://arxiv.org/html/2606.19921#S4.T4). These tests are mainly intended to show the generalization capabilities of eCNNTO. Regarding TO parameters, the volume fraction of all these examples is $0.4$. The radius of the sensitivity filter is 6 for all but the L-shaped beam, which takes 2. For eCNNTO, the window size of an element patch is $W=5$ and the number of input iterations is $N=48$.

Table 4: Mesh resolutions of 2D test problems.

Note: $N_x$ and $N_y$ are the numbers of elements in the x and y directions, respectively. In the case of the L-shaped domain, the mesh refers to its bounding box.

Table 5: Acceleration performance of eCNNTO in 2D and resulting compliance.

Table 6: Runtime comparison in 2D test problems between SIMP and eCNNTO.

![Refer to caption](https://arxiv.org/html/2606.19921v1/x22.png)\(a\)SIMP
![Refer to caption](https://arxiv.org/html/2606.19921v1/x23.png)\(b\)eCNNTO

![Refer to caption](https://arxiv.org/html/2606.19921v1/x24.png)\(c\)SIMP
![Refer to caption](https://arxiv.org/html/2606.19921v1/x25.png)\(d\)eCNNTO

![Refer to caption](https://arxiv.org/html/2606.19921v1/x26.png)\(e\)SIMP
![Refer to caption](https://arxiv.org/html/2606.19921v1/x27.png)\(f\)eCNNTO

![Refer to caption](https://arxiv.org/html/2606.19921v1/x28.png)\(g\)SIMP
![Refer to caption](https://arxiv.org/html/2606.19921v1/x29.png)\(h\)eCNNTO

![Refer to caption](https://arxiv.org/html/2606.19921v1/x30.png)\(i\)SIMP
![Refer to caption](https://arxiv.org/html/2606.19921v1/x31.png)\(j\)eCNNTO

Figure 11: Optimized structures in 2D using SIMP and eCNNTO: (a, b) Long beam, (c, d) Square, (e, f) Column, (g, h) L-shaped, and (i, j) UDL. In each pair, the former is obtained by SIMP and the latter by eCNNTO.

The acceleration performance and the corresponding structural compliance (i.e., the optimization objective) are summarized in [Table 5](https://arxiv.org/html/2606.19921#S4.T5), where the SIMP results are taken as a baseline. We first observe that eCNNTO achieves a substantial reduction in the number of SIMP iterations across all five examples, with the acceleration ratio ranging from 75.8% to 90.5%. This performance demonstrates the strong generalization capability of eCNNTO: without retraining, eCNNTO yields significant speedup under unseen boundary conditions, loading cases, domain geometries, and mesh resolutions. Moreover, eCNNTO produces compliance values comparable to, and in most examples lower than, those obtained by SIMP, demonstrating that it can significantly accelerate convergence without compromising the performance. This is because the structure predicted by eCNNTO is fine-tuned by SIMP to meet the same convergence criterion, which yields a compliance comparable to that of SIMP.

The optimized structures are shown in[Figure 11](https://arxiv.org/html/2606.19921#S4.F11), where the structures of eCNNTO exhibit clearer boundaries and fewer intermediate\-density regions\. While the optimized structures of SIMP and eCNNTO show discrepancies in certain structural details, the compliance values of eCNNTO are usually smaller\. This is due to the fact that topology optimization is non\-convex\. The predicted structure by eCNNTO generally does not correspond to a certain iteration of the original SIMP method, thereby leading to slightly different but close local minima\. We find that increasing the number of input iterations can effectively mitigate this difference\.

Regarding the runtime, it scales almost linearly with the number of SIMP iterations, so the reduction of SIMP iterations by eCNNTO leads to a significant speedup in the actual computational time; see [Table 6](https://arxiv.org/html/2606.19921#S4.T6). We observe that the acceleration ratio in runtime ranges from 76.0% to 91.7%, which corresponds to a roughly 4- to 12-fold speedup.

We further compare eCNNTO with DLTOP in terms of the acceleration performance. The acceleration ratios (computed against SIMP) of both methods are reported in [Table 7](https://arxiv.org/html/2606.19921#S4.T7). We observe that eCNNTO consistently outperforms DLTOP in all examples. Note that the results of DLTOP come from Kallioras et al. ([2020](https://arxiv.org/html/2606.19921#bib.bib14)).

Table 7: 2D optimization comparison between DLTOP and eCNNTO.

### 4\.4eCNNTO in 3D

This section evaluates eCNNTO on 3D test problems to demonstrate its significant speedup. The model has been trained on the problem settings in [Figure 5](https://arxiv.org/html/2606.19921#S3.F5), whereas four problems shown in [Figure 12](https://arxiv.org/html/2606.19921#S4.F12) are tested without retraining. Similar to the 2D scenario, these test problems have different boundary conditions and loading cases from the training examples, as can be seen by comparing Figures [5](https://arxiv.org/html/2606.19921#S3.F5) and [12](https://arxiv.org/html/2606.19921#S4.F12). Moreover, the mesh resolutions of the test problems are significantly higher than those of the training problems, as shown in [Table 8](https://arxiv.org/html/2606.19921#S4.T8). Recall that a $50\times 100\times 50$ mesh is used for training. For all 3D examples, a window size of $W=3$, a number of input iterations $N=24$, and a PDE filter (Kawamoto et al., [2011](https://arxiv.org/html/2606.19921#bib.bib15)) with radius 3 are adopted. The other hyperparameters and training settings are kept consistent with those in the 2D experiments.

![Refer to caption](https://arxiv.org/html/2606.19921v1/x32.png)\(a\)Cantilever 1
![Refer to caption](https://arxiv.org/html/2606.19921v1/x33.png)\(b\)Cantilever 2
![Refer to caption](https://arxiv.org/html/2606.19921v1/x34.png)\(c\)Long beam
![Refer to caption](https://arxiv.org/html/2606.19921v1/x35.png)\(d\)Short beam

Figure 12: Problem settings of 3D test problems, where $F=1$ is the concentrated force. (a) A cantilever with a concentrated force at the bottom of the right boundary with $L_x=200$, $L_y=100$, and $L_z=100$, (b) a cantilever with a concentrated force at the center of the right boundary with $L_x=200$, $L_y=100$, and $L_z=100$, (c) a simply supported long beam with a concentrated force at the center of the top surface with $L_x=300$, $L_y=100$, and $L_z=100$, and (d) a simply supported short beam with a concentrated force at the center of the top surface with $L_x=200$, $L_y=200$, and $L_z=200$.

Table 8: Mesh resolutions of 3D test problems.

Note: $N_x$, $N_y$, and $N_z$ are the numbers of elements in the x, y, and z directions, respectively.

Table 9: Acceleration performance of eCNNTO in 3D and resulting compliance.

Table 10: Runtime comparison in 3D test problems between SIMP and eCNNTO.

![Refer to caption](https://arxiv.org/html/2606.19921v1/x36.png)\(a\)SIMP
![Refer to caption](https://arxiv.org/html/2606.19921v1/x37.png)\(b\)eCNNTO
![Refer to caption](https://arxiv.org/html/2606.19921v1/x38.png)\(c\)SIMP
![Refer to caption](https://arxiv.org/html/2606.19921v1/x39.png)\(d\)eCNNTO
![Refer to caption](https://arxiv.org/html/2606.19921v1/x40.png)\(e\)SIMP
![Refer to caption](https://arxiv.org/html/2606.19921v1/x41.png)\(f\)eCNNTO
![Refer to caption](https://arxiv.org/html/2606.19921v1/x42.png)\(g\)SIMP
![Refer to caption](https://arxiv.org/html/2606.19921v1/x43.png)\(h\)eCNNTO

Figure 13: Optimized structures in 3D using SIMP and eCNNTO: (a, b) Cantilever beam 1, (c, d) Cantilever beam 2, (e, f) Long beam, and (g, h) Short beam. In each pair, the former is obtained by SIMP and the latter by eCNNTO.

The results of acceleration and compliance are summarized in [Table 9](https://arxiv.org/html/2606.19921#S4.T9). eCNNTO achieves 76.0% to 96.8% acceleration on these examples. As shown in [Figure 13](https://arxiv.org/html/2606.19921#S4.F13), very similar optimized structures are obtained using SIMP and eCNNTO, which is further confirmed by the almost identical compliance values in [Table 9](https://arxiv.org/html/2606.19921#S4.T9). Therefore, strong generalization capabilities are also observed in 3D problems. Owing to its large number of elements, 3D topology optimization is prone to sluggish convergence; see the long beam and the short beam for examples. In contrast, eCNNTO can learn the density evolution patterns and then bypass the sluggish region, thereby achieving rapid convergence.

The significant speedup is shown more clearly in terms of runtime; see [Table 10](https://arxiv.org/html/2606.19921#S4.T10). We observe that eCNNTO usually takes less than $20\%$ of the runtime of SIMP.

### 4\.5Effect of the window size

In this section, we investigate the impact of the window size (i.e., $W$ in [Figure 3](https://arxiv.org/html/2606.19921#S3.F3)) on the acceleration and compliance. We consider window sizes of 3, 5, and 7. All models are trained on the same dataset (see Figures [4](https://arxiv.org/html/2606.19921#S3.F4) and [5](https://arxiv.org/html/2606.19921#S3.F5)), where the only difference is the window size. Examples in both 2D ([Figure 10](https://arxiv.org/html/2606.19921#S4.F10)) and 3D ([Figure 12](https://arxiv.org/html/2606.19921#S4.F12)) are tested.

Table 11: Results of 2D test problems with different window sizes.

Table 12: Training cost of 2D models with different window sizes.

The results of 2D examples are listed in [Table 11](https://arxiv.org/html/2606.19921#S4.T11). We observe that the window size affects both the acceleration performance and the compliance. The model with $W=5$ requires the fewest iterations in most examples, whereas $W=7$ yields marginally lower compliance values. However, this minor improvement comes at the cost of a substantial increase in SIMP iterations, which implies that a large window size ($W=7$) may introduce redundant neighborhood information that hinders convergence. Moreover, enlarging the window size from $W=5$ to $W=7$ nearly doubles the memory requirement and increases the training time per epoch by over 40%; see [Table 12](https://arxiv.org/html/2606.19921#S4.T12). Since the slight performance gain of $W=7$ does not justify this disproportionate cost in both computational overhead and convergence iterations, a window size of $W=5$ is identified as the most balanced option for accelerating 2D TO problems.

Table 13: Results of 3D test problems with different window sizes.

Table 14: Training cost of 3D models with different window sizes.

Moreover, the results of 3D examples are summarized in [Table 13](https://arxiv.org/html/2606.19921#S4.T13). We observe that increasing the window size from 3 decreases both the number of iterations and the compliance in most examples. However, while a larger window size ($W=7$) yields marginally lower objective values in all examples, it does not consistently minimize the number of SIMP iterations. Due to the high dimensionality, enlarging the window from $W=3$ to $W=7$ increases the memory requirement by more than 13 times (from approximately 1.2 GB to 15.7 GB) and increases the training time per epoch by more than 9 times; see [Table 14](https://arxiv.org/html/2606.19921#S4.T14). Since the marginal gains offered by a larger window cannot justify its prohibitive computational cost, $W=3$ is identified as the best option for 3D topology optimization.
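
The reported memory growth is roughly what one would expect from the number of elements in a patch, which scales as $W^d$ in $d$ dimensions. The short calculation below is a back-of-the-envelope check under the assumption that the input size per sample is proportional to the patch element count; measured memory also includes weights, activations, and framework overhead, so only approximate agreement should be expected.

```python
def patch_elements(window, dim):
    """Number of elements in a window^dim patch."""
    return window ** dim

growth_2d = patch_elements(7, 2) / patch_elements(5, 2)  # 49 / 25
growth_3d = patch_elements(7, 3) / patch_elements(3, 3)  # 343 / 27
print(f"2D, W=5 -> 7: {growth_2d:.2f}x")   # 1.96x
print(f"3D, W=3 -> 7: {growth_3d:.2f}x")   # 12.70x
```

The factor of 1.96 in 2D is in line with the memory requirement that "nearly doubles", and the factor of about 12.7 in 3D is close to the reported increase of more than 13 times.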

### 4\.6Non\-design domains

Finally, to further demonstrate the generalization capabilities, we apply eCNNTO to TO problems with non-design domains, which are often encountered when certain important structural features must be kept during optimization. Elements in non-design domains are referred to as passive elements, whose densities are prescribed as either void or solid throughout the entire optimization process (Andreassen et al., [2011](https://arxiv.org/html/2606.19921#bib.bib3)). We evaluate the performance of eCNNTO on two 2D test examples; see [Figure 14](https://arxiv.org/html/2606.19921#S4.F14)(a, d). Again, the model has already been trained and is used here directly. The first example is a cantilever beam, where a circular hole inside the structure is a desired feature, so it is a non-design domain. The second example is a bridge with a deck as the non-design domain to ensure sufficient load-bearing capacity. Both examples use a $200\times 200$ mesh. Their volume fractions are 0.5 and 0.4, respectively.
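
The notion of passive elements can be made concrete with a small masking routine. The sketch below shows one way to reset prescribed densities on a density field; the integer labels, the function name `apply_passive`, and the idea of applying the mask to a predicted field are assumptions for illustration, not a description of how eCNNTO handles non-design domains internally.

```python
import numpy as np

VOID, SOLID = 1, 2  # assumed labels; 0 marks an ordinary design element

def apply_passive(density, passive):
    """Return a copy of density with passive elements set to prescribed values."""
    out = np.array(density, dtype=float, copy=True)
    out[passive == VOID] = 0.0
    out[passive == SOLID] = 1.0
    return out

predicted = np.full((3, 4), 0.6)      # stand-in for a predicted density field
passive = np.zeros((3, 4), dtype=int)
passive[0, :] = SOLID                 # e.g. a solid deck along the top row
passive[2, 1] = VOID                  # e.g. one element of a prescribed hole
result = apply_passive(predicted, passive)
print(result)
```

Design elements keep their predicted value of 0.6, while the deck row becomes fully solid and the hole element becomes void, regardless of what was predicted there.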

![Refer to caption](https://arxiv.org/html/2606.19921v1/x44.png)\(a\)Problem setting
![Refer to caption](https://arxiv.org/html/2606.19921v1/x45.png)\(b\)SIMP
![Refer to caption](https://arxiv.org/html/2606.19921v1/x46.png)\(c\)eCNNTO
![Refer to caption](https://arxiv.org/html/2606.19921v1/x47.png)\(d\)Problem setting
![Refer to caption](https://arxiv.org/html/2606.19921v1/x48.png)\(e\)SIMP
![Refer to caption](https://arxiv.org/html/2606.19921v1/x49.png)\(f\)eCNNTO

Figure 14: Topology optimization with non-design domains. (a) A cantilever beam with a hole as the non-design domain, where $L_x=2$, $L_y=1$, $r=0.4$, $l=0.7$, and $F=1$, (b, c) optimized structures of the cantilever beam by SIMP and eCNNTO, (d) a bridge with a deck on the top as the non-design domain, where $L_x=2$, $L_y=1$, $t=0.1$, and $q=1$, and (e, f) optimized structures of the bridge by SIMP and eCNNTO.

As illustrated in [Figure 14](https://arxiv.org/html/2606.19921#S4.F14), the optimized structures of eCNNTO strictly adhere to the non-design constraints and exhibit features similar to those obtained by SIMP. Furthermore, eCNNTO achieves substantial speedups of 79.7% (cantilever) and 89.0% (bridge) with even smaller compliance values, as shown in [Table 15](https://arxiv.org/html/2606.19921#S4.T15). Due to the hard constraints imposed on the element densities in non-design domains, the density transition across interfaces between non-design and design domains tends to be discontinuous, which causes SIMP to demand more iterations to enhance continuity and reach convergence. In contrast, when predicting densities at these interfaces, the spatial correlation of eCNNTO already accounts for the density constraints imposed by non-design domains and thus produces smooth transitions. For this reason, it can significantly reduce the number of subsequent SIMP iterations without compromising the optimized compliance value.

Table 15:Optimization results of non\-design domain examples\.

## 5Conclusion

In this work, we propose eCNNTO, an element-based Convolutional Neural Network (CNN) designed to significantly accelerate density-based topology optimization by learning the local density evolution at the element level. By integrating a CNN with residual connections, the network explicitly captures the spatial correlations among neighboring elements, thereby fundamentally enhancing structural connectivity. Data generation is highly efficient in that it only requires running a few simple benchmark problems. Furthermore, we introduce a novel training strategy that selects features from the final stage for training, which substantially reduces the training data size. By accounting for spatial dependencies, eCNNTO effectively suppresses isolated structural pieces and intermediate-density defects, consistently yielding physically valid structures in various cases.

Once the model is trained on simple settings, it can be directly applied to a variety of unseen boundary conditions, loading cases, design domain geometries, mesh resolutions, and non-design domains, consistently demonstrating strong generalization capabilities and significant speedups. Compared to SIMP, eCNNTO reduces the number of iterations by up to 90% in 2D and 97% in 3D, while maintaining comparable or, in most cases, better compliance values. Compared to DLTOP, eCNNTO requires a significantly smaller training dataset, exhibits much better structural connectivity, and achieves greater speedups.

In the future, several promising directions are worth exploring. For example, applying the proposed model to multi-physics topology optimization may help tackle the curse of dimensionality in such problems. Adopting graph neural networks (GNNs) may facilitate the application of eCNNTO to unstructured meshes. Finally, incorporating volume constraints into the learning process may be worth investigating, as it can further reduce the SIMP iterations required after the eCNNTO prediction.

## Acknowledgments

S\. Lu and X\. Wei are partially supported by National Natural Science Foundation of China \(No\. 12571408 and No\. 12494550/12494555\)\.

## Appendix A Impact of training strategies

To avoid distracting readers from the test problems in the main text, we defer the study of different training strategies to this appendix\. We compare three training strategies using the data prepared in [Section 3\.2](https://arxiv.org/html/2606.19921#S3.SS2); see [Figure 4](https://arxiv.org/html/2606.19921#S3.F4)\. That is, we train three models with features from the Early Stage, the Middle Stage, and the Final Stage, respectively\. Once trained, they are tested on the same set of 2D test problems; see [Figure 15](https://arxiv.org/html/2606.19921#A1.F15)\. These test problems use the same settings: a sensitivity filter with radius 3, a mesh of $80\times 80$, and an input size of $N=24$\.
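As a rough illustration of what "features from the Early, Middle, or Final Stage" can mean in practice, the sketch below slices a stored density history into a window of $N$ consecutive iterations. The exact stage boundaries are defined in Section 3.2 and Figure 4 and are not reproduced here, so the window positions used below (first $N$ iterations, a centered window, and the $N$ iterations immediately before the last one) are assumptions, as are the function name `stage_window` and the choice of the last stored iteration as the training target.

```python
import numpy as np


def stage_window(history, n=24, stage="final"):
    """Return (features, target) from a density history.

    history: (T, H, W) array holding the density field of each of T iterations.
    features: (n, H, W) window of consecutive iterations (assumed stage layout).
    target:   (H, W) density field of the last stored iteration.
    """
    candidates = history[:-1]  # the last iteration is reserved as the target
    t = candidates.shape[0]
    if t < n:
        raise ValueError("history is too short for the requested window")
    if stage == "early":
        start = 0
    elif stage == "middle":
        start = (t - n) // 2
    elif stage == "final":
        start = t - n
    else:
        raise ValueError(f"unknown stage: {stage!r}")
    return candidates[start:start + n], history[-1]


# Synthetic history: iteration k stores the constant field k (placeholder data).
T, ny, nx = 100, 4, 8
history = np.arange(T, dtype=float)[:, None, None] * np.ones((T, ny, nx))

for stage in ("early", "middle", "final"):
    feats, target = stage_window(history, n=24, stage=stage)
    print(stage, feats.shape, int(feats[0, 0, 0]), int(feats[-1, 0, 0]))
```

Under these assumptions, the three strategies differ only in which slice of the same history is fed to the network, so they can be compared with the same data and the same model.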

![Refer to caption](https://arxiv.org/html/2606.19921v1/x50.png)\(a\)
![Refer to caption](https://arxiv.org/html/2606.19921v1/x51.png)\(b\)
![Refer to caption](https://arxiv.org/html/2606.19921v1/x52.png)\(c\)

Figure 15: Three test examples used to study the impact of different training strategies\. \(a\) A cantilever beam subjected to a concentrated force $F=1$ at its lower\-right corner, where $L_x=2$ and $L_y=1$, \(b\) a beam clamped on both ends with a uniformly distributed force $q=1$ on the top, where $L_x=2$ and $L_y=1$, and \(c\) a simply supported beam subjected to a concentrated force $F=1$ on the top, where $L_x=2$ and $L_y=\frac{1}{3}$\.

![Refer to caption](https://arxiv.org/html/2606.19921v1/x53.png)\(a\) Early Stage
![Refer to caption](https://arxiv.org/html/2606.19921v1/x54.png)\(b\) Middle Stage
![Refer to caption](https://arxiv.org/html/2606.19921v1/x55.png)\(c\) Final Stage
![Refer to caption](https://arxiv.org/html/2606.19921v1/x56.png)\(d\) Early Stage
![Refer to caption](https://arxiv.org/html/2606.19921v1/x57.png)\(e\) Middle Stage
![Refer to caption](https://arxiv.org/html/2606.19921v1/x58.png)\(f\) Final Stage
![Refer to caption](https://arxiv.org/html/2606.19921v1/x59.png)\(g\) Early Stage
![Refer to caption](https://arxiv.org/html/2606.19921v1/x60.png)\(h\) Middle Stage
![Refer to caption](https://arxiv.org/html/2606.19921v1/x61.png)\(i\) Final Stage

Figure 16: Comparison of predicted structures using models trained with different features\. Each row corresponds to a test problem in [Figure 15](https://arxiv.org/html/2606.19921#A1.F15), whereas each column corresponds to a particular training strategy\.

The structures predicted by these models are shown in [Figure 16](https://arxiv.org/html/2606.19921#A1.F16)\. The predictions of the Early\-Stage model frequently exhibit jagged boundaries\. This arises from the noise in the early density evolution, which compromises learning and prevents smooth structural transitions\. The predictions of the Middle\-Stage model exhibit more intermediate\-density regions around the boundaries\. This indicates that the model fails to make correct predictions for boundary elements, resulting in numerous intermediate\-density regions in the predicted structures and even higher iteration counts than SIMP\. In contrast, the model trained on the Final\-Stage features produces valid structures with clear boundaries for all three examples while achieving significant speedup\.
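The amount of intermediate density in a predicted structure can also be expressed as a number. One standard choice is the measure of non-discreteness from Sigmund \(2007\), $M_{nd} = \frac{100\%}{n}\sum_{e=1}^{n} 4\rho_e(1-\rho_e)$, which is 0% for a pure 0/1 design and 100% when every element has density 0.5. The sketch below computes it for an arbitrary density field. The appendix does not report this measure, so its use here is an assumption made for illustration, and the example fields are synthetic placeholders rather than results.

```python
import numpy as np


def nondiscreteness(rho):
    """Measure of non-discreteness M_nd in percent for densities in [0, 1]."""
    rho = np.asarray(rho, dtype=float)
    return 100.0 * np.mean(4.0 * rho * (1.0 - rho))


# Synthetic placeholder fields, not outputs of any trained model.
crisp = np.array([[0.0, 1.0, 1.0], [0.0, 0.0, 1.0]])   # clear boundaries
blurry = np.array([[0.0, 0.5, 1.0], [0.0, 0.5, 1.0]])  # gray boundary column

print(nondiscreteness(crisp))
print(nondiscreteness(blurry))
```

A lower value indicates a design closer to a clear solid/void layout. A prediction with many gray boundary elements scores higher and would typically need more follow-up iterations to converge.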

## References

- N\. Aage, E\. Andreassen, B\. S\. Lazarov, and O\. Sigmund \(2017\)Giga\-voxel computational morphogenesis for structural design\.Nature550\(7674\),pp\. 84–86\.External Links:ISSN 1476\-4687,[Document](https://dx.doi.org/10.1038/nature23911)Cited by:[§1](https://arxiv.org/html/2606.19921#S1.p2.1)\.
- F\. Ahadi, M\. Azadi, M\. Biglari, and M\. Bodaghi \(2024\)Topology optimization of coronary artery stent considering structural and hemodynamic parameters\.Heliyon10\(20\)\.External Links:ISSN 2405\-8440,[Document](https://dx.doi.org/10.1016/j.heliyon.2024.e39452)Cited by:[§1](https://arxiv.org/html/2606.19921#S1.p1.1)\.
- E\. Andreassen, A\. Clausen, M\. Schevenels, B\. S\. Lazarov, and O\. Sigmund \(2011\)Efficient topology optimization in MATLAB using 88 lines of code\.Structural and Multidisciplinary Optimization43\(1\),pp\. 1–16\.External Links:ISSN 1615\-1488,[Document](https://dx.doi.org/10.1007/s00158-010-0594-7)Cited by:[§1](https://arxiv.org/html/2606.19921#S1.p2.1),[§2\.1](https://arxiv.org/html/2606.19921#S2.SS1.p11.1),[§3\.2](https://arxiv.org/html/2606.19921#S3.SS2.p3.2),[§4\.6](https://arxiv.org/html/2606.19921#S4.SS6.p1.1)\.
- S\. Banga, H\. Gehani, S\. Bhilare, S\. Patel, and L\. Kara \(2018\)3D topology optimization using convolutional neural networks\.arXiv\.External Links:1808\.07440,[Document](https://dx.doi.org/10.48550/arXiv.1808.07440)Cited by:[§1](https://arxiv.org/html/2606.19921#S1.p3.1)\.
- M\. M\. Behzadi and H\. T\. Ilieş \(2021\)GANTL: Toward practical and real\-time topology optimization with conditional generative adversarial networks and transfer learning\.Journal of Mechanical Design144\(021711\)\.External Links:ISSN 1050\-0472,[Document](https://dx.doi.org/10.1115/1.4052757)Cited by:[§1](https://arxiv.org/html/2606.19921#S1.p3.1)\.
- M\. P\. Bendsøe \(1989\)Optimal shape design as a material distribution problem\.Structural optimization1\(4\),pp\. 193–202\.External Links:ISSN 1615\-1488,[Document](https://dx.doi.org/10.1007/BF01650949)Cited by:[§1](https://arxiv.org/html/2606.19921#S1.p2.1)\.
- M\. P\. Bendsøe and O\. Sigmund \(2004\)Topology optimization\.Springer Berlin Heidelberg,Berlin, Heidelberg\.External Links:[Document](https://dx.doi.org/10.1007/978-3-662-05086-6),ISBN 978\-3\-642\-07698\-5 978\-3\-662\-05086\-6Cited by:[§2\.1](https://arxiv.org/html/2606.19921#S2.SS1.p8.7)\.
- A\. Dosovitskiy, L\. Beyer, A\. Kolesnikov, D\. Weissenborn, X\. Zhai, T\. Unterthiner, M\. Dehghani, M\. Minderer, G\. Heigold, S\. Gelly, J\. Uszkoreit, and N\. Houlsby \(2021\)An image is worth 16x16 words: Transformers for image recognition at scale\.arXiv\.External Links:2010\.11929,[Document](https://dx.doi.org/10.48550/arXiv.2010.11929)Cited by:[Remark 1](https://arxiv.org/html/2606.19921#Thmrmk1.p1.1.1)\.
- X\. Guo, W\. Zhang, and W\. Zhong \(2014\)Doing topology optimization explicitly and geometrically—a new moving morphable components based framework\.Journal of Applied Mechanics81\(8\),pp\. 081009\.External Links:ISSN 0021\-8936, 1528\-9036,[Document](https://dx.doi.org/10.1115/1.4027609)Cited by:[§1](https://arxiv.org/html/2606.19921#S1.p2.1)\.
- K\. He, X\. Zhang, S\. Ren, and J\. Sun \(2016\)Deep residual learning for image recognition\.InProceedings of the IEEE Conference on Computer Vision and Pattern Recognition,pp\. 770–778\.Cited by:[§3](https://arxiv.org/html/2606.19921#S3.p1.1)\.
- Y\. Joo, H\. Choi, G\. Jeong, and Y\. Yu \(2024\)Dynamic graph\-based convergence acceleration for topology optimization in unstructured meshes\.Engineering Applications of Artificial Intelligence132,pp\. 107916\.External Links:ISSN 0952\-1976,[Document](https://dx.doi.org/10.1016/j.engappai.2024.107916)Cited by:[§1](https://arxiv.org/html/2606.19921#S1.p4.1)\.
- Y\. Joo, Y\. Yu, and I\. G\. Jang \(2021\)Unit module\-based convergence acceleration for topology optimization using the spatiotemporal deep neural network\.IEEE Access9,pp\. 149766–149779\.External Links:ISSN 2169\-3536,[Document](https://dx.doi.org/10.1109/ACCESS.2021.3125014)Cited by:[§1](https://arxiv.org/html/2606.19921#S1.p5.1)\.
- N\. Ath\. Kallioras, G\. Kazakis, and N\. D\. Lagaros \(2020\)Accelerated topology optimization by means of deep learning\.Structural and Multidisciplinary Optimization62\(3\),pp\. 1185–1212\.External Links:ISSN 1615\-1488,[Document](https://dx.doi.org/10.1007/s00158-020-02545-z)Cited by:[§1](https://arxiv.org/html/2606.19921#S1.p5.1),[§1](https://arxiv.org/html/2606.19921#S1.p6.1),[§2\.2](https://arxiv.org/html/2606.19921#S2.SS2.p1.1),[§4\.3](https://arxiv.org/html/2606.19921#S4.SS3.p5.1)\.
- A\. Kawamoto, T\. Matsumori, S\. Yamasaki, T\. Nomura, T\. Kondoh, and S\. Nishiwaki \(2011\)Heaviside projection based topology optimization by a PDE\-filtered scalar function\.Structural and Multidisciplinary Optimization44\(1\),pp\. 19–24\.External Links:ISSN 1615\-1488,[Document](https://dx.doi.org/10.1007/s00158-010-0562-2)Cited by:[§2\.1](https://arxiv.org/html/2606.19921#S2.SS1.p10.1),[§4\.4](https://arxiv.org/html/2606.19921#S4.SS4.p1.3)\.
- Y\. LeCun, Y\. Bengio, and G\. Hinton \(2015\)Deep learning\.Nature521\(7553\),pp\. 436–444\.External Links:ISSN 1476\-4687,[Document](https://dx.doi.org/10.1038/nature14539)Cited by:[§3\.1](https://arxiv.org/html/2606.19921#S3.SS1.p2.1)\.
- J\. Lim, K\. Jung, Y\. Jung, and D\. Kim \(2024\)Accelerating topology optimization using deep learning\-based image super\-resolution\.Engineering Applications of Artificial Intelligence133,pp\. 108370\.External Links:ISSN 0952\-1976,[Document](https://dx.doi.org/10.1016/j.engappai.2024.108370)Cited by:[§1](https://arxiv.org/html/2606.19921#S1.p4.1)\.
- B\. S\. Mekki, J\. Langer, and S\. Lynch \(2021\)Genetic algorithm based topology optimization of heat exchanger fins used in aerospace applications\.International Journal of Heat and Mass Transfer170,pp\. 121002\.External Links:ISSN 0017\-9310,[Document](https://dx.doi.org/10.1016/j.ijheatmasstransfer.2021.121002)Cited by:[§1](https://arxiv.org/html/2606.19921#S1.p1.1)\.
- Z\. Nie, T\. Lin, H\. Jiang, and L\. B\. Kara \(2021\)TopologyGAN: Topology optimization using generative adversarial networks based on physical fields over the initial domain\.Journal of Mechanical Design143\(031715\)\.External Links:ISSN 1050\-0472,[Document](https://dx.doi.org/10.1115/1.4049533)Cited by:[§1](https://arxiv.org/html/2606.19921#S1.p3.1)\.
- A\. P\. Padhi, S\. Chakraborty, A\. Chakrabarti, and R\. Chowdhury \(2024\)Deep learning accelerated efficient framework for topology optimization\.Engineering Applications of Artificial Intelligence133,pp\. 108559\.External Links:ISSN 0952\-1976,[Document](https://dx.doi.org/10.1016/j.engappai.2024.108559)Cited by:[§1](https://arxiv.org/html/2606.19921#S1.p4.1)\.
- C\. Qian and W\. Ye \(2021\)Accelerating gradient\-based topology optimization design with dual\-model artificial neural networks\.Structural and Multidisciplinary Optimization63\(4\),pp\. 1687–1707\.External Links:ISSN 1615\-1488,[Document](https://dx.doi.org/10.1007/s00158-020-02770-6)Cited by:[§1](https://arxiv.org/html/2606.19921#S1.p3.1)\.
- J\. Sheng and X\. Wei \(2025\)Isogeometric topology optimization of thin\-walled structures with complex design domains\.Computer Methods in Applied Mechanics and Engineering444,pp\. 118114\.External Links:ISSN 0045\-7825,[Document](https://dx.doi.org/10.1016/j.cma.2025.118114)Cited by:[§1](https://arxiv.org/html/2606.19921#S1.p2.1)\.
- O\. Sigmund and J\. Petersson \(1998\)Numerical instabilities in topology optimization: a survey on procedures dealing with checkerboards, mesh\-dependencies and local minima\.Structural optimization16\(1\),pp\. 68–75\.External Links:ISSN 1615\-1488,[Document](https://dx.doi.org/10.1007/BF01214002)Cited by:[§2\.1](https://arxiv.org/html/2606.19921#S2.SS1.p10.1)\.
- O\. Sigmund \(2001\)A 99 line topology optimization code written in MATLAB\.Structural and Multidisciplinary Optimization21\(2\),pp\. 120–127\.External Links:ISSN 1615\-1488,[Document](https://dx.doi.org/10.1007/s001580050176)Cited by:[§2\.1](https://arxiv.org/html/2606.19921#S2.SS1.p8.7)\.
- O\. Sigmund and K\. Maute \(2012\)Sensitivity filtering from a continuum mechanics perspective\.Structural and Multidisciplinary Optimization46\(4\),pp\. 471–475\.External Links:ISSN 1615\-1488,[Document](https://dx.doi.org/10.1007/s00158-012-0814-4)Cited by:[§2\.1](https://arxiv.org/html/2606.19921#S2.SS1.p10.1)\.
- O\. Sigmund \(2007\)Morphology\-based black and white filters for topology optimization\.Structural and Multidisciplinary Optimization33\(4\),pp\. 401–424\.External Links:ISSN 1615\-1488,[Document](https://dx.doi.org/10.1007/s00158-006-0087-x)Cited by:[§2\.1](https://arxiv.org/html/2606.19921#S2.SS1.p10.1)\.
- I\. Sosnovik and I\. Oseledets \(2019\)Neural networks for topology optimization\.Russian Journal of Numerical Analysis and Mathematical Modelling34\(4\),pp\. 215–223\.External Links:ISSN 1569\-3988,[Document](https://dx.doi.org/10.1515/rnam-2019-0018)Cited by:[§1](https://arxiv.org/html/2606.19921#S1.p3.1)\.
- T\. Su, T\. He, R\. Yang, and M\. Li \(2022\)Topology optimization and lightweight design of stamping dies for forming automobile panels\.The International Journal of Advanced Manufacturing Technology121\(7\),pp\. 4691–4702\.External Links:ISSN 1433\-3015,[Document](https://dx.doi.org/10.1007/s00170-022-09683-2)Cited by:[§1](https://arxiv.org/html/2606.19921#S1.p1.1)\.
- K\. Svanberg \(1987\)The method of moving asymptotes—a new method for structural optimization\.International Journal for Numerical Methods in Engineering24\(2\),pp\. 359–373\.External Links:ISSN 1097\-0207,[Document](https://dx.doi.org/10.1002/nme.1620240207)Cited by:[§2\.1](https://arxiv.org/html/2606.19921#S2.SS1.p8.7)\.
- J\. Wang, N\. Aage, J\. Wu, O\. Sigmund, and R\. Westermann \(2025\)Efficient large\-scale 3D topology optimization with matrix\-free MATLAB code\.Structural and Multidisciplinary Optimization68\(9\),pp\. 174\.External Links:ISSN 1615\-1488,[Document](https://dx.doi.org/10.1007/s00158-025-04127-3)Cited by:[§1](https://arxiv.org/html/2606.19921#S1.p2.1),[§2\.1](https://arxiv.org/html/2606.19921#S2.SS1.p11.1),[§3\.2](https://arxiv.org/html/2606.19921#S3.SS2.p3.2)\.
- M\. Y\. Wang, X\. Wang, and D\. Guo \(2003\)A level set method for structural topology optimization\.Computer Methods in Applied Mechanics and Engineering192\(1\-2\),pp\. 227–246\.External Links:ISSN 00457825,[Document](https://dx.doi.org/10.1016/S0045-7825%2802%2900559-5)Cited by:[§1](https://arxiv.org/html/2606.19921#S1.p2.1)\.
- Y\. Wang, Hirshikesh, T\. Yu, S\. Natarajan, and T\. Q\. Bui \(2024\)Phase\-field method combined with optimality criteria approach for topology optimization\.Applied Mathematical Modelling129,pp\. 509–521\.External Links:ISSN 0307\-904X,[Document](https://dx.doi.org/10.1016/j.apm.2024.02.006)Cited by:[§1](https://arxiv.org/html/2606.19921#S1.p2.1)\.
- R\. V\. Woldseth, N\. Aage, J\. A\. Bærentzen, and O\. Sigmund \(2022\)On the use of artificial neural networks in topology optimisation\.Structural and Multidisciplinary Optimization65\(10\),pp\. 294\.External Links:ISSN 1615\-1488,[Document](https://dx.doi.org/10.1007/s00158-022-03347-1)Cited by:[§1](https://arxiv.org/html/2606.19921#S1.p3.1),[§1](https://arxiv.org/html/2606.19921#S1.p4.1)\.
- L\. Xia, Q\. Xia, X\. Huang, and Y\. M\. Xie \(2018\)Bi\-directional evolutionary structural optimization on advanced structures and materials: a comprehensive review\.Archives of Computational Methods in Engineering25\(2\),pp\. 437–478\.External Links:ISSN 1134\-3060, 1886\-1784,[Document](https://dx.doi.org/10.1007/s11831-016-9203-2)Cited by:[§1](https://arxiv.org/html/2606.19921#S1.p2.1)\.
- Y\.M\. Xie and G\.P\. Steven \(1993\)A simple evolutionary procedure for structural optimization\.Computers & Structures49\(5\),pp\. 885–896\.External Links:ISSN 00457949,[Document](https://dx.doi.org/10.1016/0045-7949%2893%2990035-C)Cited by:[§1](https://arxiv.org/html/2606.19921#S1.p2.1)\.
- Y\. Xing and L\. Tong \(2023\)An online autonomous learning and prediction scheme for machine learning assisted structural optimization\.Thin\-Walled Structures184,pp\. 110500\.External Links:ISSN 0263\-8231,[Document](https://dx.doi.org/10.1016/j.tws.2022.110500)Cited by:[§1](https://arxiv.org/html/2606.19921#S1.p3.1)\.
- H\. Xue, Z\. Luo, T\. Brown, and S\. Beier \(2020\)Design of self\-expanding auxetic stents using topology optimization\.Frontiers in Bioengineering and Biotechnology8\.External Links:ISSN 2296\-4185,[Document](https://dx.doi.org/10.3389/fbioe.2020.00736)Cited by:[§1](https://arxiv.org/html/2606.19921#S1.p1.1)\.
- Y\. Zhang, Y\. Shan, X\. Liu, and T\. He \(2021\)An integrated multi\-objective topology optimization method for automobile wheels made of lightweight materials\.Structural and Multidisciplinary Optimization64\(3\),pp\. 1585–1605\.External Links:ISSN 1615\-1488,[Document](https://dx.doi.org/10.1007/s00158-021-02913-3)Cited by:[§1](https://arxiv.org/html/2606.19921#S1.p1.1)\.