LFNO: Bridging Laplace and Fourier via Transient-Steady Decomposition
Summary
LFNO is a unified neural operator framework that integrates Laplace and Fourier transforms to decompose system dynamics into transient and steady-state components, significantly outperforming existing operators on ODE and PDE benchmarks.
View Cached Full Text
Cached at: 06/09/26, 08:48 AM
# LFNO: Bridging Laplace and Fourier via Transient-Steady Decomposition
Source: [https://arxiv.org/html/2606.07601](https://arxiv.org/html/2606.07601)
###### Abstract
We introduce the Laplace–Fourier Neural Operator \(LFNO\), a unified framework for modeling dynamical systems across transient and steady\-state regimes by integrating the spectral advantages of Laplace and Fourier Neural Operators\. LFNO employs a dual\-branch architecture that explicitly decomposes system dynamics into transient and steady\-state components\. We evaluate LFNO on nine benchmarks, including three ODE systems \(Duffing, Lorenz, and Pendulum\) and six PDE systems \(Euler–Bernoulli beam, Heat, Reaction–diffusion, Brusselator, Burgers, and Navier–Stokes\)\. LFNO significantly outperforms existing operators on ODE systems, where transient dynamics dominate, and consistently surpasses LNO while achieving performance competitive with FNO on PDE benchmarks\. Furthermore, LFNO offers improved stability and physical interpretability through its component\-wise decomposition\. These results demonstrate that LFNO provides a robust and unified approach for learning complex dynamical systems across multiple temporal scales\.
Machine Learning, ICML
## 1Introduction
In recent years, advances through machine learning have emerged as a powerful paradigm for solving differential equations in several applications, such as fluid dynamics\(Lomaxet al\.,[2001](https://arxiv.org/html/2606.07601#bib.bib21); Bruntonet al\.,[2016](https://arxiv.org/html/2606.07601#bib.bib22); Linget al\.,[2016](https://arxiv.org/html/2606.07601#bib.bib23); Tartakovskyet al\.,[2020](https://arxiv.org/html/2606.07601#bib.bib24); Ranadeet al\.,[2021](https://arxiv.org/html/2606.07601#bib.bib25)\), electromagnetism\(Xionget al\.,[2019](https://arxiv.org/html/2606.07601#bib.bib26); Baldanet al\.,[2021](https://arxiv.org/html/2606.07601#bib.bib27); Zhanget al\.,[2021](https://arxiv.org/html/2606.07601#bib.bib28)\), and quantum mechanics\(Brockherdeet al\.,[2017](https://arxiv.org/html/2606.07601#bib.bib29); Weinan and Yu,[2018](https://arxiv.org/html/2606.07601#bib.bib30); Hermannet al\.,[2020](https://arxiv.org/html/2606.07601#bib.bib31)\)\. However, many of these approaches still struggle to handle problems with diverse boundary conditions or high\-dimensional dynamics efficiently\.
Neural operator frameworks were proposed to overcome these obstacles by directly learning the mapping between input functions and solutions\.\(Luet al\.,[2021](https://arxiv.org/html/2606.07601#bib.bib6)\)represented input functions through a branch network and query locations through a trunk network, enabling the approximation of highly nonlinear operators across diverse domains\. Based on this concept, neural networks for solving partial differential equations \(PDEs\), such as those in\(Liet al\.,[2020a](https://arxiv.org/html/2606.07601#bib.bib3),[b](https://arxiv.org/html/2606.07601#bib.bib4)\), emerged to generalize resolution\-invariant methods\. Fourier Neural Operator \(FNO\)\(Liet al\.,[2021](https://arxiv.org/html/2606.07601#bib.bib5)\)was later introduced as a spectral approach to operator learning\. FNO uses Fourier transforms to parameterize integral kernels in the frequency domain, retaining and updating only low\-frequency modes for resolution invariance, efficient training, and strong performance on complex PDE benchmarks\.\(Shinet al\.,[2022](https://arxiv.org/html/2606.07601#bib.bib8)\)theoretically and experimentally demonstrated that extending the FNO through the refinement of Fourier\-based kernels can enhance expressiveness in high\-frequency or inhomogeneous scenarios\. However, its reliance on periodic boundary conditions and regular grids limits its applicability to general geometries\. While some studies have attempted to address these limitations by incorporating additional techniques into existing models\(Liet al\.,[2024](https://arxiv.org/html/2606.07601#bib.bib15); Diab and Al\-Kobaisi,[2025](https://arxiv.org/html/2606.07601#bib.bib20); Eshaghiet al\.,[2025](https://arxiv.org/html/2606.07601#bib.bib19)\), other works have aimed to solve the problem by expanding the domain\.
\(Fanaskov and Oseledets,[2023](https://arxiv.org/html/2606.07601#bib.bib11)\)enhances representation by combining Fourier series with Chebyshev bases, improving the approximation beyond periodic settings\.\(Bonevet al\.,[2023](https://arxiv.org/html/2606.07601#bib.bib12)\)extends the FNO approach to respect the geometry of the two\-dimensional sphere, which applies to forecast atmospheric dynamics, and\(Liet al\.,[2023](https://arxiv.org/html/2606.07601#bib.bib9); Tranet al\.,[2023](https://arxiv.org/html/2606.07601#bib.bib10)\)generalizes neural operators from uniform rectangular meshes to arbitrary geometries, expanding their applicability to complex spatial domains\. More recently, Laplace Neural Operator \(LNO\)\(Caoet al\.,[2024](https://arxiv.org/html/2606.07601#bib.bib17)\)was proposed to extend operator learning to the Laplace domain\. By exploiting a pole–residue formulation, LNO naturally captures both transient and steady\-state behaviors of dynamical systems, offering improved accuracy for problems with non\-periodic or exponentially decaying responses\. While LNO captures the representation of steady\-state signals, FNO provides a more robust mechanism for learning their complex, non\-linear modal interactions in the frequency domain\.
Inspired by the spectral decomposition capabilities of LNO and FNO, we propose the Laplace–Fourier Neural Operator \(LFNO\), a unified framework that synergistically combines their respective strengths\. LFNO leverages the Laplace\-domain representation to isolate transient dynamics while employing Fourier integral operators to enhance the expressivity of steady\-state components in the frequency domain\. Our contributions are as follows:
- •We develop LFNO, a novel operator that unifies the theoretical foundations of Laplace\-domain pole\-residue modeling and Fourier spectral analysis\.
- •We introduce an architecture\-level integration that treats transient and steady\-state components via dual\-branch processing, ensuring high\-fidelity approximation of complex temporal evolutions\.
- •Through comprehensive benchmarks, we demonstrate that LFNO significantly outperforms LNO and FNO on ODEs with transient behavior, while maintaining performance competitive with FNO on complex PDE systems\.
## 2Methodology
This section details how the complementary strengths of LNO and FNO are combined to develop our proposed method, the Laplace\-Fourier Neural Operator \(LFNO\)\.
#### Pole\-residue formulation
Building upon the theoretical foundations of LNO\(Caoet al\.,[2023](https://arxiv.org/html/2606.07601#bib.bib16),[2024](https://arxiv.org/html/2606.07601#bib.bib17)\), we model the dynamical system by substituting the kernel integral operator with a representation defined in the Laplace domain\. Specifically, we parameterize the kernelKϕ\(s\)K\_\{\\phi\}\(s\)in a pole\-residue form asKϕ\(s\)=∑n=1Nβns−μnK\_\{\\phi\}\(s\)=\\sum\_\{n=1\}^\{N\}\\frac\{\\beta\_\{n\}\}\{s\-\\mu\_\{n\}\}, whereβn\\beta\_\{n\}andμn\\mu\_\{n\}denote trainable complex poles and residues, respectively\. According to the residue theorem\(Caoet al\.,[2023](https://arxiv.org/html/2606.07601#bib.bib16)\), the transformed outputU\(s\)=Kϕ\(s\)V\(s\)U\(s\)=K\_\{\\phi\}\(s\)V\(s\)admits the following pole\-residue decomposition:
U\(s\)=∑n=1Nγns−μn\+∑ℓ=−∞∞λℓs−iωℓ,U\(s\)=\\sum\_\{n=1\}^\{N\}\\frac\{\\gamma\_\{n\}\}\{s\-\\mu\_\{n\}\}\+\\sum\_\{\\ell=\-\\infty\}^\{\\infty\}\\frac\{\\lambda\_\{\\ell\}\}\{s\-i\\omega\_\{\\ell\}\},\(1\)
where the response residuesγn\\gamma\_\{n\}andλℓ\\lambda\_\{\\ell\}are given by:
γn=βn∑ℓ=−∞∞αℓμn−iωℓ,λℓ=αℓ∑n=1Nβniωℓ−μn,\\gamma\_\{n\}=\\beta\_\{n\}\\sum\_\{\\ell=\-\\infty\}^\{\\infty\}\\frac\{\\alpha\_\{\\ell\}\}\{\\mu\_\{n\}\-i\\omega\_\{\\ell\}\},\\quad\\lambda\_\{\\ell\}=\\alpha\_\{\\ell\}\\sum\_\{n=1\}^\{N\}\\frac\{\\beta\_\{n\}\}\{i\\omega\_\{\\ell\}\-\\mu\_\{n\}\},\(2\)
whereαℓ\\alpha\_\{\\ell\}denotes the complex Fourier coefficients of the input\. This formulation allows for a rigorous separation of the response into two distinct components: the transient response, associated with the system polesμn\\mu\_\{n\}, and the steady\-state response, associated with the excitation polesiωℓi\\omega\_\{\\ell\}of the input signal\. By evaluating the residuesγn\\gamma\_\{n\}at the system polesμn\\mu\_\{n\}andλℓ\\lambda\_\{\\ell\}at the excitation polesiωℓi\\omega\_\{\\ell\}, the time\-domain solutionu\(t\)u\(t\)is reconstructed via the inverse Laplace transform as a superposition of decaying and oscillatory exponentials\.
While this dual\-response characterization extends spectral analysis beyond conventional Fourier\-based methods\(Liet al\.,[2021](https://arxiv.org/html/2606.07601#bib.bib5); Fanaskov and Oseledets,[2023](https://arxiv.org/html/2606.07601#bib.bib11)\), enabling the capture of non\-stationary decaying dynamics, the discrete treatment of steady\-state components in original LNOs may limit expressivity for complex nonlinear dependencies\. To address this, we leverage the FNO framework\(Liet al\.,[2021](https://arxiv.org/html/2606.07601#bib.bib5)\)to generalize the steady\-state response\. By applying a learned operatorRRin the frequency domain, our LFNO effectively captures spectral coupling and nonlinearities that simple linear pole\-residue methods might overlook\. This hybrid approach ensures that the transient branch resolves decaying dynamics on thess\-plane, while the steady\-state branch achieves high\-fidelity convergence in the frequency domain\.
#### Mathematical formulation
Following\(Caoet al\.,[2024](https://arxiv.org/html/2606.07601#bib.bib17)\), we adopt the simple pole assumption for the Laplace domain representation, where the inverse Laplace transform of the learned representation yields the solution:
u\(t\)=∑n=1Nγneμnt\+∑ℓ=−LLλℓeiωℓt,u\(t\)=\\sum\_\{n=1\}^\{N\}\\gamma\_\{n\}e^\{\\mu\_\{n\}t\}\+\\sum\_\{\\ell=\-L\}^\{L\}\\lambda\_\{\\ell\}e^\{i\\omega\_\{\\ell\}t\},\(3\)
where the first term corresponds to the transient responseutransient\(t\)u\_\{transient\}\(t\)and the second to the steady\-state responseusteady\(t\)u\_\{steady\}\(t\)\. Sinceusteady\(t\)u\_\{steady\}\(t\)is a superposition of discrete frequency modes, we apply the Fourier transform to represent it in the frequency domain:
ℱ\(usteady\)\(ω\)=∑ℓ=−LLλℓδ\(ωℓ−ω\)\.\\mathcal\{F\}\(u\_\{steady\}\)\(\\omega\)=\\sum\_\{\\ell=\-L\}^\{L\}\\lambda\_\{\\ell\}\\delta\(\\omega\_\{\\ell\}\-\\omega\)\.\(4\)
We now apply the Fourier integral operator\(Liet al\.,[2021](https://arxiv.org/html/2606.07601#bib.bib5)\)\. In the discrete frequency domain, FNO parameterizes the operatorRRas a complex\-valued tensorR∈ℂd×d×\(2L\+1\)R\\in\\mathbb\{C\}^\{d\\times d\\times\(2L\+1\)\}, whereddis the number of hidden channels\. For each frequency modeℓ∈\{−L,…,L\}\\ell\\in\\\{\-L,\\dots,L\\\}, the updated steady\-state coefficientsλ^ℓ\\hat\{\\lambda\}\_\{\\ell\}are computed as:
λ^ℓ=Rℓ⋅λℓ,for−L≤ℓ≤L\\hat\{\\lambda\}\_\{\\ell\}=R\_\{\\ell\}\\cdot\\lambda\_\{\\ell\},\\quad\\text\{for \}\-L\\leq\\ell\\leq L\(5\)
whereRℓ∈ℂd×dR\_\{\\ell\}\\in\\mathbb\{C\}^\{d\\times d\}is the learnable weight matrix for theℓ\\ell\-th mode\. This transformation allows the model to learn a neural\-approximated transfer function that enhances the expressivity of the steady\-state component\. Applying the inverse transform, the enhanced steady\-state responseu~steady\\tilde\{u\}\_\{steady\}is obtained by
u~steady\(t\)=∑ℓ=−LLλ^ℓeiωℓt=∑ℓ=−LL\(Rℓλℓ\)eiωℓt\.\\tilde\{u\}\_\{steady\}\(t\)=\\sum\_\{\\ell=\-L\}^\{L\}\\hat\{\\lambda\}\_\{\\ell\}e^\{i\\omega\_\{\\ell\}t\}=\\sum\_\{\\ell=\-L\}^\{L\}\(R\_\{\\ell\}\\lambda\_\{\\ell\}\)e^\{i\\omega\_\{\\ell\}t\}\.\(6\)
The final solution is formulated as the superposition of the preserved transient dynamics and the filtered steady\-state response:
u\(t\)=∑n=1Nγneμnt\+∑ℓ=−LL\(Rℓλℓ\)eiωℓt\.u\(t\)=\\sum\_\{n=1\}^\{N\}\\gamma\_\{n\}e^\{\\mu\_\{n\}t\}\+\\sum\_\{\\ell=\-L\}^\{L\}\(R\_\{\\ell\}\\lambda\_\{\\ell\}\)e^\{i\\omega\_\{\\ell\}t\}\.\(7\)
The additive combination in[Equation7](https://arxiv.org/html/2606.07601#S2.E7)follows the superposition principle of linear differential equations, while the non\-linear interaction between components is further captured by the local linear transformations and projection layers in the subsequent architecture\.
#### Architecture
The overall architecture is illustrated in[Figure1](https://arxiv.org/html/2606.07601#S2.F1)\. Starting from an input function𝐟\(t\)\\mathbf\{f\}\(t\), we first lift it to a higher\-dimensional representation using a neural network𝒫\\mathcal\{P\}\. We then apply the Laplace transform to obtainv\(t\)v\(t\)and decompose it into transient and steady\-state components for each channel\. In the steady layers, we apply the operatorRℓR\_\{\\ell\}to the lower Fourier modes while filtering out higher modes to achieve regularization and computational efficiency\(Liet al\.,[2021](https://arxiv.org/html/2606.07601#bib.bib5)\)\. The transient layers operate in the time\-domain, modeling decaying exponential dynamics\. In this study, we allocate more layers to the transient module than to the steady\-state module\. This design choice is motivated by the empirical observation that transient phases—characterized by rapid transitions and complex spatiotemporal gradients—typically require a higher model capacity to achieve convergence compared to the periodic and structured nature of the steady\-state response\. Finally, a local linear transformationWWintegrated with non\-linear activation functions is applied, and the result is projected back to the target dimension using another neural network𝒬\\mathcal\{Q\}\.
\(a\)
\(b\)
Figure 1:Model architecture of LFNO\. \(a\) Overall process: given input function𝐟\(t\)\\mathbf\{f\}\(t\), 1\. embed the input function𝐟\(t\)\\mathbf\{f\}\(t\)into a higher\-dimensional latent space𝐯\(t\)\\mathbf\{v\}\(t\)via a shallow neural network𝒫\\mathcal\{P\}, 2\. apply Laplace transformℒ\\mathcal\{L\}, 3\. take the Transient and the Steady layers for each component, and 3\. Project the integrated result back to the target space to obtain the outputu\(t\)u\(t\)\. \(b\) Steady layer structure: Hybrid spectral processing that applies the pole\-residue method followed by a learnable Fourier weight matrixRRon the lower Fourier modes, and filters out the higher modes\.
## 3Dataset Generation
To rigorously evaluate the generalization and data efficiency of LFNO, we synthesize custom datasets across a wide range of physical systems\. We adopt three ODEs \(Duffing, Lorenz, Pendulum\) and six PDEs \(Euler\-Bernoulli beam, heat, reaction\-diffusion, Brusselator, Burgers, and Navier\-Stokes\)\. These benchmarks are specifically chosen to represent diverse spectral characteristics, ranging from simple oscillations to highly nonlinear turbulence\. The sample visualizations for the dataset are shown in[AppendixD](https://arxiv.org/html/2606.07601#A4)\.
### 3\.1ODEs
We generate 380 samples for each ODE dataset by varying the parameterAAin the source termf\(t\)=Ae−0\.05tsin\(5t\)f\(t\)=Ae^\{\-0\.05t\}\\sin\(5t\), as in\(Caoet al\.,[2024](https://arxiv.org/html/2606.07601#bib.bib17)\), whereAAis sampled from 380 evenly spaced values in the interval\[0\.05,10\]\[0\.05,10\]\. While the number of independent trajectories is relatively compact \(N=380N=380\), each sample contains a high\-density sequence of 2048 time steps with a resolution ofΔt=0\.01s\\Delta t=0\.01s\. This configuration is intentionally designed to evaluate the model’s sample efficiency in data\-scarce scenarios, challenging the framework to resolve complex temporal evolutions from a limited pool of physical realizations\. Such a setup reflects real\-world scenarios where high\-fidelity data acquisition is computationally or experimentally expensive\. We employ thesolve\_ivpsolver with theRK45algorithm \(substep of 4\) to ensure numerical stability\. For all equations and coefficients, details are given in[SectionB\.1](https://arxiv.org/html/2606.07601#A2.SS1)\.
### 3\.2PDEs
The PDE datasets are synthesized using two distinct strategies to evaluate the framework’s versatility: \(1\) applying task\-specific forcing functions to simulate externally driven dynamics, and \(2\) varying initial conditions for unforced, nonlinear systems, such as the Burgers’ and Navier\-Stokes equations\. While the spatio\-temporal grid resolution for most PDEs is maintained at50×5050\\times 50, the Navier\-Stokes equation was simulated on a32×3232\\times 32spatial grid\. This adjustment was necessary to mitigate Out\-of\-Memory \(OOM\) constraints associated with the LNO architecture during high\-resolution 2D processing\. We utilize theRK45algorithm, except for the reaction\-diffusion \(R\-D\) equation, which requires theBDFmethod for stiff dynamics\. By maintaining a restricted sample size of 380 across all nine benchmarks, we demonstrate that LFNO possesses superior sample efficiency\. The details of the equations and coefficients are in[SectionB\.2](https://arxiv.org/html/2606.07601#A2.SS2)\. All PDE samples are described in[Table8](https://arxiv.org/html/2606.07601#A2.T8)\.
## 4Results
We evaluate the empirical performance of LFNO by comparing it against LNO and FNO across diverse problem settings\. Our analysis focuses on these three closely related frameworks—LFNO, LNO, and FNO—to ensure a rigorous comparison\. Quantitative performance is assessed using the relativeℒ2\\mathcal\{L\}\_\{2\}andℒ∞\\mathcal\{L\}\_\{\\infty\}error on test samples, while qualitative effectiveness is evaluated by visualization of predicted trajectories\. In particular, theℒ∞\\mathcal\{L\}\_\{\\infty\}metric provides a rigorous assessment of robustness against worst\-case local errors, such as shock\-induced oscillations in Burgers’ equation, which may be overlooked by average\-based metrics likeℒ2\\mathcal\{L\}\_\{2\}\.
Notably, for the Navier\-Stokes \(NS\) equations, we conduct experiments on two distinct regimes to verify the model’s scalability: \(i\) a canonical configuration featuring structured initial conditions to assess spectral energy preservation, and \(ii\) a high\-complexity turbulent regime using the standard benchmarks from the original FNO study to evaluate performance under low\-viscosity \(ν=10−3\\nu=10^\{\-3\}\) and multi\-scale vortex interactions\. For all benchmarks, we employ a fixed hyperparameter configuration and a consistent number of training epochs\. Detailed results follow in the subsections below, with implementation specifics provided in[AppendixA](https://arxiv.org/html/2606.07601#A1)\.
### 4\.1ODE Experiments
The detailedℒ2\\mathcal\{L\}\_\{2\}andℒ∞\\mathcal\{L\}\_\{\\infty\}errors are presented in[Table2](https://arxiv.org/html/2606.07601#S4.T2)and[Table3](https://arxiv.org/html/2606.07601#S4.T3)\. The results indicate that LFNO consistently achieves superior performance across diverse dynamical systems\. Specifically, in the Duffingc=0\.5c=0\.5and Lorenzρ=10\\rho=10tasks, LFNO significantly reduces theℒ∞\\mathcal\{L\}\_\{\\infty\}error compared to both FNO and LNO, demonstrating its robustness in handling complex nonlinearities\.
It should be noted that while FNO shows competitive performance in the purely periodic Duffingc=0c=0case following rigorous hyperparameter optimization, LFNO maintains a dominant advantage in systems with both transient and steady\-state characteristics, such as the Lorenz and dissipative Duffing/Pendulum oscillators\. For qualitative evaluation, we overlay the inference results of the baseline models as shown in[Table1](https://arxiv.org/html/2606.07601#S4.T1)\. In the Lorenz experiments, LFNO captures long\-term damping and chaotic trajectories with high fidelity, whereas LNO and FNO struggle to reproduce the correct amplitudes\. Furthermore, LFNO effectively integrates periodic and transient feature extraction, outperforming LNO in periodic capture for Duffingc=0\.5c=0\.5and Pendulumc=0\.5c=0\.5experiments\.
TaskLFNO vs\. FNOLFNO vs\. LNODuffingc=0c=0![[Uncaptioned image]](https://arxiv.org/html/2606.07601v1/images/results/ODE_results/FNO_LFNO_prediction_singleFigure_LFNO_1D_Duffing_c0.png)![[Uncaptioned image]](https://arxiv.org/html/2606.07601v1/images/results/ODE_results/LNO_LFNO_prediction_singleFigure_LFNO_1D_Duffing_c0.png)Duffingc=0\.5c=0\.5![[Uncaptioned image]](https://arxiv.org/html/2606.07601v1/images/results/ODE_results/FNO_LFNO_prediction_singleFigure_LFNO_1D_Duffing_c05.png)![[Uncaptioned image]](https://arxiv.org/html/2606.07601v1/images/results/ODE_results/LNO_LFNO_prediction_singleFigure_LFNO_1D_Duffing_c05.png)Lorenzρ=5\\rho=5![[Uncaptioned image]](https://arxiv.org/html/2606.07601v1/images/results/ODE_results/FNO_LFNO_prediction_singleFigure_LFNO_1D_Lorenz_rho5.png)![[Uncaptioned image]](https://arxiv.org/html/2606.07601v1/images/results/ODE_results/LNO_LFNO_prediction_singleFigure_LFNO_1D_Lorenz_rho5.png)Lorenzρ=10\\rho=10![[Uncaptioned image]](https://arxiv.org/html/2606.07601v1/images/results/ODE_results/FNO_LFNO_prediction_singleFigure_LFNO_1D_Lorenz_rho10.png)![[Uncaptioned image]](https://arxiv.org/html/2606.07601v1/images/results/ODE_results/LNO_LFNO_prediction_singleFigure_LFNO_1D_Lorenz_rho10.png)Pendulumc=0\.5c=0\.5![[Uncaptioned image]](https://arxiv.org/html/2606.07601v1/images/results/ODE_results/FNO_LFNO_prediction_singleFigure_LFNO_1D_Pendulum_c05.png)![[Uncaptioned image]](https://arxiv.org/html/2606.07601v1/images/results/ODE_results/LNO_LFNO_prediction_singleFigure_LFNO_1D_Pendulum_c05.png)Table 1:Qualitative results for three models\. Comparison of LFNO with FNO \(left\) and LNO \(right\)\. Red: LFNO results; Green: comparison model results\.ODEsEquationDuffingc=0c=0Duffingc=0\.5c=0\.5Lorenzρ=5\\rho=5Lorenzρ=10\\rho=10Pendulumc=0\.5c=0\.5Epochs50005200810040006800FNO0\.07260\.05650\.12250\.43400\.0426LNO0\.20140\.07620\.02880\.25710\.0735LFNO0\.05180\.01420\.00750\.17310\.0315
Table 2:ℒ2\\mathcal\{L\}\_\{2\}error table for ODEs tasks\.ODEsEquationDuffingc=0c=0Duffingc=0\.5c=0\.5Lorenzρ=5\\rho=5Lorenzρ=10\\rho=10Pendulumc=0\.5c=0\.5Epochs50005000500050005000FNO0\.13380\.02930\.42950\.71370\.0356LNO0\.66860\.16000\.08370\.40870\.1405LFNO0\.28420\.02180\.22450\.26460\.0337
Table 3:ℒ∞\\mathcal\{L\}\_\{\\infty\}error table for ODEs tasks\.PDEsEquationBeamHeatR\-DBrusselatorBurgersNSNSν=10−3\\nu=10^\{\-3\}Epochs5000500050005000500050005000FNO0\.00370\.00150\.00280\.00230\.00460\.00060\.0286LNO0\.25090\.02270\.07790\.01370\.02750\.00420\.0458LFNO0\.00950\.00310\.00430\.00290\.00260\.00040\.0026Table 4:ℒ2\\mathcal\{L\}\_\{2\}error table for PDEs tasks\.PDEsEquationBeamHeatR\-DBrusselatorBurgersNSNSν=10−3\\nu=10^\{\-3\}Epochs5000500050005000500050005000FNO0\.00720\.00590\.00780\.00830\.04710\.00240\.0411LNO0\.27980\.05600\.14000\.05670\.23870\.00660\.1268LFNO0\.02450\.01240\.01480\.01100\.01250\.00240\.0071Table 5:ℒ∞\\mathcal\{L\}\_\{\\infty\}error table for PDEs tasks\.GTFNOLFNOLNOBeamHeatR\-DBrusselatorBurgersNSFigure 2:Qualitative comparison on six PDE benchmarks\. Columns correspond to ground truth \(GT\), FNO, LFNO, and LNO predictions\. Full\-resolution results are provided in[AppendixE](https://arxiv.org/html/2606.07601#A5)\.
### 4\.2PDE Experiments
As summarized in[Table4](https://arxiv.org/html/2606.07601#S4.T4)and[Table5](https://arxiv.org/html/2606.07601#S4.T5), LFNO demonstrates performance highly competitive with, and in several cases superior to, FNO, while significantly surpassing LNO across all PDE benchmarks\. Most notably, in the Burgers’ equation, LFNO yields anℒ∞\\mathcal\{L\}\_\{\\infty\}error of 0\.0125, representing nearly a four\-fold reduction compared to FNO’s 0\.0471\. This substantial suppression of the maximum point\-wise error proves that LFNO’s dual\-branch architecture effectively mitigates spectral ringing artifacts and local instabilities near shock fronts—regions where pure Fourier\-based methods typically incur high worst\-case errors\.
The superiority of LFNO becomes even more pronounced in high\-complexity regimes, such as the Navier\-Stokes equation with lower viscosity \(ν=10−3\\nu=10^\{\-3\}\)\. In this challenging task, LFNO achieves anℒ2\\mathcal\{L\}\_\{2\}error of 0\.0026, outperforming FNO \(0\.0286\) by over an order of magnitude\. While FNO’s performance degrades significantly as the complexity of the flow increases, LFNO maintains high\-fidelity spectral representations with a remarkably lowℒ∞\\mathcal\{L\}\_\{\\infty\}error of 0\.0071 \(compared to FNO’s 0\.0411\)\. This demonstrates that LFNO’s dual\-branch approach is uniquely capable of resolving intricate, fine\-scale vortical structures that emerge in low\-viscosity regimes, which single\-branch operators often fail to capture\.
Qualitative comparisons in[Figure2](https://arxiv.org/html/2606.07601#S4.F2)further underscore this trend; while LNO completely fails to produce meaningful predictions in stiff systems like the Euler\-Bernoulli beam, LFNO excels in recovering complex spatiotemporal patterns\. These results indicate that by decomposing dynamics into transient and steady components, LFNO ensures a more stable and interpretable learning process, maintaining its worst\-case robustness even in stiff dynamical systems characterized by multi\-scale spatiotemporal features\.
### 4\.3Learning Curves
We analyze the training dynamics of FNO, LNO, and LFNO by comparing their loss trajectories over the designated training epochs in[Figure3](https://arxiv.org/html/2606.07601#S4.F3),[Figure4](https://arxiv.org/html/2606.07601#S4.F4)\. Across all experiments, the training and validation losses are visualized for LFNO \(lime, red\), LNO \(orange, blue\), and FNO \(magenta, cyan\)\.
In ODE tasks, LFNO demonstrates superior optimization stability compared to the baselines\. As illustrated in[Figure3\(a\)](https://arxiv.org/html/2606.07601#S4.F3.sf1),[Figure3\(c\)](https://arxiv.org/html/2606.07601#S4.F3.sf3), and[Figure3\(d\)](https://arxiv.org/html/2606.07601#S4.F3.sf4), FNO frequently exhibits optimization failures or stagnation, particularly in chaotic regimes\. While LNO manages to reduce loss, it displays significant fluctuations in the Lorenz systems, suggesting difficulty in stabilizing the pole\-residue approximation\. In contrast, LFNO maintains a monotonic and stable decay, indicating that the transient\-steady decomposition effectively regularizes the loss landscape, preventing the model from being trapped in suboptimal local minima\.
For the PDE experiments, the convergence behavior reveals an intriguing decoupling between training loss and empirical accuracy\. In tasks such as the Burgers’ and Navier\-Stokes equations, FNO reaches the lowest training\-loss floor, settling to a decimal order of10−510^\{\-5\}\. However, when comparing the final testℒ2\\mathcal\{L\}\_\{2\}errors, LFNO achieves superior performance in[Table4](https://arxiv.org/html/2606.07601#S4.T4), despite maintaining a slightly higher validation loss plateau of10−410^\{\-4\}\.
This phenomenon suggests that LFNO’s explicit separation of transient and steady\-state responses acts as an implicit regularizer\. By isolating transient gradients, LFNO prevents the model from overfitting to high\-frequency spectral noise, which FNO may capture in its pursuit of a lower loss floor, thereby preserving a more physically consistent representation of the underlying field\. Meanwhile, LFNO maintains a convergence plateau consistently one order of magnitude lower than that of LNO, which fails to achieve competitive results in stiff systems such as the Euler\-Bernoulli beam in[Figure4\(a\)](https://arxiv.org/html/2606.07601#S4.F4.sf1)\.
\(a\)Duffingc=0c=0\(b\)Duffingc=0\.5c=0\.5\(c\)Lorenzρ=5\\rho=5\(d\)Lorenzρ=10\\rho=10\(e\)Pendulumc=0\.5c=0\.5
Figure 3:ODE learning curves on different models\. Training and validation sets are represented by lime and red for LFNO, orange and blue for LNO, and magenta and cyan for FNO\.\(a\)Euler\-Bernoulli beam\(b\)Heat\(c\)Reaction\-diffusion\(d\)Brusselator\(e\)Burgers\(f\)Navier\-Stokes
Figure 4:PDE learning curves on different models\. Training and validation sets are represented by lime and red for LFNO, orange and blue for LNO, and magenta and cyan for FNO\.
## 5Discussion
WhileCaoet al\.\([2024](https://arxiv.org/html/2606.07601#bib.bib17)\)claimed that LNO produces lower validation errors than FNO, our empirical results consistently demonstrate otherwise under a unified benchmarking framework\. Interestingly, despite its stated capability to capture transient behavior, LNO exhibited a disproportionately large performance gap in the Euler\-Bernoulli beam experiments—a system characterized by high\-order spatial derivatives and stiff dynamics—compared to its performance in ODE tasks\. We hypothesize that while the pole\-residue formulation is theoretically sound, the classical LNO framework encounters significant representation bottlenecks when the system’s complexity increases, failing to stabilize the optimization of complex residues\.
To address these challenges, LFNO utilizes a dual\-branch architecture that isolates transient non\-stationarities from steady\-state oscillations\. While reviewers may note a marginal accuracy gap compared to FNO in certain standard PDEs, we argue that this is not a fundamental performance regression but rather a result of preventing the steady\-state module from overfitting to high\-frequency spectral noise\. By shielding the Fourier branch from transient gradients, LFNO ensures a more stable loss landscape and physically consistent representations, providing a robust compromise between spectral precision and temporal interpretability\.
A critical factor contributing to this discrepancy lies in the target mapping consistency within the datasets\. In the original LNO datasets, we observed that the forcing functions used to generate training samples differed from those used for validation\. From a functional analysis perspective, this implies that the model is tasked with learning a transformation where the target operator shifts between the training and evaluation phases\. Such a mismatch forces the neural operator to compensate for inconsistent physical mappings rather than learning the underlying differential operator\. To eliminate this confounding factor, we constructed datasets where the source functions and their corresponding target responses are strictly aligned across all splits\. Under this rigorous setup, the Euler\-Bernoulli beam equation reveals substantially stronger transient effects than those found in conventional benchmarks\. Our findings suggest that the perceived advantages of previous models might have been, in part, an artifact of inconsistent data configurations that failed to fully challenge the models’ ability to resolve intricate transient transitions\.
Looking forward, the success of bridging Laplace and Fourier domains suggests a broader potential for hybrid basis expansion in neural operator learning\. While LFNO effectively resolves the decay\-oscillation trade\-off, future work could focus on integrating additional orthogonal bases, such as Wavelets for localized multiscale features or Chebyshev polynomials for non\-periodic boundary conditions\. By developing a general methodology to hybridize multiple spectral and temporal bases, we can create operators that are tailored to even more complex, non\-homogeneous spatiotemporal dynamics\. Such an approach would extend the utility of LFNO from a specific solution for transient\-steady separation to a universal framework for adaptive basis learning in high\-dimensional physical systems\.
## 6Conclusion
In this work, we propose LFNO, a unified framework that integrates the Laplace and Fourier Neural Operators to resolve the inherent trade\-offs between transient and steady\-state modeling\. By leveraging pole\-residue dynamics within the Laplace domain and the periodic expressivity of the Fourier integral operator, LFNO provides a comprehensive representation of complex physical signals\. The strategic allocation of model capacity toward the transient module, combined with nonlinear steady\-state representations, enables LFNO to capture a broader range of spatiotemporal dynamics than conventional operators\.
Our experimental results demonstrate that LFNO not only achieves a principled theoretical integration but also delivers empirically robust performance across diverse ODE and PDE benchmarks\. By constructing datasets with consistent target mappings, we isolate and expose LFNO’s advantages in modeling intricate transitions that were previously obscured in inconsistent data configurations\. Overall, these results establish LFNO as a highly competitive operator learning framework, demonstrating superior generalization over existing baselines in complex, non\-periodic spatiotemporal tasks\. LFNO highlights a versatile foundation for future research on hybrid spectral methods and adaptive basis expansion in high\-dimensional physical dynamics by explicitly bridging the Laplace and Fourier domains\.
## Impact Statement
This paper presents a methodological advancement in neural operator architectures for modeling complex dynamical systems\. By explicitly decomposing transient and steady\-state responses, the proposed LFNO framework provides enhanced accuracy and robustness in solving PDEs\. We anticipate that this work will have a positive impact on scientific computing, particularly in domains requiring high\-fidelity simulations such as climate modeling, aerospace engineering, and fluid dynamics\. Furthermore, the computational efficiency of our approach could contribute to reducing the energy footprint associated with large\-scale numerical simulations on high\-performance computing resources\.
While we do not foresee direct negative societal impacts, we acknowledge that reliance on purely data\-driven or hybrid models requires careful validation\. Improper deployment in safety\-critical systems without domain\-specific oversight could lead to unreliable predictions\. We encourage practitioners to use this method as a complementary tool alongside traditional numerical solvers to ensure physical consistency and reliability\.
## References
- M\. Baldan, G\. Baldan, and B\. Nacke \(2021\)Solving 1d non\-linear magneto quasi\-static maxwell’s equations using neural networks\.IET Science, Measurement & Technology15\(2\),pp\. 204–217\.Cited by:[§1](https://arxiv.org/html/2606.07601#S1.p1.1)\.
- B\. Bonev, T\. Kurth, C\. Hundt, J\. Pathak, M\. Baust, K\. Kashinath, and A\. Anandkumar \(2023\)Spherical fourier neural operators: learning stable dynamics on the sphere\.InInternational conference on machine learning,pp\. 2806–2823\.Cited by:[§1](https://arxiv.org/html/2606.07601#S1.p3.1)\.
- F\. Brockherde, L\. Vogt, L\. Li, M\. E\. Tuckerman, K\. Burke, and K\. Müller \(2017\)Bypassing the kohn\-sham equations with machine learning\.Nature communications8\(1\),pp\. 872\.Cited by:[§1](https://arxiv.org/html/2606.07601#S1.p1.1)\.
- S\. L\. Brunton, J\. L\. Proctor, and J\. N\. Kutz \(2016\)Discovering governing equations from data by sparse identification of nonlinear dynamical systems\.Proceedings of the national academy of sciences113\(15\),pp\. 3932–3937\.Cited by:[§1](https://arxiv.org/html/2606.07601#S1.p1.1)\.
- Q\. Cao, S\. Goswami, and G\. E\. Karniadakis \(2024\)Laplace neural operator for solving differential equations\.Nature Machine Intelligence6\(6\),pp\. 631–640\.Cited by:[§1](https://arxiv.org/html/2606.07601#S1.p3.1),[§2](https://arxiv.org/html/2606.07601#S2.SS0.SSS0.Px1.p1.5),[§2](https://arxiv.org/html/2606.07601#S2.SS0.SSS0.Px2.p1.1),[§3\.1](https://arxiv.org/html/2606.07601#S3.SS1.p1.6),[§5](https://arxiv.org/html/2606.07601#S5.p1.1)\.
- Q\. Cao, S\. James Hu, and H\. Li \(2023\)Laplace\- and frequency\-domain methods on computing transient responses of oscillators with hysteretic dampings to deterministic loading\.Journal of Engineering Mechanics149\(3\),pp\. 04023005\.Cited by:[§2](https://arxiv.org/html/2606.07601#S2.SS0.SSS0.Px1.p1.5)\.
- W\. Diab and M\. Al\-Kobaisi \(2025\)Temporal neural operator for modeling time\-dependent physical phenomena\.arXiv preprint arXiv:2504\.20249\.Cited by:[§1](https://arxiv.org/html/2606.07601#S1.p2.1)\.
- M\. S\. Eshaghi, C\. Anitescu, M\. Thombre, Y\. Wang, X\. Zhuang, and T\. Rabczuk \(2025\)Variational physics\-informed neural operator \(vino\) for solving partial differential equations\.Computer Methods in Applied Mechanics and Engineering437,pp\. 117785\.Cited by:[§1](https://arxiv.org/html/2606.07601#S1.p2.1)\.
- V\. S\. Fanaskov and I\. V\. Oseledets \(2023\)Spectral neural operators\.InDoklady Mathematics,Vol\.108,pp\. S226–S232\.Cited by:[§1](https://arxiv.org/html/2606.07601#S1.p3.1),[§2](https://arxiv.org/html/2606.07601#S2.SS0.SSS0.Px1.p6.2)\.
- J\. Hermann, Z\. Schätzle, and F\. Noé \(2020\)Deep\-neural\-network solution of the electronic schrödinger equation\.Nature Chemistry12\(10\),pp\. 891–897\.Cited by:[§1](https://arxiv.org/html/2606.07601#S1.p1.1)\.
- Z\. Li, D\. Z\. Huang, B\. Liu, and A\. Anandkumar \(2023\)Fourier neural operator with learned deformations for pdes on general geometries\.Journal of Machine Learning Research24\(388\),pp\. 1–26\.Cited by:[§1](https://arxiv.org/html/2606.07601#S1.p3.1)\.
- Z\. Li, N\. Kovachki, K\. Azizzadenesheli, B\. Liu, K\. Bhattacharya, A\. Stuart, and A\. Anandkumar \(2020a\)Neural operator: graph kernel network for partial differential equations\.arXiv preprint arXiv:2003\.03485\.Cited by:[§1](https://arxiv.org/html/2606.07601#S1.p2.1)\.
- Z\. Li, N\. Kovachki, K\. Azizzadenesheli, B\. Liu, K\. Bhattacharya, A\. Stuart, and A\. Anandkumar \(2021\)Fourier neural operator for parametric partial differential equations\.InInternational Conference on Learning Representations,External Links:[Link](https://openreview.net/forum?id=c8P9NQVtmnO)Cited by:[§1](https://arxiv.org/html/2606.07601#S1.p2.1),[§2](https://arxiv.org/html/2606.07601#S2.SS0.SSS0.Px1.p6.2),[§2](https://arxiv.org/html/2606.07601#S2.SS0.SSS0.Px2.p3.5),[§2](https://arxiv.org/html/2606.07601#S2.SS0.SSS0.Px3.p1.6)\.
- Z\. Li, N\. Kovachki, K\. Azizzadenesheli, B\. Liu, A\. Stuart, K\. Bhattacharya, and A\. Anandkumar \(2020b\)Multipole graph neural operator for parametric partial differential equations\.Advances in Neural Information Processing Systems33,pp\. 6755–6766\.Cited by:[§1](https://arxiv.org/html/2606.07601#S1.p2.1)\.
- Z\. Li, H\. Zheng, N\. Kovachki, D\. Jin, H\. Chen, B\. Liu, K\. Azizzadenesheli, and A\. Anandkumar \(2024\)Physics\-informed neural operator for learning partial differential equations\.ACM/IMS Journal of Data Science1\(3\),pp\. 1–27\.Cited by:[§1](https://arxiv.org/html/2606.07601#S1.p2.1)\.
- J\. Ling, A\. Kurzawski, and J\. Templeton \(2016\)Reynolds averaged turbulence modelling using deep neural networks with embedded invariance\.Journal of Fluid Mechanics807,pp\. 155–166\.Cited by:[§1](https://arxiv.org/html/2606.07601#S1.p1.1)\.
- H\. Lomax, T\. H\. Pulliam, and D\. W\. Zingg \(2001\)Fundamentals of Computational Fluid Dynamics\.Springer Berlin Heidelberg,Berlin, Heidelberg\.External Links:ISBN 978\-3\-540\-41607\-4,[Link](https://link.springer.com/book/10.1007/978-3-662-04654-8)Cited by:[§1](https://arxiv.org/html/2606.07601#S1.p1.1)\.
- L\. Lu, P\. Jin, G\. Pang, Z\. Zhang, and G\. E\. Karniadakis \(2021\)Learning nonlinear operators via deeponet based on the universal approximation theorem of operators\.Nature machine intelligence3\(3\),pp\. 218–229\.Cited by:[§1](https://arxiv.org/html/2606.07601#S1.p2.1)\.
- R\. Ranade, C\. Hill, and J\. Pathak \(2021\)DiscretizationNet: a machine\-learning based solver for navier–stokes equations using finite volume discretization\.Computer Methods in Applied Mechanics and Engineering378,pp\. 113722\.Cited by:[§1](https://arxiv.org/html/2606.07601#S1.p1.1)\.
- J\. Y\. Shin, J\. Y\. Lee, and H\. J\. Hwang \(2022\)Pseudo\-differential neural operator: generalized fourier neural operator for learning solution operators of partial differential equations\.arXiv preprint arXiv:2201\.11967\.Cited by:[§1](https://arxiv.org/html/2606.07601#S1.p2.1)\.
- A\. M\. Tartakovsky, C\. O\. Marrero, P\. Perdikaris, G\. D\. Tartakovsky, and D\. Barajas\-Solano \(2020\)Physics\-informed deep neural networks for learning parameters and constitutive relationships in subsurface flow problems\.Water Resources Research56\(5\),pp\. e2019WR026731\.Cited by:[§1](https://arxiv.org/html/2606.07601#S1.p1.1)\.
- A\. Tran, A\. Mathews, L\. Xie, and C\. S\. Ong \(2023\)Factorized fourier neural operators\.InThe Eleventh International Conference on Learning Representations,Cited by:[§1](https://arxiv.org/html/2606.07601#S1.p3.1)\.
- E\. Weinan and B\. Yu \(2018\)The deep ritz method: a deep learning\-based numerical algorithm for solving variational problems\.Communications in Mathematics and Statistics6\(1\),pp\. 1–12\.Cited by:[§1](https://arxiv.org/html/2606.07601#S1.p1.1)\.
- B\. Xiong, H\. Fu, F\. Xu, and Y\. Jin \(2019\)Data\-driven discovery of partial differential equations for multiple\-physics electromagnetic problem\.arXiv preprint arXiv:1910\.13531\.Cited by:[§1](https://arxiv.org/html/2606.07601#S1.p1.1)\.
- P\. Zhang, Y\. Hu, Y\. Jin, S\. Deng, X\. Wu, and J\. Chen \(2021\)A maxwell’s equations based deep learning method for time domain electromagnetic simulations\.IEEE Journal on Multiscale and Multiphysics Computational Techniques6,pp\. 35–40\.Cited by:[§1](https://arxiv.org/html/2606.07601#S1.p1.1)\.
## Appendix AImplementation Details
In this section, we introduce the implementation details\. For all models, the Adam optimizer with step learning rate scheduling is used to improve stability and better convergence\. We trained the model on an NVIDIA RTX 3090 GPU\. The model hyperparameters are listed in[Table6](https://arxiv.org/html/2606.07601#A1.T6)\. We particularly employed the early stopping method for all experiments and compared the corresponding LNOs with those from the same number of epochs iterated in the LNO experiment\. We also present the computational cost for each training in[Table7](https://arxiv.org/html/2606.07601#A1.T7)\. To ensure precise measurement of pure GPU kernel execution time, the average inference latency was recorded over 100 iterations after 10 warm\-up runs, with explicit synchronization using torch\.cuda\.synchronize\(\)\. We replaced Model Usage with Model VRAM footprint to provide a more precise measure of the peak hardware memory required during the model’s actual deployment and inference process\.
ApplicationFNO ODEsLNO ODEsLFNO ODEsFNO PDEsLNO PDEsLFNO PDEsLayer416416Width444161616Mode161616444Train batch size646464505050Learning rate0\.00250\.00250\.00250\.0020\.0020\.002weight decay0\.020\.020\.020\.00010\.00010\.0001scheduler step size100100100100100100gamma0\.850\.850\.850\.80\.80\.8Activation functionrelusinrelurelusinrelu
Table 6:Hyperparameters used in the FNO, LNO, and LFNO for training\.Computational costFNOLNOLFNOParameter count7,5371,3093,417Inference time \(ms\)0\.91950\.66641\.8522Multiply\-Accumulate \(MACs\)146\.952\.8016\.48FLOPs \(10710^\{7\}\)0\.8120\.6361\.592Model VRAM footprint \(MB\)46\.024\.048\.0Table 7:Computational cost for FNO, LNO, and LFNO training\.
## Appendix BDataset Configuration
### B\.1ODEs
#### Duffing equation
The Duffing equation is a nonlinear second\-order differential equation describing the driven oscillator\. The Duffing equation is written as follows\.
x¨\+cx˙\+αx\+βx3=f\(t\),\\ddot\{x\}\+c\\dot\{x\}\+\\alpha x\+\\beta x^\{3\}=f\(t\),\(8\)whereccis the damping coefficient,α\\alphais the linear stiffness,β\\betais the non\-linear coefficient, andx,x˙,x,\\dot\{x\},andx¨\\ddot\{x\}are the displacement, velocity, and acceleration variables of the dynamic response\. We set the parameters asα=1\\alpha=1andβ=1\\beta=1\. The initial condition is set tox\(0\)=0x\(0\)=0andx˙\(0\)=0\\dot\{x\}\(0\)=0\. To evaluate LFNO’s ability to resolve both undamped persistent oscillations and transient decay dynamics, we conduct experiments with damping coefficients ofc=0c=0andc=0\.5c=0\.5\.
#### Pendulum equation
The Pendulum equation describes the angular displacementx\(t\)x\(t\)of a pendulum relative to its rest position under an external driving forcef\(t\)f\(t\)\. The equation can be expressed as follows\.
x¨\+cx˙\+glsin\(x\)=f\(t\)\.\\ddot\{x\}\+c\\dot\{x\}\+\\frac\{g\}\{l\}\\sin\(x\)=f\(t\)\.\(9\)Here,ggis the acceleration due to gravity,llis the length of the pendulum, andccis the damping coefficient\. For brevity, we useg/l=1g/l=1and the initial condition ofx\(0\)=0x\(0\)=0andx˙\(0\)=0\\dot\{x\}\(0\)=0\. We experiment with the damping coefficientc=0\.5c=0\.5\.
#### Forced Lorenz system
The forced Lorenz system is the set of three coupled nonlinear differential equations representing simplified atmospheric convection\.
x˙\\displaystyle\\dot\{x\}=γ\(y−x\)\\displaystyle=\\gamma\(y\-x\)\(10\)y˙\\displaystyle\\dot\{y\}=x\(ρ−z\)−y\\displaystyle=x\(\\rho\-z\)\-yz˙\\displaystyle\\dot\{z\}=xy−βz−f\(t\),\\displaystyle=xy\-\\beta z\-f\(t\),whereγ\\gamma,ρ\\rho, andβ\\betaare physical parameters\. To evaluate the framework’s robustness in chaotic regimes and its sensitivity to parameter variations, we fixγ=10\\gamma=10andβ=8/3\\beta=8/3while setting the Rayleigh numberρ\\rhoto55and1010\. Both experiments are initialized from the statex\(0\)=1,y\(0\)=0,z\(0\)=0x\(0\)=1,y\(0\)=0,z\(0\)=0\.
### B\.2PDEs
#### Euler\-Bernoulli beam equation
The deflectiony\(x,t\)y\(x,t\)of a homogeneous Euler\-Bernoulli beam of 1D with forcing functionf\(x,t\)f\(x,t\)is given as follows:
EI∂4y∂x4\+ρA∂2y∂t2=f\(x,t\),EI\\frac\{\\partial^\{4\}y\}\{\\partial x^\{4\}\}\+\\rho A\\frac\{\\partial^\{2\}y\}\{\\partial t^\{2\}\}=f\(x,t\),\(11\)whereEI,ρ,EI,\\rho,andAAdenote bending stiffness, density, and cross\-sectional area, respectively\. The forcing function is defined asf\(x,t\)=−99Ae−xsin\(10t\)f\(x,t\)=\-99Ae^\{\-x\}\\sin\(10t\), which represents a periodic spatially decaying excitation\. We use the parameters ofEI=1\.334EI=1\.334,ρ=7850\\rho=7850, andA=0\.01A=0\.01\.
#### Heat equation
The heat equation describes the spatiotemporal evolution of a temperature fieldy\(x,t\)y\(x,t\)driven by a source termf\(x,t\)\.f\(x,t\)\.The governing equation is given as follows:
∂2y∂x2−∂y∂t=f\(x,t\)\.\\frac\{\\partial^\{2\}y\}\{\\partial x^\{2\}\}\-\\frac\{\\partial y\}\{\\partial t\}=f\(x,t\)\.\(12\)Here, we use the source term off\(x,t\)=Ae−t\(1−π2\)sin\(πx\)f\(x,t\)=Ae^\{\-t\}\(1\-\\pi^\{2\}\)\\sin\(\\pi x\)\.
#### Reaction\-diffusion equation
Reaction\-diffusion systems model the chemical concentrationy\(x,t\)y\(x,t\)under the influence of both diffusion and nonlinear reaction kinetics\. In this experiment, we use the following governing equation:
D∂2y∂x2\+ky2−∂y∂t=f\(x,t\),D\\frac\{\\partial^\{2\}y\}\{\\partial x^\{2\}\}\+ky^\{2\}\-\\frac\{\\partial y\}\{\\partial t\}=f\(x,t\),\(13\)wherey\(x,t\)y\(x,t\)is the concentration of the chemical,D=1−0\.95π2D=1\-0\.95\\pi^\{2\}is the diffusion coefficient, andk=1k=1is the rate of reaction\. The source termf\(x,t\)=Ae−t\(1−π2\)sin\(πx\)\+A2e−2tsin\(πx\)2f\(x,t\)=Ae^\{\-t\}\(1\-\\pi^\{2\}\)\\sin\(\\pi x\)\+A^\{2\}e^\{\-2t\}\\sin\(\\pi x\)^\{2\}is configured to produce complex nonlinear interactions between the decaying forcing and the termy2y^\{2\}, challenging the model’s spectral expressivity\.
#### Brusselator equation
The Brusselator is a prototypical model for autocatalytic oscillating reactions, describing the concentrations of two interacting chemicals,u\(x,t\)u\(x,t\)andv\(x,t\)v\(x,t\)\. The system is governed by:
∂u∂t\\displaystyle\\frac\{\\partial u\}\{\\partial t\}=D0∂2u∂x2\+a\+f\(x,t\)−\(1\+b\)u\+vu2\\displaystyle=D\_\{0\}\\frac\{\\partial^\{2\}u\}\{\\partial x^\{2\}\}\+a\+f\(x,t\)\-\(1\+b\)u\+vu^\{2\}\(14\)∂v∂t\\displaystyle\\frac\{\\partial v\}\{\\partial t\}=D1∂2v∂x2\+bu−vu2,\\displaystyle=D\_\{1\}\\frac\{\\partial^\{2\}v\}\{\\partial x^\{2\}\}\+bu\-vu^\{2\},whereD0=1D\_\{0\}=1andD1=0\.5D\_\{1\}=0\.5are diffusion coefficients anda=1,b=3a=1,b=3are the parameters of the system\. The forcing function isf\(x,t\)=Ae−t\(1−π2\)sin\(πx\)\.f\(x,t\)=Ae^\{\-t\}\(1\-\\pi^\{2\}\)\\sin\(\\pi x\)\.
#### Burgers’ equation
Burgers’ equation serves as a fundamental model capturing the interplay between nonlinear advection and diffusion\. Unlike the previously described benchmarks, this is an unforced system where the field evolves solely from its initial state:
∂u∂t\+u∂u∂x=ν∂2u∂x2\\frac\{\\partial u\}\{\\partial t\}\+u\\frac\{\\partial u\}\{\\partial x\}=\\nu\\frac\{\\partial^\{2\}u\}\{\\partial x^\{2\}\}\(15\)whereν=0\.01/π\\nu=0\.01/\\piis the diffusion coefficient\. We initialize the system withu\(x,0\)=Uxu\(x,0\)=Ux, whereU∼𝒰\[1\.6,2\.4\)U\\sim\\mathcal\{U\}\[1\.6,2\.4\)\.
#### Navier\-Stokes equation
The Navier\-Stokes equation governs the motion of incompressible viscous fluids\. In our experiment, the system representing the vorticityω\\omegais given by:
∂ω∂t\+\(𝐮⋅∇\)ω=ν∇2ω\\displaystyle\\frac\{\\partial\\omega\}\{\\partial t\}\+\(\\mathbf\{u\}\\cdot\\nabla\)\\omega=\\nu\\nabla^\{2\}\\omega\(16\)∇2ψ=−ω\\displaystyle\\nabla^\{2\}\\psi=\-\\omega𝐮=⟨∂ψ∂y,−∂ψ∂x⟩\.\\displaystyle\\mathbf\{u\}=\\left\\langle\\frac\{\\partial\\psi\}\{\\partial y\},\-\\frac\{\\partial\\psi\}\{\\partial x\}\\right\\rangle\.Here,ν=0\.001\\nu=0\.001denotes a kinematic viscosity\. The initial condition isω\(x,y,0\)=Vsin\(x\)cos\(y\),\\omega\(x,y,0\)=V\\sin\(x\)\\cos\(y\),whereVVis sampled uniformly from\[0\.8,1\.2\)\.\[0\.8,1\.2\)\.
TaskBoundary conditionsInitial values\(for allxx\)SpatialRes\.Δt\\Delta tEuler\-Bernoulli beamFixed ends\(y=0y=0\)y\(x,0\)=0y\(x,0\)=0y˙\(x,0\)=0\\dot\{y\}\(x,0\)=0Δx=0\.03\\Delta x=0\.030\.02Heat equationy\(x,0\)=0y\(x,0\)=0Δx=0\.08\\Delta x=0\.080\.01Reaction\-diffusionΔx=0\.08\\Delta x=0\.080\.02BrusselatorNeumannu\(x,0\)=0\.5u\(x,0\)=0\.5v\(x,0\)=1v\(x,0\)=1Δx=0\.03\\Delta x=0\.030\.02BurgersPeriodicVaryΔx=π/25\\Delta x=\\pi/250\.02Navier\-StokesPeriodicVaryΔx=π/25\\Delta x=\\pi/250\.01Table 8:Detailed configurations of PDEs
## Appendix CAnalysis of Transient and Steady\-state Error Dynamics
In this section, we present a comparative analysis of the empirical properties of LNO and LFNO, focusing on their error stability across different dynamical regimes\. To evaluate long\-term predictive performance, the time horizon is partitioned into the transient state \(t<500t<500\) and the steady state \(t≥500t\\geq 500\)\. This threshold oft=500t=500is determined based on the physical settling time of the governing equations and the observed stabilization of error profiles\. In the following figures in[Figure5](https://arxiv.org/html/2606.07601#A3.F5), results for LNO and LFNO are denoted in green and magenta, respectively\. To clarify long\-term trends, a moving average with a window size of100100steps is applied\.
Across all test cases, LNO exhibits significant fluctuations in error as time progresses\. In contrast, LFNO maintains a remarkably narrow error range with minimal variance, demonstrating superior statistical stability during long\-term integration\. As the coefficients of the Duffing equationccand the Lorenz systemρ\\rhoincrease, the systems exhibit stronger chaotic characteristics, which typically hinders the learning process\. However, LFNO consistently maintains a stable convergence pattern regardless of these variations, whereas the performance of LNO tends to degrade under increased complexity\. These visual and numerical evidences confirm that LFNO provides substantially higher steady\-state stability and predictive precision than the conventional LNO, particularly in chaotic dynamics\.
\(a\)Duffingc=0c=0\(b\)Duffingc=0\.5c=0\.5\(c\)Lorenzρ=5\\rho=5\(d\)Lorenzρ=10\\rho=10\(e\)Pendulumc=0\.5c=0\.5
Figure 5:Comparison of point\-wise Mean Absolute Error \(MAE\) between LNO and LFNO across ODE systems\. The time horizon is partitioned into the transient state \(t<500t<500\), and the steady\-state \(t\>500t\>500\)\. \(a\), \(b\) Duffing \(c=0,0\.5c=0,0\.5\), \(c\), \(d\) Lorenz \(ρ=5,10\\rho=5,10\), and \(e\) Pendulum \(c=0\.5c=0\.5\)\. The green lines represent the LNO, while the magenta lines denote the LFNO\.
## Appendix DDataset Samples
[Table9](https://arxiv.org/html/2606.07601#A4.T9)and[Table10](https://arxiv.org/html/2606.07601#A4.T10)show the visualizations of the datasets for each task\. The datasets themselves do not have a split, as we used a split of K\-folds or a random split on the fly to shuffle and split the test and validation samples\.
TaskSignalGrount Truth \(GT\)Duffingc0![[Uncaptioned image]](https://arxiv.org/html/2606.07601v1/images/dataset/1d_signal.png)![[Uncaptioned image]](https://arxiv.org/html/2606.07601v1/images/dataset/Duffing_c0.png)Duffingc05![[Uncaptioned image]](https://arxiv.org/html/2606.07601v1/images/dataset/1d_signal.png)![[Uncaptioned image]](https://arxiv.org/html/2606.07601v1/images/dataset/Duffing_c05.png)Lorenzrho5![[Uncaptioned image]](https://arxiv.org/html/2606.07601v1/images/dataset/1d_signal.png)![[Uncaptioned image]](https://arxiv.org/html/2606.07601v1/images/dataset/Lorenz_rho5.png)Lorenzrho10![[Uncaptioned image]](https://arxiv.org/html/2606.07601v1/images/dataset/1d_signal.png)![[Uncaptioned image]](https://arxiv.org/html/2606.07601v1/images/dataset/Lorenz_rho10.png)Pendulumc05![[Uncaptioned image]](https://arxiv.org/html/2606.07601v1/images/dataset/1d_signal.png)![[Uncaptioned image]](https://arxiv.org/html/2606.07601v1/images/dataset/Pendulum_c05.png)Table 9:Visualization of ODE samples in our datasetTaskSignalGrount Truth \(GT\)Beam![[Uncaptioned image]](https://arxiv.org/html/2606.07601v1/images/dataset/Beam_signal.png)![[Uncaptioned image]](https://arxiv.org/html/2606.07601v1/images/dataset/Beam.png)Heat![[Uncaptioned image]](https://arxiv.org/html/2606.07601v1/images/dataset/Diffusion_signal.png)![[Uncaptioned image]](https://arxiv.org/html/2606.07601v1/images/dataset/Diffusion.png)Reaction\-diffusion![[Uncaptioned image]](https://arxiv.org/html/2606.07601v1/images/dataset/ReactionDiffusion_signal.png)![[Uncaptioned image]](https://arxiv.org/html/2606.07601v1/images/dataset/ReactionDiffusion.png)Brusselator![[Uncaptioned image]](https://arxiv.org/html/2606.07601v1/images/dataset/gray_signal.png)![[Uncaptioned image]](https://arxiv.org/html/2606.07601v1/images/dataset/brusselator.png)Burgers![[Uncaptioned image]](https://arxiv.org/html/2606.07601v1/images/dataset/burgers_signal.png)![[Uncaptioned image]](https://arxiv.org/html/2606.07601v1/images/dataset/burgers.png)Table 10:Visualization of PDE samples in our dataset
## Appendix EPDEs Qualitative Figures
The qualitative results of PDEs are shown in[Figure6](https://arxiv.org/html/2606.07601#A5.F6),[Figure7](https://arxiv.org/html/2606.07601#A5.F7),[Figure8](https://arxiv.org/html/2606.07601#A5.F8),[Figure9](https://arxiv.org/html/2606.07601#A5.F9), and[Figure10](https://arxiv.org/html/2606.07601#A5.F10)\. Our model’s qualitative results are competitive with those of FNO\.
\(a\)Qualitative Results and Scatter of the Euler\-Bernoulli beam in FNO
\(b\)Qualitative Results and Scatter of the Euler\-Bernoulli beam in LFNO
\(c\)Qualitative Results and Scatter of the Euler\-Bernoulli beam in LNO
Figure 6:Euler\-Bernoulli beam equation: Comparison of qualitative results computed by FNO, LFNO, and LNO\. It represents the ground truth shown in the left plot, the prediction of each model in the center plot, and the \(truth, pred\) scatters are shown in the right plot\.\(a\)Qualitative Results and Scatter of the heat equation in FNO
\(b\)Qualitative Results and Scatter of the heat equation in LFNO
\(c\)Qualitative Results and Scatter of the heat equation in LNO
Figure 7:Heat equation: Comparison of qualitative results computed by FNO, LFNO, and LNO\. It represents the ground truth shown in the left plot, the prediction of each model in the center plot, and the \(truth, pred\) scatters are shown in the right plot\.\(a\)Qualitative Results and Scatter of the reaction\-diffusion equation in FNO
\(b\)Qualitative Results and Scatter of the reaction\-diffusion equation in LFNO
\(c\)Qualitative Results and Scatter of the reaction\-diffusion equation in LNO
Figure 8:Reaction\-diffusion equation: Comparison of qualitative results computed by FNO, LFNO, and LNO\. It represents the ground truth shown in the left plot, the prediction of each model in the center plot, and the \(truth, pred\) scatters are shown in the right plot\.\(a\)Qualitative Results and Scatter of the Brusselator equation in FNO
\(b\)Qualitative Results and Scatter of the Brusselator equation in LFNO
\(c\)Qualitative Results and Scatter of the Brusselator equation in LNO
Figure 9:Brusselator equation: Comparison of qualitative results computed by FNO, LFNO, and LNO\. It represents the ground truth shown in the left plot, the prediction of each model in the center plot, and the \(truth, pred\) scatters are shown in the right plot\.\(a\)Qualitative Results and Scatter of the Burgers’ equation in FNO
\(b\)Qualitative Results and Scatter of the Burgers’ equation in LFNO
\(c\)Qualitative Results and Scatter of the Burgers’ equation in LNO
Figure 10:Burgers’ equation: Comparison of qualitative results computed by FNO, LFNO, and LNO\. It represents the ground truth shown in the left plot, the prediction of each model in the center plot, and the \(truth, pred\) scatters are shown in the right plot\.\(a\)Qualitative Results and Scatter of the Navier\-Stokes equation in FNO
\(b\)Qualitative Results and Scatter of the Navier\-Stokes equation in LFNO
\(c\)Qualitative Results and Scatter of the Navier\-Stokes equation in LNO
Figure 11:Navier\-Stokes equation: Comparison of qualitative results computed by FNO, LFNO, and LNO\. It represents the ground truth shown in the left plot, the prediction of each model in the center plot, and the \(truth, pred\) scatters are shown in the right plot\.Similar Articles
SirenFNO: Efficient and Full Frequency Learning of Fourier Neural Operators
SirenFNO leverages sinusoidal representation networks to learn full-frequency Fourier kernels, eliminating frequency truncation and achieving significant parameter reductions while improving accuracy on PDE benchmarks.
@AnimaAnandkumar: Great to see extrapolation success with FNOs.
Fourier neural operators (FNOs) achieve extrapolation success in modeling periodically driven quantum systems, capturing temporal correlations in frequency space for physically faithful dynamics beyond training data.
Sequential Physics-Constrained Neural Operator Forward Modeling for the $\textit{Norne}$ Reservoir System
This paper presents a comprehensive mathematical framework for sequential surrogate modeling of three-phase black-oil reservoir dynamics using Fourier Neural Operators (FNO) and physics-informed variants (PINO), applied to the Norne benchmark reservoir. Theoretical contributions include functional-analytic formulation, covariate shift analysis, physics-constrained spectral stability, and truncated backpropagation gradient analysis.
Frequency Bias and OOD Generalization in Neural Operators under a Variable-Coefficient Wave Equation
This paper investigates the generalization behavior of Fourier Neural Operators and Deep Operator Networks under distribution shifts in a variable-coefficient wave equation, revealing that FNO struggles with high-frequency inputs while DeepONet shows milder degradation.
Decoupled Latent Optimization of Diffusion Models for Full Waveform Inversion
Introduces Decoupled Latent Optimization (DLO) for full waveform inversion, which relaxes latent optimization into a quadratic-penalty objective, outperforming classical and diffusion-based methods on benchmarks while preserving smoothed-velocity initialization.