UFO: A Domain-Unification-Free Operator Framework for Generalized Operator Learning

arXiv cs.LG 05/14/26, 04:00 AM Papers
Summary
Introduces UFO, a cross-domain neural operator framework that adaptively learns operators across different representational domains, enabling discretization-decoupled predictions robust to distribution shifts.
arXiv:2605.12700v1 Announce Type: new Abstract: Neural operators have become an effective framework for learning mappings between function spaces, yet most existing architectures realize operators within a single representational domain, such as physical, spectral, or latent space. In this work, we introduce UFO (Domain-Unification-Free Operator), a cross-domain neural operator framework that realizes operators through adaptive, jointly conditioned interactions among representations defined on distinct domains. UFO enables discretization decoupling: the input function can be observed at resolutions or locations different from those used during training, while the solution can be queried at arbitrary output resolutions. Across four complementary benchmarks covering discontinuous inputs, irregular sampling with spectral mismatch, nonlinear dynamics, and stochastic high-frequency fields, UFO delivers accurate, robust, and physically coherent predictions under distribution shifts. These results establish cross-domain, phase-modulated realization as a powerful framework for discretization-decoupled neural operator learning.
Original Article
View Cached Full Text
Cached at: 05/14/26, 06:17 AM
# UFO: A Domain-Unification-Free Operator Framework for Generalized Operator Learning
Source: [https://arxiv.org/html/2605.12700](https://arxiv.org/html/2605.12700)
###### Abstract

Neural operators have become an effective framework for learning mappings between function spaces, yet most existing architectures realize operators within a single representational domain, such as physical, spectral, or latent space\. In this work, we introduce UFO \(Domain\-Unification\-Free Operator\), a cross\-domain neural operator framework that realizes operators through adaptive, jointly conditioned interactions among representations defined on distinct domains\. UFO enables discretization decoupling: the input function can be observed at resolutions or locations different from those used during training, while the solution can be queried at arbitrary output resolutions\. Across four complementary benchmarks covering discontinuous inputs, irregular sampling with spectral mismatch, nonlinear dynamics, and stochastic high\-frequency fields, UFO delivers accurate, robust, and physically coherent predictions under distribution shifts\. These results establish cross\-domain, phase\-modulated realization as a powerful framework for discretization\-decoupled neural operator learning\.

\\affiliation

\[label1\]organization=Water and Mining Environment Unit, Geological Survey of Finland, addressline=Vuorimiehentie 5, city=Espoo, postcode=02151, country=Finland

\\affiliation

\[label2\]organization=Division of Applied Mathematics, Brown University, addressline=170 Hope Street, city=Providence, postcode=RI 02912, country=USA

\\affiliation

\[label3\]organization=Institute of Geosciences, University of Bonn, addressline=Kirschallee 1\-3, city=Bonn, postcode=53115, country=Germany

## 1Introduction

Learning nonlinear operators that map functions to functions is a central problem in scientific machine learning, with applications spanning fluid mechanics, geophysics, climate modeling, and materials science\(Wenet al\.,[2022](https://arxiv.org/html/2605.12700#bib.bib28); Pathaket al\.,[2026](https://arxiv.org/html/2605.12700#bib.bib31); Choiet al\.,[2024](https://arxiv.org/html/2605.12700#bib.bib29); Huanget al\.,[2024](https://arxiv.org/html/2605.12700#bib.bib30); Azizzadenesheliet al\.,[2024](https://arxiv.org/html/2605.12700#bib.bib12)\)\. Over the past several years, neural operators have emerged as a powerful paradigm for approximating solution operators of parametric partial differential equations \(PDEs\) and related infinite\-dimensional mappings directly from data\(Luet al\.,[2021](https://arxiv.org/html/2605.12700#bib.bib1); Kovachkiet al\.,[2023](https://arxiv.org/html/2605.12700#bib.bib2)\)\.

Two families of neural operators have been particularly influential\. Deep Operator Networks \(DeepONets\) represent operators through a decomposition into a branch network, which encodes the input function, and a trunk network, which evaluates the solution at queried spatial or temporal coordinates\(Luet al\.,[2021](https://arxiv.org/html/2605.12700#bib.bib1)\)\. This architecture offers flexibility with respect to irregular geometries, scattered sensors, and physics\-informed constraints\. On the other hand, Fourier Neural Operators \(FNOs\) and related spectral architectures embed convolutional kernels in Fourier space, enabling efficient learning of global interactions and improved handling of high\-frequency modes\(Liet al\.,[2021](https://arxiv.org/html/2605.12700#bib.bib5)\)\.

Although the two families differ in architecture and inductive bias, they are fundamentally realized within a single representational domain\. In DeepONet family, operator realization is carried out in the physical domain through branch\-trunk representations\. In contrast, FNO\-type methods realize operators within a spectral domain via global Fourier representations\. Despite their success, such single\-domain formulations introduce inherent limitations\. For instance, DeepONet lacks explicit mechanisms for representing frequency content, often resulting in spectral bias when learning operators associated with highly oscillatory solutions\(Rahamanet al\.,[2019](https://arxiv.org/html/2605.12700#bib.bib3); Wanget al\.,[2022](https://arxiv.org/html/2605.12700#bib.bib4)\)\. Meanwhile, FNO\-type methods rely on inverse Fourier transforms that typically require regular grids and fixed spectral representations, which may limit their applicability to heterogeneous domains\. Another challenge lies in the incorporation of physics\-informed constraints, as these often require explicit derivative evaluations in the physical domain\(Kovachkiet al\.,[2023](https://arxiv.org/html/2605.12700#bib.bib2); Liet al\.,[2024](https://arxiv.org/html/2605.12700#bib.bib6)\)\.

Subsequent works have addressed these difficulties from several complementary directions\. Recent studies improve spectral expressivity to better capture high\-frequency, oscillatory, or non\-smooth solution structures\(Zhuet al\.,[2023](https://arxiv.org/html/2605.12700#bib.bib20); Peyvanet al\.,[2026](https://arxiv.org/html/2605.12700#bib.bib21); Khodakaramiet al\.,[2026a](https://arxiv.org/html/2605.12700#bib.bib22); Chenget al\.,[2025](https://arxiv.org/html/2605.12700#bib.bib7); Jianget al\.,[2024](https://arxiv.org/html/2605.12700#bib.bib8); Sojitraet al\.,[2026](https://arxiv.org/html/2605.12700#bib.bib9); Zhaoet al\.,[2025](https://arxiv.org/html/2605.12700#bib.bib24)\)\. In parallel, the regular\-grid restriction has motivated neural operators for irregular geometries, non\-uniform meshes, and flexible input formats\. These mechanisms mainly rely on learned geometric deformations, graph\- or attention\-based representations, and mesh\-adaptive constructions\(Liet al\.,[2023b](https://arxiv.org/html/2605.12700#bib.bib16),[a](https://arxiv.org/html/2605.12700#bib.bib15); Haoet al\.,[2023](https://arxiv.org/html/2605.12700#bib.bib13); Fuet al\.,[2025](https://arxiv.org/html/2605.12700#bib.bib19); Liu and Tang,[2025](https://arxiv.org/html/2605.12700#bib.bib18); Yinet al\.,[2024](https://arxiv.org/html/2605.12700#bib.bib25)\)\. These advances significantly broaden the applicability of neural operators beyond rectangular grids and smooth solution regimes\.

Resolution invariance during evaluation is another key motivation of neural operators\. However, existing architectures still exhibit practical forms of discretization dependence: DeepONet\-type models usually rely on a fixed set of input sensors, while FNO\-type models require compatible input\-output grid representations\. This limits their use in regimes where dense measurements are expensive or unavailable, but high\-resolution predictions are required\. Several recent works aim to reduce this dependence, including learning operators in latent spaces\(Wang and Wang,[2024](https://arxiv.org/html/2605.12700#bib.bib17); Kontolatiet al\.,[2024](https://arxiv.org/html/2605.12700#bib.bib26)\), resolution\-independent neural operators\(Bahmaniet al\.,[2025](https://arxiv.org/html/2605.12700#bib.bib14)\), Laplace neural operator\(Caoet al\.,[2024](https://arxiv.org/html/2605.12700#bib.bib27)\), and super\-resolution neural operators\(Wei and Zhang,[2023](https://arxiv.org/html/2605.12700#bib.bib23)\)\.

Despite these developments, most existing approaches improve expressivity, geometry handling, or resolution transfer through architectural augmentation within a predefined realization mechanism\. In particular, representations from different sources or domains are often encoded separately and then combined through fixed fusion rules, such as concatenation, inner products, interpolation, or attention\-based aggregation\. Such mechanisms enrich the features available to the model, but the operator realization itself remains governed by a fixed composition\.

Motivated by this observation, we introduce UFO \(Domain\-Unification\-Free Operator\), a cross\-domain operator framework, where the operator is realized through non\-separable, jointly conditioned interactions among representations defined on distinct domains\. Under this view, operator realization is no longer a fixed mapping, but emerges from an adaptive, phase\-modulated coupling between heterogeneous input\-space and solution\-space representations\.

This work makes three main contributions\. First, we introduce a novel operator realization mechanism, UFO\. Second, we develop the foundational UFO architecture, consisting of a spectral encoder, a spatial basis network, and an adaptive phase\-modulated coupling operator\. This architecture enables discretization decoupling, which means that input functions can be observed at resolutions or locations different from those used during training, while solutions can be queried at arbitrary output resolutions\. Third, we design a set of complementary benchmarks to evaluate not only pointwise accuracy, but also spectral consistency, structural coherence, generalization, and resolution behavior\.

Specifically, we evaluate UFO on four complementary benchmarks: StepHeat for discontinuous\-input spectral bias, Delta\-Helmholtz for translation consistency under irregular sampling and spectrum mismatch, Burgers for nonlinear structure preservation under bidirectional extrapolation, and GRF\-Helmholtz for stochastic high\-frequency random\-field operators\. Across these settings, UFO delivers accurate, robust, and physically coherent predictions under distribution shifts\.

## 2UFO theory and architecture

Let𝒜\\mathcal\{A\}and𝒰\\mathcal\{U\}be Banach spaces of functions defined on domainsΩin\\Omega\_\{in\}andΩout\\Omega\_\{out\}, respectively\. An operator learning problem consists of approximating a \(possibly nonlinear\) operator𝒢:𝒜→𝒰\\mathcal\{G\}:\\ \\mathcal\{A\}\\rightarrow\\mathcal\{U\}from a finite set of input\-output function pairs\{\(fi,𝒢\(fi\)\)\}\\\{\(f\_\{i\},\\mathcal\{G\}\(f\_\{i\}\)\)\\\}\. A neural operator is a parametric family of maps𝒢θ\\mathcal\{G\}\_\{\\theta\}designed to approximate𝒢\\mathcal\{G\}uniformly on compact subsets of𝒜\\mathcal\{A\}\.

The solution operator𝒢\\mathcal\{G\}in UFO is realized through an adaptive, phase\-modulated coupling among cross\-domain representations, which gives rise to the following definition\.

###### Definition 1\(Cross\-domain Operator Learning\)\.

A neural operator𝒢α\\mathcal\{G\}\_\{\\alpha\}is said to employ cross\-domain operator learning if it is realized through a non\-separable, jointly\-conditioned interaction among representations defined on distinct spaces, i\.e\.,

𝒢α\(f\)\(x\)=𝒞α\(Φ𝒮\(x\),Ψℋ\(f\)\),x∈Ωout,\\mathcal\{G\}\_\{\\alpha\}\(f\)\(x\)=\\mathcal\{C\}\_\{\\alpha\}\(\\Phi\_\{\\mathcal\{S\}\}\(x\),\\ \\Psi\_\{\\mathcal\{H\}\}\(f\)\),\\qquad x\\in\\Omega\_\{out\},whereΦ𝒮\(x\)∈𝒮,Ψℋ\(f\)∈ℋ,𝒮≠ℋ\\Phi\_\{\\mathcal\{S\}\}\(x\)\\in\\mathcal\{S\},\\ \\Psi\_\{\\mathcal\{H\}\}\(f\)\\in\\mathcal\{H\},\\ \\mathcal\{S\}\\not=\\mathcal\{H\}, and𝒞α\\mathcal\{C\}\_\{\\alpha\}is a learnable coupling whose realization depends jointly on these representations and is not reducible to independent mappings acting on each representation separately\.

Clearly, operator realization in Definition[1](https://arxiv.org/html/2605.12700#Thmdefinition1)emerges from adaptive interactions among representations defined on distinct spaces\. DeepONet\-type methods realize operators within a single physical\-domain representation, whereas FNO\-type methods realize operators within a single spectral\-domain representation\. Hence, neither family belongs to the cross\-domain operator learning \(CDOL\) framework\.

#### Remark

The definitions above formalize the operator realization principle introduced by UFO\. It naturally admits multi\-domain extensions within the same UFO framework\.

### 2\.1UFO architecture

As shown in Fig\.[1](https://arxiv.org/html/2605.12700#S2.F1), the above principles are instantiated through three tightly coupled modules: the spectral encoder for input\-domain representation, the spatial basis network for solution\-domain representation, and the adaptive phase\-modulated coupling operator for cross\-domain realization\. Fig\.[1](https://arxiv.org/html/2605.12700#S2.F1)presents the architecture corresponding to the minimal two\-domain form of UFO\.

![Refer to caption](https://arxiv.org/html/2605.12700v1/x1.png)Figure 1:Architecture of the UFO framework\.#### Spectral encoder \(SE\)

The SE is a central component of UFO that constructs a global representation of the input function in a domain distinct from the physical space\. To this end, we introduce a learnable, coordinate\-conditioned SE that maps an input function to a representation in a spectral domain\.

Letf:Ω→ℝdff:\\Omega\\rightarrow\\mathbb\{R\}^\{d\_\{f\}\}be an input function observed at locations\{xi′\}i=1N⊂Ω\\\{x^\{\\prime\}\_\{i\}\\\}\_\{i=1\}^\{N\}\\subset\\Omega\. The SE constructs the input\-domain representationΨℋ\(f\)∈ℂC\\Psi\_\{\\mathcal\{H\}\}\(f\)\\in\\mathbb\{C\}^\{C\}in UFO\. We first liftffinto a higher\-dimensional latent space:f~i=ℒθ\(f\(xi′\)\),f~i∈ℝdℓ,\\tilde\{f\}\_\{i\}=\\mathcal\{L\}\_\{\\theta\}\(f\(x^\{\\prime\}\_\{i\}\)\),\\ \\tilde\{f\}\_\{i\}\\in\\mathbb\{R\}^\{d\_\{\\ell\}\},whereℒθ\\mathcal\{L\}\_\{\\theta\}is a learnable linear lifting map\.

A spectral transformation is then applied to the lifted sample sequence,f^=𝒯\(f~\)\\hat\{f\}=\\mathcal\{T\}\(\\tilde\{f\}\)\. In our implementation,𝒯\\mathcal\{T\}is instantiated as an FFT\-based transform that provides an initial spectral coordinate system for learnable representations, rather than an exact discretization\-dependent Fourier expansion\. UFO does not rely on any specific choice of transform\.

The spectral coefficients are further modulated by a coordinate\-conditioned weighting function to enable discretization\-agnostic and resolution\-invariant representations\. Specifically, we introduceωθ:Ω→ℝdℓ,\\omega\_\{\\theta\}:\\Omega\\rightarrow\\mathbb\{R\}^\{d\_\{\\ell\}\},and for each sampling locationxi′x^\{\\prime\}\_\{i\}, we perform element\-wise modulation:

zi=ωθ\(xi′\)⊙f^i\.z\_\{i\}=\\omega\_\{\\theta\}\(x^\{\\prime\}\_\{i\}\)\\odot\\hat\{f\}\_\{i\}\.The global representation offfis then obtained via mean aggregation,z¯=1N∑i=1Nzi,\\displaystyle\\bar\{z\}=\\frac\{1\}\{N\}\\sum\_\{i=1\}^\{N\}z\_\{i\},which yields a global spectral summary of the entire input function\. In the continuum limit, this operation can be interpreted as a learned spectral integral of the form:

z¯≈∫Ωωθ\(x′\)f^\(x′\)𝑑μ\(x′\),\\bar\{z\}\\approx\\int\_\{\\Omega\}\\omega\_\{\\theta\}\(x^\{\\prime\}\)\\hat\{f\}\(x^\{\\prime\}\)d\\mu\(x^\{\\prime\}\),capturing the global structure of the function in the spectral domain\. Here,μ\\mudenotes the sampling measure induced by the input discretization\.

Finally, we learn a spectral representation through separate nonlinear mappings applied to the real and imaginary components ofz¯\\bar\{z\}:

Ψℋ\(f\)=ρr\(Re⁡\(z¯\)\)\+iρi\(Im⁡\(z¯\)\),\\Psi\_\{\\mathcal\{H\}\}\(f\)=\\rho\_\{r\}\(\\operatorname\{Re\}\(\\bar\{z\}\)\)\+i\\,\\rho\_\{i\}\(\\operatorname\{Im\}\(\\bar\{z\}\)\),\(1\)whereρr\\rho\_\{r\}andρi\\rho\_\{i\}are multilayer perceptrons\. This step decouples the learned representation from any specific spectral basis \(e\.g\., Fourier\), allowing the model to construct an expressive representation adapted to the operator learning task\.

The SE differs fundamentally from existing neural operator constructions\. Instead of realizing the operator within a fixed domain \(e\.g\., purely physical or spectral\), it constructs a global, coordinate\-conditioned representation in a distinct domain\. It is subsequently coupled with spatial representations through a non\-separable and phase\-modulated coupling operator\. This design is central to UFO: operator realization emerges from coupling a complementary input\-domain representation with spatial representations of the solution space, enabling structured cross\-domain interaction\.

#### Spatial basis network

Complementary to the global spectral representation of the input function generated by the SE, UFO constructs a continuous representation of the solution domain through a spatial basis network \(SBN\)\. Given query coordinatesx∈Ωx\\in\\Omega, we defineΦ𝒮\(x\)∈ℝC\\Phi\_\{\\mathcal\{S\}\}\(x\)\\in\\mathbb\{R\}^\{C\}as a learned spatial feature representation\. Here,xxis usually different from the observed locationsx′x^\{\\prime\}in the SE\. Specifically, the SBN is implemented as a multilayer perceptron:

Φ𝒮\(x\)=ϕθ\(x\),\\Phi\_\{\\mathcal\{S\}\}\(x\)=\\phi\_\{\\theta\}\(x\),whereϕθ\\phi\_\{\\theta\}maps coordinates directly to a high\-dimensional feature space\. This formulation is independent of any discretization and naturally supports arbitrary query resolutions and irregular locations\. Unlike grid\-based representations, the SBN does not construct the solution through convolution or interpolation\. Instead, it learns a parameterization of the solution space, providing a continuous spatial basis through which the cross\-domain operator realization is carried out\.

#### Adaptive phase\-modulated cross\-domain coupling operator

With the input function encoded as a global spectral representationΨℋ\(f\)\\Psi\_\{\\mathcal\{H\}\}\(f\)and the solution domain represented through spatial featuresΦ𝒮\(x\\Phi\_\{\\mathcal\{S\}\}\(x\), UFO realizes the operator through a non\-separable and phase\-modulated cross\-domain coupling:

𝒢α\(f\)\(x\)=𝒞α\(Φ𝒮\(x\),Ψℋ\(f\)\)\.\\mathcal\{G\}\_\{\\alpha\}\(f\)\(x\)=\\mathcal\{C\}\_\{\\alpha\}\\big\(\\Phi\_\{\\mathcal\{S\}\}\(x\),\\Psi\_\{\\mathcal\{H\}\}\(f\)\\big\)\.The coupling operator𝒞α\\mathcal\{C\}\_\{\\alpha\}is designed to model structured interactions between the two representations\. LetΨℋ\(f\)=a\+ib,a,b∈ℝC,\\Psi\_\{\\mathcal\{H\}\}\(f\)=a\+ib,\\ a,b\\in\\mathbb\{R\}^\{C\},andΦ𝒮\(x\)∈ℝC\\Phi\_\{\\mathcal\{S\}\}\(x\)\\in\\mathbb\{R\}^\{C\}\. We construct a joint featureη\(Φ𝒮\(x\),Ψℋ\(f\)\)=\[Φ𝒮\(x\),a,b\],\\eta\(\\Phi\_\{\\mathcal\{S\}\}\(x\),\\ \\Psi\_\{\\mathcal\{H\}\}\(f\)\)=\\big\[\\Phi\_\{\\mathcal\{S\}\}\(x\),\\,a,\\,b\\big\],which is used to generate the coupling phase:

α=γθ\(η\(Φ𝒮\(x\),Ψℋ\(f\)\)\),\\alpha=\\gamma\_\{\\theta\}\\big\(\\eta\(\\Phi\_\{\\mathcal\{S\}\}\(x\),\\ \\Psi\_\{\\mathcal\{H\}\}\(f\)\)\\big\),whereγθ\\gamma\_\{\\theta\}is a learnable mapping\.

The coupling operator is then realized through a phase\-modulated interaction, where bounded trigonometric modulation \(sin2⁡α\+cos2⁡α=1\\sin^\{2\}\\alpha\+\\cos^\{2\}\\alpha=1\) provides a stable mechanism for structure\-aware cross\-domain coupling

𝒢α\(f\)\(x\)\\displaystyle\\mathcal\{G\}\_\{\\alpha\}\(f\)\(x\)=⟨Φ𝒮\(x\),cos⁡α⊙a\+sin⁡α⊙b⟩\\displaystyle=\\langle\\Phi\_\{\\mathcal\{S\}\}\(x\),\\,\\cos\\alpha\\odot a\+\\sin\\alpha\\odot b\\rangle=∑c=1C\(ur\(c\)\+ui\(c\)\),\\displaystyle=\\sum\_\{c=1\}^\{C\}\\big\(u\_\{r\}^\{\(c\)\}\+u\_\{i\}^\{\(c\)\}\\big\),\(2\)whereur=cos⁡\(α\)⊙a⊙Φ𝒮\(x\),ui=sin⁡\(α\)⊙b⊙Φ𝒮\(x\)\.u\_\{r\}=\\cos\(\\alpha\)\\odot a\\odot\\Phi\_\{\\mathcal\{S\}\}\(x\),\\ u\_\{i\}=\\sin\(\\alpha\)\\odot b\\odot\\Phi\_\{\\mathcal\{S\}\}\(x\)\.Rather than decomposing the operator into independent mappings over input and output domains, UFO realizes it as a jointly conditioned, non\-separable, and phase\-modulated interaction between heterogeneous representations\.

## 3Experiments and results

We design four benchmarks with complementary roles to evaluate how an operator is realized, along with the predicted performance\.*StepHeat*targets spectral bias induced by discontinuous inputs,*δ\\delta\-Helmholtz*tests translation\-consistent realization under strong extrapolation,*Burgers*focuses on nonlinear structure preservation, and*GRF\-Helmholtz*examines frequency\-dominated Gaussian random field \(GRF\) operators\. These problems are chosen to expose distinct regimes in which different operator realizations succeed or degrade\.

Accordingly, we report relativeL2L^\{2\}error for pointwise accuracy and Barron norm\(Barron,[2002](https://arxiv.org/html/2605.12700#bib.bib10)\)relative error, which is quantified between the predicted and reference solutions in a frequency\-weighted spectral norm\. It is particularly useful for evaluating the model’s ability to capture oscillatory or high\-frequency features\(Khodakaramiet al\.,[2026b](https://arxiv.org/html/2605.12700#bib.bib11)\)\. A larger value indicates that the prediction deviates more from the reference solution in spectral content, suggesting a more inaccurate capture of high\-frequency components\. Moreover, we analyze qualitative structure through field visualizations and contour alignment\. We further distinguish in\-distribution \(ID\) from bidirectional out\-of\-distribution \(OOD\) generalization, and examine varying input/output resolutions to test whether the learned operator is tied to a fixed discretization\. These evaluations are central to UFO: the goal is not only an accurate approximation, but also a cross\-domain operator realization that remains structurally coherent across regimes, resolutions, and distribution shifts\.

### 3\.1StepHeat: spectral bias under discontinuous inputs

This benchmark provides a controlled setting to evaluate how operator realization differs between single\-domain and cross\-domain learning frameworks under spectral bias, particularly in capturing and propagating the high\-frequency modes induced by non\-smooth inputs\.

We consider a one\-dimensional heat\-type benchmark on\(x,t\)∈\[0,1\]×\[0,1\]\(x,t\)\\in\[0,1\]\\times\[0,1\]with homogeneous Dirichlet boundary conditions,

ut=βuxx,\\displaystyle u\_\{t\}=\\beta u\_\{xx\},u\(0,t\)=u\(1,t\)=0,\\displaystyle u\(0,t\)=u\(1,t\)=0,\(3\)and a discontinuous step initial conditionu\(x,0\)=f0\(x;s\)=𝟙x\>s,u\(x,0\)=f\_\{0\}\(x;s\)=\\mathds\{1\}\_\{x\>s\},wheresscontrols the discontinuity location\.

To control the spectral difficulty, we use the following sine\-series realization:u\(x,t;s\)=∑n=1Nan\(κ\)\(s\)sin⁡\(nκπx\)e−β\(nκπ\)2t,\\displaystyle u\(x,t;s\)=\\sum\_\{n=1\}^\{N\}a\_\{n\}^\{\(\\kappa\)\}\(s\)\\,\\sin\(n\\kappa\\pi x\)\\,e^\{\-\\beta\(n\\kappa\\pi\)^\{2\}t\},with coefficientsan\(κ\)\(s\)=2nκπ\(cos⁡\(nκπs\)−cos⁡\(nκπ\)\),a\_\{n\}^\{\(\\kappa\)\}\(s\)=\\frac\{2\}\{n\\kappa\\pi\}\\Big\(\\cos\(n\\kappa\\pi s\)\-\\cos\(n\\kappa\\pi\)\\Big\),whereκ\\kappais a frequency\-scaling factor controlling the spectral complexity\. Whenκ=1\\kappa=1, this reduces to the standard sine\-series solution; largerκ\\kappayields a more spectrally demanding regime\. We setκ=20\\kappa=20and the diffusivity constantβ=6\.25×10−4\\beta=6\.25\\times 10^\{\-4\}\. Finally, we generate 128 training samples by varying the discontinuity locations∈\[0\.3,0\.7\]s\\in\[0\.3,0\.7\]\.

Table 1:ID performance on StepHeat with varying discontinuity locationss\. RelativeL2L^\{2\}and Barron errors are computed over the full spatio\-temporal solution field, lower is better\.In this benchmark,ssinduces distinct spectral patterns, making the problem challenging even in the ID regime as shown in Table[1](https://arxiv.org/html/2605.12700#S3.T1)\. This creates a particularly demanding setting for neural operators: DeepONet\-type methods are prone to spectral bias under such non\-smooth inputs, while FNO\-type methods face an additional challenge in handling discontinuities despite their spectral inductive bias\. UFO, by contrast, is required to capture and propagate these discontinuity\-induced high\-frequency modes through cross\-domain realization\. StepHeat, therefore, serves as a stress test of whether an operator realization can remain accurate and structurally consistent under both non\-smooth inputs and strong spectral demands\.

Table[1](https://arxiv.org/html/2605.12700#S3.T1)shows clearly that UFO achieves the lowest relativeL2L^\{2\}error on four of the six cases and remains highly competitive on the other two\. UFO also attains the best Barron error on the first three cases, while DeepONet is slightly better on the remaining three\. This indicates that UFO improves both pointwise reconstruction accuracy and spectral structure preservation in the spatio\-temporal solution\.

The competitive performance of DeepONet is consistent with its flexibility in coordinate\-based querying\. However, the generally higher errors might be an indicator suggesting that physical\-domain realization alone is insufficient to fully capture the discontinuity\-induced high\-frequency modes\. FNO performs substantially worse across most cases, with especially large degradation where Barron errors increase sharply\. This behavior indicates that, despite its spectral inductive bias, FNO is more sensitive to the discontinuous input family and presents unstable realization quality in the StepHeat benchmark\.

### 3\.2δ\\delta\-Helmholtz: translation\-consistent realization

We consider the parametric 2D Helmholtz equation onΩ=\[0,1\]2\\Omega=\[0,1\]^\{2\}with homogeneous Dirichlet boundary conditions,

uxx\+uyy\+k2u=f,\(x,y\)∈Ω,u\|∂Ω=0\.u\_\{xx\}\+u\_\{yy\}\+k^\{2\}u=f,\\qquad\(x,y\)\\in\\Omega,\\quad u\|\_\{\\partial\\Omega\}=0\.\(4\)The defined analytical solution isu\(x,y;δ;k=10\)=\(x\+y\)sin⁡\(10πx\)sin⁡\(10πy\)\+δ,u\(x,y;\\delta;k=10\)=\(x\+y\)\\sin\(10\\pi x\)\\sin\(10\\pi y\)\+\\delta,whereδ\\deltaacts as an additive global shift\. The forcing term with mismatch frequency \(k=1k=1\) is given byf\(x,y;δ;k=1\)=2π\(sin⁡\(πx\)cos⁡\(πy\)\+cos⁡\(πx\)sin⁡\(πy\)\)\+\(1−2π2\)\(x\+y\)sin⁡\(πx\)sin⁡\(πy\)\+δ\.f\(x,y;\\delta;k=1\)=\\ 2\\pi\\big\(\\sin\(\\pi x\)\\cos\(\\pi y\)\+\\cos\(\\pi x\)\\sin\(\\pi y\)\\big\)\+\\big\(1\-2\\pi^\{2\}\\big\)\(x\+y\)\\sin\(\\pi x\)\\sin\(\\pi y\)\+\\delta\.

This benchmark is a dual\-difficulty task of irregular sampling and cross\-frequency, targeting translation\-consistent operator realization\. For UFO, each input sample is observed on a randomly and non\-uniformly sampled set of locations, and we use regular sampling for DeepONet and FNO due to their architectures\. Using 256 samples, we evaluate strong global\-shift extrapolation cases to compare the performance of different neural operators under spectral mismatch\.

Fig\.[2](https://arxiv.org/html/2605.12700#S3.F2)shows that UFO is the most stable method across both interpolation and extrapolation cases\. Atδ=4\.3\\delta=4\.3, UFO achieves the lowest relativeL2L^\{2\}and Barron errors, indicating the best performance in both pointwise accuracy and spectral consistency\. Under strong extrapolation \(δ=±30\.8\\delta=\\pm 30\.8\), UFO continues to preserve the globally shifted oscillatory structure with mild amplitude deviation, whereas DeepONet exhibits severe structural distortion and FNO collapses under large shifts despite remaining competitive interpolation\. Since each UFO input sample is observed on a randomly non\-uniform discretization, these results verify that cross\-domain realization can maintain translation consistency and stable generalization\.

![Refer to caption](https://arxiv.org/html/2605.12700v1/x2.png)Figure 2:Qualitative comparison onδ\\delta\-Helmholtz for interpolation and extrapolation of global shifts\. The model is trained onδ∈\[−5,5\]\\delta\\in\[\-5,5\]with 256 samples in total\.δ=4\.3\\delta=4\.3is interpolation, whileδ=±30\.8\\delta=\\pm 30\.8are strong extrapolation cases\. Each UFO sample is observed on a randomly non\-uniform input discretization\.
### 3\.3Structure\-preserving evaluation on 2D steady Burgers equation

We further evaluate the models on a 2D steady Burgers equation,

uux\+uuy−ν\(uxx\+uyy\)=f,\(x,y\)∈\[0,1\]2,u\|∂Ω=0uu\_\{x\}\+uu\_\{y\}\-\\nu\(u\_\{xx\}\+u\_\{yy\}\)=f,\\quad\(x,y\)\\in\[0,1\]^\{2\},\\quad u\|\_\{\\partial\\Omega\}=0with a manufactured solutionu\(x,y\)=x\(1−x\)y\(1−y\)exp\(λ\(x−y\)u\(x,y\)=x\(1\-x\)y\(1\-y\)\\exp\(\\lambda\(x\-y\)\), whereν\\nuis set to 0\.05 andλ\\lambdacontrols the deformation of the solution profile, inducing systematic changes in the spatial organization of the field\. Models are trained onλ∈\[3,6\]\\lambda\\in\[3,6\]with 128 samples, and evaluating includes both ID and bidirectional OOD generalization, as shown in Fig\.[3](https://arxiv.org/html/2605.12700#S3.F3)and Figs\.[7](https://arxiv.org/html/2605.12700#A1.F7)\-[9](https://arxiv.org/html/2605.12700#A1.F9)\(in Appendix\)\.

![Refer to caption](https://arxiv.org/html/2605.12700v1/x3.png)Figure 3:Structural comparison on the parametric 2D Burgers equation under bidirectional OOD generalization\. Models are trained onλ∈\[3,6\]\\lambda\\in\[3,6\]\. A discretized colormap is used to reveal topological inconsistencies more clearly, where regions belonging to the same value range may become fragmented or disconnected\.In the ID regimes, all methods produce visually accurate solutions \(Figs\.[7](https://arxiv.org/html/2605.12700#A1.F7)\-[9](https://arxiv.org/html/2605.12700#A1.F9)in Appendix\)\. However, quantitative differences are evident: UFO achieves the lowest errors in relativeL2L^\{2\}and Barron errors, outperforming both DeepONet and FNO\. DeepONet yields stable but consistently higher errors, while FNO exhibits the largest errors despite visually plausible reconstructions\.

Under OOD conditions in Fig\.[3](https://arxiv.org/html/2605.12700#S3.F3), on the left side \(λ=1\.5,2\.0\\lambda=1\.5,2\.0\), UFO achieves the lowest relativeL2L^\{2\}and Barron errors, while preserving the contour geometry almost exactly\. FNO remains competitive in pointwise accuracy but already exhibits mild topological inconsistency, whereas DeepONet shows clear geometric bias despite producing a smooth field\.

The difference becomes more pronounced under right\-side extrapolation\. Atλ=6\.6\\lambda=6\.6andλ=7\.5\\lambda=7\.5, UFO continues to achieve the lowest relativeL2L^\{2\}and Barron errors while preserving the main nonlinear level\-set structure\. Unlike the left\-side cases, DeepONet becomes more competitive and produces smoother contours\. FNO degenerates sharply, with spurious structures and fragmented contours\. This indicates that the baselines are sensitive to the extrapolation direction, whereas UFO remains stable across both sides\.

### 3\.4GRF\-Helmholtz: frequency\-dominated operator realization

The last benchmark targets frequency\-dominated operator realization on stochastic random field inputs\. In contrast to the preceding parameterized benchmarks, which emphasize discontinuities, nonlinear structure, or global shifts, this setting probes whether a neural operator can stably realize solutions when the input is drawn from a spatially correlated GRF family with rich spectral content\.

We consider the 2D Helmholtz equation onΩ=\[0,1\]2\\Omega=\[0,1\]^\{2\}with homogeneous Dirichlet boundary conditions as shown in \([4](https://arxiv.org/html/2605.12700#S3.E4)\) andkkis the wave number\. The input forcingf\(x,yf\(x,y\) is generated as a spatially correlated GRF using a Matérn kernel with smoothness parameter\(ν=1\.5\)\(\\nu=1\.5\)\. The full 2D GRF is constructed through a Kronecker\-product Cholesky decomposition\. Givenff, the solutionu\(x,yu\(x,y\) is computed numerically by finite\-difference discretization with sparse matrix formulation\.

In the GRF\-Helmholtz setting, the model is trained on correlation lengthsℓ∈\{0\.1,0\.2,0\.3\}\\ell\\in\\\{0\.1,0\.2,0\.3\\\}with 150 training samples each and evaluated atℓ=0\.05\\ell=0\.05andℓ=0\.35\\ell=0\.35, corresponding to bidirectional extrapolation toward both shorter and longer correlation scales\.

![Refer to caption](https://arxiv.org/html/2605.12700v1/x4.png)

![Refer to caption](https://arxiv.org/html/2605.12700v1/x5.png)

Figure 4:Comparison on GRF\-Helmholtz under OOD correlation lengths with moderate wave numberk=60k=60in the top panel and extreme wave numberk=120k=120in the bottom panel\.This testbed is designed to enforce learning from unstructured, non\-parametric, and highly oscillatory random inputs\. As expected, FNO achieves the best relativeL2L^\{2\}and Barron errors in all cases visualized in Fig\.[4](https://arxiv.org/html/2605.12700#S3.F4), reflecting its strong inductive bias for frequency\-dominated Helmholtz operators\. UFO remains consistently competitive and substantially outperforms DeepONet, especially for extreme casek=120k=120, where DeepONet exhibits severe structural distortion and large spectral error\.

These results show that GRF\-Helmholtz lies close to the natural advantage region of spectral\-domain methods\. Nevertheless, UFO preserves the dominant random\-field structures across the OOD correlation lengths under both wave\-number regimes\. Together with the stronger results on StepHeat,δ\\delta\-Helmholtz, and Burgers, this suggests that UFO provides a more balanced operator realization mechanism across heterogeneous regimes, rather than specializing only to spectral\-domain problems\.

### 3\.5Discretization\-decoupled operator realization

A key theoretical advantage of UFO is discretization decoupling: the input function can be observed at resolutions or locations different from those used during training, while the solution can be queried at arbitrary output resolutions\.

We evaluate this property from two complementary perspectives that align with the practical concerns\. First, we decrease the resolution at which the input function is observed while keeping the output evaluation grid fixed, testing whether the learned operator remains stable when the input discretization differs from training\. Second, we fix the input observation and increase the output query resolution, testing whether dense solution fields can be produced without requiring equally dense input observations\.

The top row in Fig\.[5](https://arxiv.org/html/2605.12700#S3.F5)varies the input resolution while keeping the output evaluation fixed\. Across all testedλ\\lambdavalues, UFO exhibits only mild changes in relativeL2L^\{2\}and Barron errors as the input resolution decreases from100×100100\\times 100to55×5555\\times 55\. This indicates that the learned input\-domain representation is not tightly coupled to a fixed input discretization\. The degradation is most visible when the input becomes sparse and the solution regime is more difficult, but the overall error curves remain stable, showing that UFO preserves its operator realization under input resolution changes\.

The second row further tests output query resolution by increasing the evaluation grid from100×100100\\times 100to550×550550\\times 550atλ=5\.8\\lambda=5\.8\. UFO remains nearly flat in both relativeL2L^\{2\}and Barron errors, demonstrating that the predicted solution can be queried at much denser resolutions without loss of accuracy\. DeepONet also remains relatively stable, but its error is consistently higher than UFO\. In contrast, FNO deteriorates rapidly as the output resolution increases, especially in Barron error, reflecting its dependence on grid\-tied spectral realization\. These results support the key advantage of UFO: the input representation and solution query space are decoupled, enabling stable operator realization across varying input and output resolutions\.

![Refer to caption](https://arxiv.org/html/2605.12700v1/x6.png)

![Refer to caption](https://arxiv.org/html/2605.12700v1/x7.png)

Figure 5:Resolution behavior of UFO on Burgers\. Top row: relativeL2L^\{2\}and Barron errors under decreasing input resolutions for differentλ\\lambdavalues\. Second row: output query resolution study atλ=5\.8\\lambda=5\.8\.
### 3\.6Ablation study in StepHeat

To isolate the role of adaptive phase modulation coupling𝒞α\\mathcal\{C\}\_\{\\alpha\}, we consider a separable variant of UFO that keeps the SE and SBN unchanged, but replaces the phase\-modulated coupling with a fixed separable readout,

𝒢\(f\)\(x\)=⟨Φ𝒮\(x\),a\+b⟩,Ψℋ\(f\)=a\+ib\.\\mathcal\{G\}\(f\)\(x\)=\\langle\\Phi\_\{\\mathcal\{S\}\}\(x\),a\+b\\rangle,\\qquad\\Psi\_\{\\mathcal\{H\}\}\(f\)=a\+ib\.This variant preserves the two\-domain representation structure of UFO, but removes the jointly conditioned interaction,α\\alpha, between domains\. Therefore, the comparison tests whether UFO’s performance comes merely from having input\-domain and solution\-domain representations, or from the adaptive, jointly dependent interaction between them\.

Table 2:Ablation of the adaptive phaseα\\alphaon StepHeat\. The ablated variant keeps the spectral encoder and spatial basis network unchanged, but removes the jointly conditioned phase modulation and replaces Eq\. \([2\.1](https://arxiv.org/html/2605.12700#S2.Ex7)\) with a fixed separable readout\. Across all discontinuity locations, removingα\\alphasubstantially degrades both relativeL2L^\{2\}and Barron relative errors, showing that the adaptive phase is essential for coupling the input\-domain spectral representation with the solution\-domain basis in this non\-smooth, high\-frequency regime\.Table[2](https://arxiv.org/html/2605.12700#S3.T2)isolates the role of the adaptive phaseα\\alpha\. Although the model still uses two\-domain representations, its errors increase substantially across all StepHeat cases, especially ats=0\.39s=0\.39ands=0\.41s=0\.41\. The consistent degradation in both relativeL2L^\{2\}and Barron errors shows thatα\\alphais essential for adaptive cross\-domain realization in the discontinuous, high\-frequency regime\. We also observe that removingα\\alphamakes optimization substantially harder, with training losses decreasing slowly and often failing to converge to the same level as the full UFO\.

This confirms that𝒞α\\mathcal\{C\}\_\{\\alpha\}is not a cosmetic module\. It is the mechanism that turns separate domain representations into an adaptive cross\-domain operator realization\. Without it, UFO degenerates to a separable multi\-domain representation, which lacks the jointly conditioned interaction needed to preserve solution structure under distribution shift\.

## 4Discussion

By separating input\-domain encoding, solution\-domain representation, and adaptive phase\-modulated coupling, UFO reframes neural operator learning from single\-domain approximation to cross\-domain realization\. UFO provides an effective way to realize operators across heterogeneous regimes, including discontinuous inputs, irregular observations, nonlinear dynamics, and frequency\-dominated random fields\.

A key implication of UFO is discretization decoupling, which means that the input function can be observed at resolutions or locations different from those used during training, and the output solution queried at arbitrary output resolutions\. This is valuable in practical settings where dense measurements are expensive, but high\-resolution predictions are required\. However, this flexibility is not a free lunch\. Specifically, when the target operator relies strongly on localized input features, overly sparse observations will degrade the prediction\. Thus, UFO is discretization\-flexible, but the input observations must still resolve the structures that determine the operator response\.

UFO is not merely a neural operator architecture, but a way to think about operator realization beyond domain constraints\. This view opens the door to multi\-domain, time\-dependent, physics\-informed, and multimodal extensions for complex scientific systems\. Specifically, it may extend the instantiated minimal two\-domain form of UFO via additional domain incorporations, including image\-domain representations, multiple spectral transforms, and multiscale physical descriptors\. Such extensions are particularly relevant for complex systems, including CFD, reactive transport modeling, inverse problem, and coupled multiphysics dynamics\.

## Acknowledgments

This study is funded by Jane and Aatos Erkko Foundation via the project titled ML‐Mining: Machine Learning surrogate modeling for risk assessment and water quality prediction at Mining sites \(Grant 220021\)\. This work has also been supported by the Research Council of Finland project 372518\. The authors also acknowledge the support from the IT Center for Science, Finland \(CSC\), for generously sharing their computational resources\.

## References

- K\. Azizzadenesheli, N\. Kovachki, Z\. Li, M\. Liu\-Schiaffini, J\. Kossaifi, and A\. Anandkumar \(2024\)Neural operators for accelerating scientific simulations and design\.Nature Reviews Physics6,pp\. 320–328\.Cited by:[§1](https://arxiv.org/html/2605.12700#S1.p1.1)\.
- B\. Bahmani, S\. Goswami, I\. G\. Kevrekidis, and M\. D\. Shields \(2025\)A resolution independent neural operator\.Computer Methods in Applied Mechanics and Engineering444,pp\. 118113\.Cited by:[§1](https://arxiv.org/html/2605.12700#S1.p5.1)\.
- A\. R\. Barron \(2002\)Universal approximation bounds for superpositions of a sigmoidal function\.IEEE Transactions on Information Theory39\(3\),pp\. 930–945\.Cited by:[§3](https://arxiv.org/html/2605.12700#S3.p2.1)\.
- Q\. Cao, S\. Goswami, and G\. E\. Karniadakis \(2024\)Laplace neural operator for solving differential equations\.Nature Machine Intelligence6,pp\. 631–640\.Cited by:[§1](https://arxiv.org/html/2605.12700#S1.p5.1)\.
- Q\. Cheng, M\. H\. Sahadath, H\. Yang, S\. Pan, and W\. Ji \(2025\)Surrogate modeling of heat transfer under flow fluctuation conditions using fourier basis\-deep operator network with uncertainty quantification\.Progress in Nuclear Energy188,pp\. 105895\.Cited by:[§1](https://arxiv.org/html/2605.12700#S1.p4.1)\.
- B\. Choi, H\. S\. Jin, and B\. Lkhagvasuren \(2024\)Applications of the fourier neural operator in a regional ocean modeling and prediction\.Frontiers in Marine ScienceVolume 11 \- 2024\.Cited by:[§1](https://arxiv.org/html/2605.12700#S1.p1.1)\.
- X\. Fu, G\. Chen, Y\. Li, X\. Liu, L\. Chen, Q\. Meng, C\. Liu, and X\. Hao \(2025\)Spatio\-temporal neural operator on complex geometries\.Computer Physics Communications315,pp\. 109754\.Cited by:[§1](https://arxiv.org/html/2605.12700#S1.p4.1)\.
- Z\. Hao, Z\. Wang, H\. Su, C\. Ying, Y\. Dong, S\. Liu, Z\. Cheng, J\. Song, and J\. Zhu \(2023\)GNOT: a general neural operator transformer for operator learning\.InProceedings of the 40th International Conference on Machine Learning,A\. Krause, E\. Brunskill, K\. Cho, B\. Engelhardt, S\. Sabato, and J\. Scarlett \(Eds\.\),Proceedings of Machine Learning Research, Vol\.202,pp\. 12556–12569\.Cited by:[§1](https://arxiv.org/html/2605.12700#S1.p4.1)\.
- P\. Huang, Y\. Leng, C\. Lian, and H\. Liu \(2024\)Porous\-deeponet: learning the solution operators of parametric reactive transport equations in porous media\.Engineering39,pp\. 94–103\.Cited by:[§1](https://arxiv.org/html/2605.12700#S1.p1.1)\.
- Z\. Jiang, M\. Zhu, and L\. Lu \(2024\)Fourier\-mionet: fourier\-enhanced multiple\-input neural operators for multiphase modeling of geological carbon sequestration\.Reliability Engineering and System Safety251,pp\. 110392\.Cited by:[§1](https://arxiv.org/html/2605.12700#S1.p4.1)\.
- S\. Khodakarami, V\. Oommen, A\. Bora, and G\. E\. Karniadakis \(2026a\)Mitigating spectral bias in neural operators via high\-frequency scaling for physical systems\.Neural Networks193,pp\. 108027\.Cited by:[§1](https://arxiv.org/html/2605.12700#S1.p4.1)\.
- S\. Khodakarami, V\. Oommen, N\. A\. Daryakenari, M\. Beekenkamp, and G\. E\. Karniadakis \(2026b\)Spectral bias in physics\-informed and operator learning: analysis and mitigation guidelines\.arXiv preprint arXiv:2602\.19265v1\.Cited by:[§3](https://arxiv.org/html/2605.12700#S3.p2.1)\.
- K\. Kontolati, S\. Goswami, G\. E\. Karniadakis, and M\. D\. Shields \(2024\)Learning nonlinear operators in latent spaces for real\-time predictions of complex dynamics in physical systems\.Nature Communications15,pp\. 5101\.Cited by:[§1](https://arxiv.org/html/2605.12700#S1.p5.1)\.
- N\. Kovachki, Z\. Li, B\. Liu, K\. Azizzadenesheli, K\. Bhattacharya, A\. Stuart, and A\. Anandkumar \(2023\)Neural operator: learning maps between function spaces with applications to pdes\.Journal of Machine Learning Research24\(1\),pp\. 4061–4157\.Cited by:[§1](https://arxiv.org/html/2605.12700#S1.p1.1),[§1](https://arxiv.org/html/2605.12700#S1.p3.1)\.
- Z\. Li, D\. Z\. Huang, B\. Liu, and A\. Anandkumar \(2023a\)Fourier neural operator with learned deformations for pdes on general geometries\.Journal of Machine Learning Research24\(388\),pp\. 1–26\.Cited by:[§1](https://arxiv.org/html/2605.12700#S1.p4.1)\.
- Z\. Li, N\. B\. Kovachki, K\. Azizzadenesheli, B\. liu, K\. Bhattacharya, A\. Stuart, and A\. Anandkumar \(2021\)Fourier neural operator for parametric partial differential equations\.InProceedings of the ICLR 2021,Cited by:[§1](https://arxiv.org/html/2605.12700#S1.p2.1)\.
- Z\. Li, N\. Kovachki, C\. Choy, B\. Li, J\. Kossaifi, S\. Otta, M\. A\. Nabian, M\. Stadler, C\. Hundt, K\. Azizzadenesheli, and A\. Anandkumar \(2023b\)Geometry\-informed neural operator for large\-scale 3d pdes\.InAdvances in Neural Information Processing Systems,Vol\.36,pp\. 35836–35854\.Cited by:[§1](https://arxiv.org/html/2605.12700#S1.p4.1)\.
- Z\. Li, H\. Zheng, N\. Kovachki, D\. Jin, H\. Chen, B\. Liu, K\. Azizzadenesheli, and A\. Anandkumar \(2024\)Physics\-informed neural operator for learning partial differential equations\.ACM/IMS Journal of Data Science1\(3\),pp\. 1–27\.Cited by:[§1](https://arxiv.org/html/2605.12700#S1.p3.1)\.
- X\. Liu and H\. Tang \(2025\)DiffFNO: diffusion fourier neural operator\.InProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition \(CVPR\),pp\. 150–160\.Cited by:[§1](https://arxiv.org/html/2605.12700#S1.p4.1)\.
- L\. Lu, P\. Jin, G\. Pang, Z\. Zhang, and G\. E\. Karniadakis \(2021\)Learning nonlinear operators via deeponet based on the universal approximation theoremof operators\.Nature Machine Intelligence3,pp\. 218–229\.Cited by:[§1](https://arxiv.org/html/2605.12700#S1.p1.1),[§1](https://arxiv.org/html/2605.12700#S1.p2.1)\.
- J\. Pathak, S\. Subramanian, P\. Harrington, S\. Raja, A\. Chattopadhyay, M\. Mardani, T\. Kurth, D\. Hall, Z\. Li, K\. Azizzadenesheli, P\. Hassanzadeh, K\. Kashinath, and A\. Anandkumar \(2026\)FourCastNet: a global data\-driven high\-resolution weather model using adaptive fourier neural operators\.arXiv preprint arXiv:2202\.11214\.Cited by:[§1](https://arxiv.org/html/2605.12700#S1.p1.1)\.
- A\. Peyvan, V\. Kumar, and G\. E\. Karniadakis \(2026\)Fusion\-deeponet: a data\-efficient neural operator for geometry\-dependent hypersonic and supersonic flows\.Journal of Computational Physics544,pp\. 114432\.Cited by:[§1](https://arxiv.org/html/2605.12700#S1.p4.1)\.
- N\. Rahaman, A\. Baratin, D\. Arpit, F\. Draxler, M\. Lin, F\. A\. Hamprecht, Y\. Bengio, and A\. Courville \(2019\)On the spectral bias of neural networks\.InProceedings of the 36 th International Conference on Machine Learning,Cited by:[§1](https://arxiv.org/html/2605.12700#S1.p3.1)\.
- A\. Sojitra, M\. Dhingra, and O\. San \(2026\)FEDONet: fourier\-embedded deeponet for spectrally accurate operator learning\.arXiv preprint arXiv: 2509\.12344v4\.Cited by:[§1](https://arxiv.org/html/2605.12700#S1.p4.1)\.
- S\. Wang, X\. Yu, and P\. Perdikaris \(2022\)When and why pinns fail to train: a neural tangent kernel perspective\.Journal of Computational Physics449,pp\. 110768\.Cited by:[§1](https://arxiv.org/html/2605.12700#S1.p3.1)\.
- T\. Wang and C\. Wang \(2024\)Latent neural operator for solving forward and inverse pde problems\.InAdvances in Neural Information Processing Systems,Vol\.37,pp\. 33085–33107\.Cited by:[§1](https://arxiv.org/html/2605.12700#S1.p5.1)\.
- M\. Wei and X\. Zhang \(2023\)Super\-resolution neural operator\.InProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition \(CVPR\),pp\. 18247–18256\.Cited by:[§1](https://arxiv.org/html/2605.12700#S1.p5.1)\.
- G\. Wen, Z\. Li, K\. Azizzadenesheli, A\. Anandkumar, and S\. M\. Benson \(2022\)U\-fno\-an enhanced fourier neural operator\-based deep\-learning model for multiphase flow\.Advances in Water Resources163,pp\. 104180\.Cited by:[§1](https://arxiv.org/html/2605.12700#S1.p1.1)\.
- M\. Yin, N\. Charon, R\. Brody, L\. Lu, N\. Trayanova, and M\. Maggioni \(2024\)A scalable framework for learning the geometry\-dependent solution operators of partial differential equations\.Nature Computational Science4,pp\. 928–940\.Cited by:[§1](https://arxiv.org/html/2605.12700#S1.p4.1)\.
- Z\. Zhao, C\. Liu, Y\. Li, Z\. Chen, and X\. Liu \(2025\)Diffeomorphism neural operator for various domains and parameters of partial differential equations\.Communication Physics8,pp\. 15\.Cited by:[§1](https://arxiv.org/html/2605.12700#S1.p4.1)\.
- M\. Zhu, S\. Feng, Y\. Lin, and L\. Lu \(2023\)Fourier\-deeponet: fourier\-enhanced deep operator networks for full waveform inversion with improved accuracy, generalizability, and robustness\.Computer Methods in Applied Mechanics and Engineering416,pp\. 116300\.Cited by:[§1](https://arxiv.org/html/2605.12700#S1.p4.1)\.

## Appendix AAppendix

### A\.1Table of parameters

Table[3](https://arxiv.org/html/2605.12700#A1.T3)reports parameter counts and per\-epoch runtime\. UFO remains compact, with fewer than8×1048\\times 10^\{4\}trainable parameters in all settings, and achieves competitive runtime compared with the baselines\. It is consistently faster than FNO except on Burgers, while the additional cost relative to DeepONet reflects the spectral encoder and adaptive phase\-modulated coupling\. Thus, UFO’s performance is not driven by model size, but by its cross\-domain realization mechanism\.

Table 3:Trainable parameters and per\-epoch runtime across benchmarks\. UFO uses fewer than8×1048\\times 10^\{4\}parameters in all settings and maintains competitive runtime, while DeepONet and FNO often require larger parameter counts or higher per\-epoch cost\.StepHeatδ\\delta\-HelmholtzBurgersGRF\-Helmholtz \(k=60k=60\)GRF\-Helmholtz \(k=120k=120\)UFO48,44244,33273,45256,76251,734DeepONet27,456181,056661,056181,056181,056FNO8,422,2098,422,2092,118,27318,907,9058,422,209Runtime per epochUFO0\.087ss0\.143ss0\.145ss0\.257ss0\.228ssDeepONet0\.048ss0\.050ss0\.028ss0\.065ss0\.065ssFNO0\.126ss0\.449ss0\.090ss0\.334ss0\.265ss

### A\.2Benchmarks summary

Table 4:Summary of benchmarks used in Section[3](https://arxiv.org/html/2605.12700#S3)\. Each benchmark is designed to probe the operator realization from distinct views, including spectral bias, irregular input sampling, nonlinear structure preservation, stochastic high\-frequency fields, and discretization behavior\.
### A\.3Multi\-seed quantitative results

Table 5:Multi\-seed quantitative results across benchmarks\. We repeat each experiment with five independent training seeds\{42,200,500,2010,2026\}\\\{42,200,500,2010,2026\\\}\. The generated datasets and evaluation samples are fixed across seeds, while model initialization and training stochasticity are varied\. We report mean±\\pmstandard deviation for relativeL2L^\{2\}error and Barron norm relative error\. Bold values indicate the best mean performance for each metric\.BenchmarkUFODeepONetFNORel\.L2L^\{2\}Barron Rel\.Rel\.L2L^\{2\}Barron Rel\.Rel\.L2L^\{2\}Barron Rel\.StepHeats=0\.32s=0\.320\.1070±\\pm0\.00680\.3413±\\pm0\.02130\.1256±\\pm0\.01800\.3186±\\pm0\.02260\.2360±\\pm0\.00300\.6248±\\pm0\.0102s=0\.39s=0\.390\.1090±\\pm0\.00720\.1734±\\pm0\.01370\.2072±\\pm0\.01760\.2491±\\pm0\.00700\.3210±\\pm0\.03800\.4381±\\pm0\.0712s=0\.41s=0\.410\.1117±\\pm0\.00980\.1835±\\pm0\.01580\.2434±\\pm0\.02220\.2801±\\pm0\.00500\.3224±\\pm0\.07600\.4613±\\pm0\.1323s=0\.48s=0\.480\.1078±\\pm0\.01030\.3431±\\pm0\.02670\.1026±\\pm0\.00470\.2837±\\pm0\.02180\.2276±\\pm0\.00920\.5682±\\pm0\.0352s=0\.52s=0\.520\.1083±\\pm0\.01100\.3436±\\pm0\.02510\.1028±\\pm0\.00480\.2834±\\pm0\.02190\.2231±\\pm0\.00680\.5549±\\pm0\.0378s=0\.66s=0\.660\.0534±\\pm0\.00430\.2875±\\pm0\.01800\.0533±\\pm0\.00490\.2384±\\pm0\.01230\.1605±\\pm0\.00410\.5836±\\pm0\.0250δ\\delta\-Helmholtzδ=−30\.8\\delta=\-30\.80\.0569±\\pm0\.05011\.1394±\\pm1\.36700\.0641±\\pm0\.04980\.5667±\\pm0\.29201\.1290±\\pm1\.68561\.6335±\\pm1\.8539δ=30\.8\\delta=30\.80\.0579±\\pm0\.04320\.4856±\\pm0\.53580\.2222±\\pm0\.06153\.2333±\\pm2\.66250\.3899±\\pm0\.38850\.7637±\\pm0\.6656δ=4\.3\\delta=4\.30\.0013±\\pm0\.00060\.0343±\\pm0\.02610\.2279±\\pm0\.00440\.1331±\\pm0\.00460\.0078±\\pm0\.00000\.1042±\\pm0\.00032D Burgersλ=1\.5\\lambda=1\.50\.0676±\\pm0\.01340\.1612±\\pm0\.04770\.5387±\\pm0\.13880\.9144±\\pm0\.22880\.0782±\\pm0\.02360\.1886±\\pm0\.0240λ=2\.0\\lambda=2\.00\.0292±\\pm0\.00770\.0842±\\pm0\.03050\.4314±\\pm0\.02550\.6545±\\pm0\.04890\.0321±\\pm0\.01010\.0923±\\pm0\.0100λ=4\.5\\lambda=4\.50\.0024±\\pm0\.00200\.0083±\\pm0\.00550\.0074±\\pm0\.00360\.0524±\\pm0\.01200\.0073±\\pm0\.00050\.0238±\\pm0\.0006λ=6\.6\\lambda=6\.60\.0420±\\pm0\.02240\.0648±\\pm0\.01820\.0402±\\pm0\.02390\.1282±\\pm0\.03370\.0840±\\pm0\.01180\.2643±\\pm0\.0343λ=7\.5\\lambda=7\.50\.1404±\\pm0\.06810\.4023±\\pm0\.19230\.3052±\\pm0\.15520\.4008±\\pm0\.22720\.6383±\\pm0\.10011\.3082±\\pm0\.1478GRF\-Helmholtzk=60,ℓ=0\.05k=60,\\ \\ell=0\.051\.0486±\\pm0\.26511\.0190±\\pm0\.21811\.2074±\\pm0\.12241\.2996±\\pm0\.13470\.8428±\\pm0\.08740\.8700±\\pm0\.1289k=60,ℓ=0\.35k=60,\\ \\ell=0\.351\.0027±\\pm0\.38611\.0747±\\pm0\.40270\.7120±\\pm0\.12220\.6785±\\pm0\.10040\.6176±\\pm0\.02830\.5706±\\pm0\.1164k=120,ℓ=0\.05k=120,\\ \\ell=0\.051\.2138±\\pm0\.19312\.6334±\\pm0\.30252\.9746±\\pm1\.675311\.3943±\\pm5\.01780\.9096±\\pm0\.02381\.2323±\\pm0\.0377k=120,ℓ=0\.35k=120,\\ \\ell=0\.350\.9989±\\pm0\.22652\.7161±\\pm0\.30491\.5353±\\pm0\.39906\.8914±\\pm3\.38880\.7084±\\pm0\.00341\.0692±\\pm0\.0070

### A\.4Additional analysis of input observation resolution

![Refer to caption](https://arxiv.org/html/2605.12700v1/x8.png)Figure 6:In\-distribution input\-resolution analysis of UFO on the 2D Burgers equation\. UFO is trained with100×100100\\times 100input observations and evaluated atλ=4\.5\\lambda=4\.5using progressively coarser input resolutions from100×100100\\times 100to55×5555\\times 55\. The output query resolution is fixed\. Unlike the OOD setting in Fig\.[5](https://arxiv.org/html/2605.12700#S3.F5), where UFO remains largely stable under input sparsification, the in\-distribution error increases smoothly as input observations become coarser, reflecting the benefit of dense input information for fine\-grained ID reconstruction\.Fig\.[6](https://arxiv.org/html/2605.12700#A1.F6)complements the main resolution\-decoupling study\. While Fig\.[5](https://arxiv.org/html/2605.12700#S3.F5)shows that UFO remains largely stable under input sparsification in the OOD regime, this ID test atλ=4\.5\\lambda=4\.5reveals a smooth increase in both relativeL2L^\{2\}and Barron errors as the input observation resolution decreases\. This does not contradict discretization decoupling: UFO can process input observations at resolutions different from training without requiring a fixed grid, but denser observations still provide more information for fine\-grained ID reconstruction\. Thus, UFO is discretization\-decoupled rather than information\-free, and its accuracy degrades gradually when input information is reduced\.

### A\.5In distribution predictions on 2D steady Burgers equation

Figs[7](https://arxiv.org/html/2605.12700#A1.F7)\-[9](https://arxiv.org/html/2605.12700#A1.F9)provide additional in\-distribution results on the parametric 2D Burgers equation\. Models are trained onλ∈\[3,6\]\\lambda\\in\[3,6\]and evaluated atλ=3\.2,3\.8,4\.2,5\.8\\lambda=3\.2,3\.8,4\.2,5\.8\. The first row shows the ground truth, the second row shows the prediction, and the third row visualizes the absolute error\. The two numbers below each column report relative errors ofL2L^\{2\}and Barron norm, respectively\.

All three models recover the main solution structure within the training range, but their error patterns differ substantially\. UFO achieves the lowest relative errors ofL2L^\{2\}and Barron norm across all testedλ\\lambdavalues, with relativeL2L^\{2\}errors on the order of10−410^\{\-4\}and Barron errors below8×10−38\\times 10^\{\-3\}\. Its absolute error remains localized and low\-magnitude, indicating that the learned operator preserves both pointwise accuracy and the spectral structure of the solution\.

DeepONet produces visually plausible fields, but its errors are consistently larger than those of UFO, especially in Barron error\. This suggests that although coordinate\-based querying captures the global solution geometry, the learned representation is less accurate in preserving spectral content\. FNO also reconstructs the coarse solution structure, but exhibits larger error near boundary and high\-gradient regions, and its Barron error is substantially higher than UFO\. Overall, the in\-distribution results confirm that UFO’s advantage is not limited to OOD extrapolation: even within the training regime, it provides more accurate and spectrally consistent operator realization\.

![Refer to caption](https://arxiv.org/html/2605.12700v1/x9.png)Figure 7:In\-distribution predictions on the parametric 2D Burgers equation using UFO\.![Refer to caption](https://arxiv.org/html/2605.12700v1/x10.png)Figure 8:In\-distribution predictions on the parametric 2D Burgers equation using DeepONet\.![Refer to caption](https://arxiv.org/html/2605.12700v1/x11.png)Figure 9:In\-distribution predictions on the parametric 2D Burgers equation using FNO\.
### A\.6Additional StepHeat temporal profiles

Figs\.[10](https://arxiv.org/html/2605.12700#A1.F10)\-[12](https://arxiv.org/html/2605.12700#A1.F12)provide a profile\-level view of the StepHeat predictions corresponding to the quantitative results in Table[1](https://arxiv.org/html/2605.12700#S3.T1)\. The ground\-truth lines are shown in black\. Across different discontinuity locations, all models capture the dominant oscillatory structure and the diffusion\-induced amplitude decay over time, confirming that the main temporal dynamics are learned\. The remaining discrepancies are concentrated in the high\-frequency peaks and troughs, especially at early time snapshots where the discontinuity\-induced modes are strongest\.

UFO generally tracks the reference profiles closely across the selectedssvalues, with small deviations in peak amplitude and phase\. DeepONet also produces smooth temporal profiles, but shows more noticeable local amplitude mismatch in several cases, consistent with its larger spectral errors in Table[1](https://arxiv.org/html/2605.12700#S3.T1)\. FNO captures the periodic structure well in many snapshots, yet exhibits sharper local deviations near high\-frequency extrema, reflecting its sensitivity to non\-smooth input\-induced modes\.

![Refer to caption](https://arxiv.org/html/2605.12700v1/x12.png)

![Refer to caption](https://arxiv.org/html/2605.12700v1/x13.png)

![Refer to caption](https://arxiv.org/html/2605.12700v1/x14.png)

Figure 10:Temporal profile visualization on StepHeat\. Predictions are shown for UFO, ats=0\.32,0\.41,0\.52s=0\.32,0\.41,0\.52\. Each subplot compares the predicted and reference one\-dimensional solution profiles at multiple time snapshotst∈\{0\.1,0\.25,0\.5,0\.75,0\.9\}t\\in\\\{0\.1,0\.25,0\.5,0\.75,0\.9\\\}\.![Refer to caption](https://arxiv.org/html/2605.12700v1/x15.png)

![Refer to caption](https://arxiv.org/html/2605.12700v1/x16.png)

![Refer to caption](https://arxiv.org/html/2605.12700v1/x17.png)

Figure 11:Temporal profile visualization on StepHeat\. Predictions are shown for DeepONet, ats=0\.32,0\.41,0\.52s=0\.32,0\.41,0\.52\. Each subplot compares the predicted and reference one\-dimensional solution profiles at multiple time snapshotst∈\{0\.1,0\.25,0\.5,0\.75,0\.9\}t\\in\\\{0\.1,0\.25,0\.5,0\.75,0\.9\\\}\.![Refer to caption](https://arxiv.org/html/2605.12700v1/x18.png)

![Refer to caption](https://arxiv.org/html/2605.12700v1/x19.png)

![Refer to caption](https://arxiv.org/html/2605.12700v1/x20.png)

Figure 12:Temporal profile visualization on StepHeat\. Predictions are shown for FNO, ats=0\.32,0\.41,0\.52s=0\.32,0\.41,0\.52\. Each subplot compares the predicted and reference one\-dimensional solution profiles at multiple time snapshotst∈\{0\.1,0\.25,0\.5,0\.75,0\.9\}t\\in\\\{0\.1,0\.25,0\.5,0\.75,0\.9\\\}\.
### A\.7Additional visualization ofδ\\delta\-Helmholtz predictions

Figs\.[13](https://arxiv.org/html/2605.12700#A1.F13)\-[15](https://arxiv.org/html/2605.12700#A1.F15)provide additional visualizations with colorbars and absolute error maps\. Colorbars are included to expose amplitude shifts and error magnitudes that are not visible from the compact comparison in the main text\. The ID case confirms that UFO yields the smallest and most localized residual error, while DeepONet and FNO exhibit more structured error patterns\. In the strong OOD cases \(δ=±30\.8\\delta=\\pm 30\.8\), the gap becomes more pronounced: UFO preserves the globally shifted oscillatory field with moderate localized errors, whereas DeepONet suffers from structural distortion and FNO often collapses to an incorrect amplitude regime\. These results provide a more detailed view of the failure modes behind the compact comparison in the main text\.

### A\.8Additional GRF\-Helmholtz samples

Fig\.[16](https://arxiv.org/html/2605.12700#A1.F16)shows additional samples confirm the same trend as discussed in Fig\.[4](https://arxiv.org/html/2605.12700#S3.F4)\.

![Refer to caption](https://arxiv.org/html/2605.12700v1/x21.png)

![Refer to caption](https://arxiv.org/html/2605.12700v1/x22.png)

![Refer to caption](https://arxiv.org/html/2605.12700v1/x23.png)

Figure 13:Additionalδ\\delta\-Helmholtz visualizations of UFO with colorbars and absolute error maps in the cases corresponding to Fig\.[2](https://arxiv.org/html/2605.12700#S3.F2)\.![Refer to caption](https://arxiv.org/html/2605.12700v1/x24.png)

![Refer to caption](https://arxiv.org/html/2605.12700v1/x25.png)

![Refer to caption](https://arxiv.org/html/2605.12700v1/x26.png)

Figure 14:Additionalδ\\delta\-Helmholtz visualizations of DeepONet with colorbars and absolute error maps in the cases corresponding to Fig\.[2](https://arxiv.org/html/2605.12700#S3.F2)\.![Refer to caption](https://arxiv.org/html/2605.12700v1/x27.png)

![Refer to caption](https://arxiv.org/html/2605.12700v1/x28.png)

![Refer to caption](https://arxiv.org/html/2605.12700v1/x29.png)

Figure 15:Additionalδ\\delta\-Helmholtz visualizations of FNO with colorbars and absolute error maps in the cases corresponding to Fig\.[2](https://arxiv.org/html/2605.12700#S3.F2)\.![Refer to caption](https://arxiv.org/html/2605.12700v1/x30.png)

![Refer to caption](https://arxiv.org/html/2605.12700v1/x31.png)

Figure 16:Multiple test samples under OOD correlation lengthsℓ=0\.05\\ell=0\.05andℓ=0\.35\\ell=0\.35, with relativeL2L^\{2\}and Barron errors reported below each prediction\. The top panel presents the predictions atk=60k=60, while the bottom panel shows the results atk=120k=120\.
UFO: A Domain-Unification-Free Operator Framework for Generalized Operator Learning

Similar Articles

Universal Approximation of Nonlinear Operators and Their Derivatives

Frequency Bias and OOD Generalization in Neural Operators under a Variable-Coefficient Wave Equation

Topology-Preserving Neural Operator Learning via Hodge Decomposition

Learning Laplacian Eigenspace with Mass-Aware Neural Operators on Point Clouds

Nonlocal operator learning for fMRI encoding and decoding tasks

Submit Feedback

Similar Articles

Universal Approximation of Nonlinear Operators and Their Derivatives
Frequency Bias and OOD Generalization in Neural Operators under a Variable-Coefficient Wave Equation
Topology-Preserving Neural Operator Learning via Hodge Decomposition
Learning Laplacian Eigenspace with Mass-Aware Neural Operators on Point Clouds
Nonlocal operator learning for fMRI encoding and decoding tasks