Hierarchical Multi-Fidelity Learning for Predicting Three-Dimensional Flame Wrinkling and Turbulent Burning Velocity
Summary
This paper introduces MuFiNNs, a hierarchical multi-fidelity neural network framework for predicting 3D flame wrinkling and turbulent burning velocity using sparse experimental data. The approach integrates low-fidelity physical trends with high-fidelity corrections to enable robust prediction and extrapolation in data-limited combustion regimes.
View Cached Full Text
Cached at: 05/12/26, 07:09 AM
# Hierarchical Multi-Fidelity Learning for Predicting Three-Dimensional Flame Wrinkling and Turbulent Burning Velocity
Source: [https://arxiv.org/html/2605.08232](https://arxiv.org/html/2605.08232)
Yu XieJunfeng YangSafa JamaliDepartment of Mechanical and Industrial Engineering, Northeastern University, Boston, MA 02115, USASchool of Mechanical Engineering, University of Leeds, Leeds, UK
###### Abstract
High\-fidelity experimental characterization of turbulent premixed flames remains severely limited by the cost and complexity of advanced diagnostics, particularly under elevated pressures and intense turbulence where measurements of coupled flame morphology and burning dynamics are sparse\. Here, we develop a hierarchical multi\-fidelity neural network framework \(MuFiNNs\) to address this challenge by integrating sparse high\-fidelity experimental data with structured low\-fidelity representations that encode dominant physical trends\. The framework combines hierarchical low\-fidelity construction with nonlinear multi\-fidelity correction to learn coupled geometric and reactive flame behavior while recovering discrepancies that simplified models alone cannot capture\. The methodology is applied to expanding turbulent premixed flames to predict both three\-dimensional flame wrinkling dynamics and turbulent mass burning velocity across varying fuels, pressures, and turbulence intensities\. Using experimentally informed low\-fidelity trend models together with sparse high\-fidelity measurements, MuFiNNs accurately reconstructs observed flame behavior, enables reliable interpolation across unseen operating conditions, and demonstrates robust extrapolative capability beyond the training domain\. Importantly, the framework remains effective even in regimes characterized by noisy, weakly structured, or experimentally inaccessible data, where conventional data\-driven approaches often fail\. These results show that hierarchical multi\-fidelity learning provides a scalable and physically grounded strategy for predictive combustion modeling in data\-limited regimes\. More broadly, this work establishes multi\-fidelity scientific machine learning as a practical framework for extracting physically meaningful predictive models from sparse experiments, particularly for instability\-dominated and turbulence\-sensitive reactive flows where conventional high\-fidelity data acquisition is prohibitively demanding\.
###### keywords:
\\KWDTurbulent premixed flames, Hydrogen combustion, Flame surface wrinkling, Turbulent burning velocity, Multi\-fidelity neural networks
††journal:Combustion and Flame\\emailauthor
s\.jamali@northeastern\.eduSafa Jamali
## Novelty and significance statement
This work presents a hierarchical multi\-fidelity learning framework for predicting coupled flame morphology and turbulent burning response in expanding premixed flames from sparse high\-fidelity experimental data\. The novelty lies in integrating experimentally informed low\-fidelity trend models with nonlinear high\-fidelity measurements to jointly predict 3D flame\-surface evolution and turbulent mass burning velocity, rather than treating flame wrinkling and burning rate as separate observables\. The framework is evaluated using methane and hydrogen flame datasets spanning fuel composition, equivalence ratio, pressure, temperature, and turbulence intensity, with systematic hold\-out tests for interpolation, extrapolation, and unseen operating conditions\. The results show that multi\-fidelity learning can recover physically consistent flame behavior in data\-limited regimes where direct 3D diagnostics and pressure\-based burning\-velocity measurements are sparse, noisy, or experimentally challenging\. This establishes a practical route for predictive combustion modeling that combines sparse experiments with structured surrogate physics for instability\-sensitive turbulent premixed flames\.
## 1Introduction
The rapid transition toward hydrogen and low\-carbon fuels has renewed interest in the dynamics of turbulent premixed flames, particularly in regimes where intrinsic flame\-front instability coexists with externally imposed turbulence\. Lean premixed hydrogen–air flames form a canonical low\-Lewis\-number system \(Le<1Le<1\), in which preferential diffusion, flame stretch sensitivity, and gas expansion jointly destabilize the flame front\. Classical analyses link diffusive–thermal imbalance and the Markstein response to the amplification and wavelength selection of finite disturbances\[Zeldovich1944,Markstein1951,Clavin1985,MatalonMatkowsky1982,Law2006,BechtoldMatalon2001\], while hydrodynamic instability and its nonlinear evolution are captured by the classical framework and related weakly nonlinear theories\[Sivashinsky1977,MichelsonSivashinsky1977,BychkovLiberman2000\]\. As a consequence, low\-LeLeflames can develop cellular structures, curvature localization, and persistently non\-Gaussian curvature statistics even in nominally laminar, outwardly propagating conditions\[WilliamsBook,Peters2000,PoinsotVeynante2005\]\.
A key practical implication is that instability\-driven morphology does not merely ’decorate’ a laminar flame; it can measurably modify the global consumption rate through surface amplification and curvature\-conditioned transport\. A sequence of recent studies has established \(i\) stretch\- and pressure\-controlled cellular onset in outwardly propagating hydrogen flames\[Xie2022FuelCellular,LiXie2024FuelEthaneH2\], \(ii\) self\-acceleration and global pulsation driven purely by intrinsic surface amplification\[Xie2023FuelPulsation\], and \(iii\) wrinkling/self\-disturbance dynamics with a pronounced convexity bias toward reactant\-facing protrusions\[Xie2024CnFwrinkling\]\. Importantly, fully three\-dimensional characterization shows that instability evolution in 3D differs fundamentally from 2D projections, with nonlinear mode competition, curvature redistribution, and topology bifurcation that only become evident with volumetric reconstructions\[Xie2025CnF3D\]\. These findings echo broader evidence that differential diffusion can imprint a directional morphology and alter local burning rates via curvature and strain\[Matalon2009,ChenIm2018\]\.
When turbulence is imposed, the interaction between turbulence\-driven wrinkling and intrinsic instability becomes strongly nonlinear and condition\-dependent\. In classical turbulent premixed combustion, turbulent burning velocity is related to flame surface area growth and flame surface density \(FSD\), through flamelet concepts and surface\-transport formulations\[Peters2000,PoinsotVeynante2005,VeynanteVervisch2002,CandelPoinsot1990,Bray1990\]\. These frameworks often assume that turbulence is the primary source of corrugation and that chemistry remains flamelet\-like\. However, in low\-LeLehydrogen flames, intrinsic instability provides an ’open’ amplification pathway: turbulence can act as a catalyst that injects energy into an already unstable morphological state, selectively enhancing convex leading structures and increasing the prevalence of cusp\-like features, rather than producing a statistically symmetric wrinkling field\[Aspden2011,ChenIm2018\]\. In contrast, thermo\-diffusively stable flames \(e\.g\. methane under conditions yielding stabilizing diffusive response\) typically exhibit turbulence\-driven area increase without generating a persistent population of sharply peaked convex structures\[Bradley2003,HowarthAspden2012\]\. This conditional response implies that models based solely on area scaling or single\-parameter wrinkling factors may be insufficient when intrinsic instability and turbulence operate on comparable time scales\.
A central difficulty in the analysis of turbulent premixed flames is the identification of observables that jointly characterize flame morphology and global burning response\. In the present work, the normalized three\-dimensional flame surface area,A3D/a3DA\_\{3D\}/a\_\{3D\}, and the turbulent mass burning velocity,utmu\_\{tm\}, are selected because together they provide a coupled geometric–reactive description of the flame\. Here,A3DA\_\{3D\}is the reconstructed wrinkled flame surface area anda3D=4πr3D2a\_\{3D\}=4\\pi r\_\{3D\}^\{2\}is the surface area of an equivalent smooth sphere with the same burned\-gas volume, so thatA3D/a3DA\_\{3D\}/a\_\{3D\}directly measures the departure from a smooth reference geometry and hence the degree of flame wrinkling and surface amplification\. This is particularly relevant in lean hydrogen flames, where flame\-surface corrugation is governed not only by turbulence\-induced folding but also by intrinsic thermo\-diffusive and hydrodynamic instabilities\[Peters2000,PoinsotVeynante2005,VeynanteVervisch2002,Xie2025CnF3D,Ahmed2021\]\. By contrast,utmu\_\{tm\}characterizes the effective global burning response of the flame and reflects the integrated outcome of flame\-surface growth, preferential diffusion, curvature effects, flame stretch, and instability\-mediated transport\[CandelPoinsot1990,Bray1990,Ahmed2021\]\. The combined consideration ofA3D/a3DA\_\{3D\}/a\_\{3D\}andutmu\_\{tm\}is therefore essential for assessing whether an increase in burning rate can be explained by flame\-area amplification alone or whether additional mechanisms contribute to the enhancement of turbulent burning\[Ahmed2021\]\. From a measurement perspective, however, these quantities are also among the most demanding to obtain simultaneously\. Accurate determination ofA3D/a3DA\_\{3D\}/a\_\{3D\}requires time\-resolved three\-dimensional flame reconstruction using advanced multi\-laser\-sheet diagnostics and substantial post\-processing, including volumetric reconstruction, triangulation, and surface analysis, with uncertainties that become particularly severe for highly wrinkled flames and under intense turbulence\[Ahmed2024PoF\]\. The extraction ofutmu\_\{tm\}from pressure\-rise measurements is likewise subject to non\-negligible uncertainty because it depends on signal quality, temporal differentiation, smoothing, and the kinematic and volumetric assumptions adopted during post\-processing; these issues are especially important at small flame radii and in unstable flames, where pressure\-based and geometry\-based characterizations may diverge\[Xie2025CnF3D,Ahmed2021,Zheng2025CnF\]\. Consequently, datasets containing simultaneous measurements ofA3D/a3DA\_\{3D\}/a\_\{3D\}andutmu\_\{tm\}over broad ranges of pressure, temperature, equivalence ratio, and turbulence intensity are inherently sparse, expensive, and heterogeneous\. This combination of strong physical relevance and severe data limitation directly motivates the use of machine\-learning\-based multi\-fidelity frameworks that can preserve the coupling between flame geometry and global burning while making efficient use of limited high\-fidelity measurements\[Xie2025CnF3D,Ahmed2021,Ahmed2024PoF,Zheng2025CnF\]\.
Recently, advances in computational power and data acquisition have enabled the widespread adoption of data\-driven modeling approaches across science and engineering\. In this context, artificial intelligence \(AI\) and machine learning \(ML\) have emerged as scalable and flexible tools for developing predictive models of complex physical systems\. In particular, ML methods have shown a strong ability to capture highly nonlinear relationships where traditional approaches face limitations\. Their ability to learn complex mappings directly from data makes them well suited for modeling multiscale and strongly coupled physical phenomena, enabling faster prediction and data\-driven discovery across a wide range of applications\[Raissi2019PINN,Karniadakis2021PIML,Raissi2020HiddenFluid,Raissi2018DeepHiddenPhysics\]\.
Following these developments, a wide range of ML algorithms have been proposed, differing in their structure, input–output representation, and target applications\. Among these, neural networks \(NNs\) have attracted significant attention due to their capacity to represent complex nonlinear relationships within data\[Zhang2016RandomizedNN\]\. Inspired by biological neural systems, NNs consist of interconnected processing units that learn functional mappings between inputs and outputs through adaptive optimization of network parameters\. Various architectures, including ANNs, DNNs, CNNs, and RNNs, have demonstrated strong performance across diverse physical applications\[Mahmoudabadbozchelou2021MFNN\]\.
Despite these successes, standard data\-driven approaches face several fundamental limitations in scientific settings\. In particular, neural networks typically require large volumes of high\-quality training data to achieve reliable predictive performance, which is often unavailable in practice\. Moreover, such models are generally limited to interpolation within the range of the training data and may fail to generalize under extrapolative conditions\. Another key limitation is the lack of embedded physical constraints, as conventional ML models primarily learn statistical correlations rather than governing principles, often leading to reduced interpretability and robustness\[Carvalho2019,Lennon2023,Buhrmester2021\]\.
To address these challenges, physics\-informed machine learning frameworks have been developed to incorporate prior physical knowledge into the learning process\. A prominent example is the Physics\-Informed Neural Network \(PINN\) framework\[Raissi2019PINN,Karniadakis2021PIML\], where governing equations are embedded into the loss function as soft constraints\. By enforcing physical consistency, PINNs reduce reliance on large datasets and improve generalization in data\-scarce regimes\. Recent developments have extended these approaches to more complex settings, including forward and inverse modeling without requiring labeled data for internal fields\[Zolfaghari2026NonlocalPINN,Mahmoudabadbozchelou2022nnPINNs\]\. Several variants have also been proposed, including PPINN for parallel\-in\-time learning\[Meng2019PPINN\], MPINN for multi\-fidelity data integration\[Meng2020MPINN,Perdikaris2017MF,Forrester2007MF,Mahmoudabadbozchelou2021MFNN,Mahmoudabadbozchelou2022DigitalRheometerTwins,Saadat2024MFNN\], and fPINN for fractional differential equations\[Dabiri2023RhINN,Dabiri2025fPINNReview,Pang2019fPINN\], along with approaches such as nPINNs and DeepONet for modeling nonlocal and operator\-based systems\[Lu2019DeepONet,Pang2020nPINN,Mangal2025\]\. In parallel, alternative paradigms such as neural operator learning have been introduced to directly learn solution operators of complex systems, enabling efficient surrogate modeling across varying conditions\[saberi2025rheoformer\]\.
In general, physical knowledge can be incorporated into neural networks either explicitly, through governing differential equations, or implicitly, through structured model design and inductive biases\. While explicit approaches are effective when accurate constitutive relations are available, implicit strategies are more suitable when the underlying physics is complex, partially known, or difficult to enforce directly\. In settings where data are limited and physics is only partially known, multi\-fidelity learning provides an effective alternative by combining data from multiple sources with different levels of accuracy\. In this framework, low\-fidelity \(Lo\-Fi\) data capture dominant trends and provide a structured baseline, while high\-fidelity \(Hi\-Fi\) data—though limited and expensive to obtain—serve to correct systematic discrepancies and anchor predictive accuracy\[Saadat2024MFNN,Mahmoudabadbozchelou2021MFNN\]\. Lo\-Fi data can be generated from simplified models, empirical correlations, or synthetic approximations, whereas Hi\-Fi data typically originate from high\-resolution experiments or detailed simulations\. This hierarchical integration enables physically consistent, robust, and data\-efficient modeling, particularly in problems where neither purely data\-driven nor fully physics\-constrained approaches are sufficient\.
In this work, we develop a data\-driven framework to model the coupled geometric and reactive behavior of turbulent premixed hydrogen flames across a wide range of thermodynamic conditions\. Leveraging limited high\-fidelity measurements of the three\-dimensional flame surface areaA3DA\_\{3D\}, flame surface radiusr3Dr\_\{3D\}, and turbulent mass burning velocityutmu\_\{tm\}, we employ a multi\-fidelity learning strategy to overcome data scarcity and capture their nonlinear interdependence across operating regimes\. This work is motivated by the lack of predictive models capable of consistently capturing the interaction between flame morphology and global burning rate in regimes where intrinsic instability and turbulence coexist, particularly in the absence of sufficient high\-fidelity data and reliable physics\-based closures\.
## 2Problem setup and Methodology
Here, we present a physics\-guided, data\-driven modeling framework that leverages neural network \(NN\) capabilities to predict experimentally measured observables from limited high\-quality data\. To this end, we adopt aMulti\-Fidelity Neural Network\(MuFiNNs\) architecture that systematically integrates structured low\-fidelity approximations with sparse high\-fidelity experimental measurements\. The objective is to construct a surrogate model that preserves the underlying physical structure of the system while achieving quantitative agreement with high\-fidelity observations across a range of operating conditions\. In the present study, the MuFiNNs framework is trained and evaluated using a limited set of turbulent premixed flame datasets for methane and hydrogen under varying thermodynamic and turbulence conditions\. The experimental database spans a range of equivalence ratios, temperatures, pressures, and turbulence intensities, thereby enabling assessment of model performance across distinct combustion regimes\. The complete operating conditions are summarized in Table[1](https://arxiv.org/html/2605.08232#S2.T1)\.
For each operating condition, the experimentally measured high\-fidelity \(Hi\-Fi\) quantities include the time\-resolved evolution of the flame radiusr\(t\)r\(t\), the wrinkled flame surface areaA\(t\)A\(t\), and the turbulent burning velocityutmu\_\{tm\}\. Consequently, the dataset exhibits heterogeneous input–output mappings, involving both temporal evolution and parameter\-dependent relationships, which necessitates a flexible learning architecture\. These measurements provide physically accurate but sparsely sampled information across turbulence intensities and different stages of flame development\. This heterogeneity, combined with the limited availability of high\-fidelity data, poses a significant challenge for conventional modeling approaches and motivates the adoption of a multi\-fidelity learning framework\.
The low\-fidelity \(Lo\-Fi\) representations are synthetically generated using regression\-based approximations designed to capture the dominant structural trends of the experimental data\. Depending on the target variable, these Lo\-Fi models are constructed using linear regression and second\-order two\-dimensional polynomial fits\. These approximations provide smooth and physically interpretable trend representations that preserve large\-scale behavior but do not fully reproduce the nonlinear complexities observed in the Hi\-Fi measurements\. The MuFiNNs architecture subsequently learns the corrective mapping between these regression\-based Lo\-Fi approximations and the experimentally measured Hi\-Fi data across all investigated combustion regimes\. By training across multiple fuels and thermodynamic conditions, the framework is designed to evaluate both interpolation within measured operating ranges and generalization across distinct combustion environments\.
### 2\.1Multi\-Fidelity Framework
In a multi\-fidelity surrogate setting, the multi\-fidelity prediction, denoted here asyMFy\_\{\\text\{MF\}\}, can be interpreted as a combination of the low\-fidelity predictionyLFy\_\{\\text\{LF\}\}and the high\-fidelity input variablesxHFx\_\{\\text\{HF\}\}\. A classical approach assumes that the high\-fidelity response can be represented as a corrected version of the low\-fidelity approximation\. This idea is often expressed through a linear correlation structure,
yHF=ρ\(xHF\)yLF\+δ\(xHF\)y\_\{\\text\{HF\}\}=\\rho\(x\_\{\\text\{HF\}\}\)\\,y\_\{\\text\{LF\}\}\+\\delta\(x\_\{\\text\{HF\}\}\)\(1\)
whereρ\(⋅\)\\rho\(\\cdot\)andδ\(⋅\)\\delta\(\\cdot\)denote multiplicative and additive correlation functions, respectively\. In this formulation,ρ\\rhoandδ\\deltaare typically calibrated to map the low\-fidelity response to the high\-fidelity level\. These correlation functions are problem\-dependent and do not possess predefined analytical forms\. Depending on the application, they may be approximated using polynomial functions, basis expansions, or data\-driven models whose coefficients are determined from the available data\.
While the linear correction model provides a useful starting point, combustion systems often exhibit strongly nonlinear interactions between turbulence intensity, flame geometry, and thermodynamic state\. As a result, the discrepancy between low\- and high\-fidelity representations cannot generally be captured through linear scaling alone\. A more flexible representation is therefore introduced,
yHF=𝒩\(xHF,yLF\)y\_\{\\text\{HF\}\}=\\mathcal\{N\}\(x\_\{\\text\{HF\}\},y\_\{\\text\{LF\}\}\)\(2\)
where𝒩\(⋅\)\\mathcal\{N\}\(\\cdot\)denotes a nonlinear transformation that maps the low\-fidelity approximation and high\-fidelity input variables to the high\-fidelity response\. In the proposed MuFiNNs architecture, this nonlinear operator is decomposed into two complementary components,
yHF=𝒩L\(xHF,yLF\)\+𝒩NL\(xHF,yLF\)y\_\{\\text\{HF\}\}=\\mathcal\{N\}\_\{L\}\(x\_\{\\text\{HF\}\},y\_\{\\text\{LF\}\}\)\+\\mathcal\{N\}\_\{NL\}\(x\_\{\\text\{HF\}\},y\_\{\\text\{LF\}\}\)\(3\)
where𝒩L\\mathcal\{N\}\_\{L\}captures the primary linear correction between fidelity levels, while𝒩NL\\mathcal\{N\}\_\{NL\}accounts for nonlinear discrepancies that cannot be represented through simple scaling\. This decomposition improves training stability by separating global trend correction from higher\-order effects, enabling the nonlinear branch to capture complex interactions associated with turbulence intensity, flame geometry, and thermodynamic state\.
Fig\. 1:Schematic representation of the Multi\-Fidelity Neural Network \(MuFiNNs\) architecture developed in the present work\. Thennnumber of input variablesxix\_\{i\}are first processed through the low\-fidelity \(LF\) neural network to synthetically generate a structured approximation,yLFy\_\{\\text\{LF\}\}\. This LF prediction, together with the original input variables, is then fed into two high\-fidelity correction branches: a linear network and a nonlinear network\. The final multi\-fidelity prediction,y^MF\\hat\{y\}\_\{\\text\{MF\}\}, is obtained by summing the outputs of these two branches\.The schematic representation of the proposed MuFiNNs architecture is shown in Fig\.[1](https://arxiv.org/html/2605.08232#S2.F1)\. The framework consists of three neural networks: a low\-fidelity \(LF\) network and two high\-fidelity \(HF\) correction branches, namely a linear branch and a nonlinear branch\. This general structure defines the multi\-fidelity correction mechanism adopted throughout the present study\. In the present work, multiple MuFiNNs models are constructed to address different prediction tasks and operating conditions\. While the set of input variables and the overall architectural structure remain consistent with that illustrated in Fig\.[1](https://arxiv.org/html/2605.08232#S2.F1), the number of hidden layers and neurons per layer are selected individually for each model\. This task\-specific configuration reflects the varying complexity of the underlying input–output relationships associated with each prediction problem\. The architectural hyperparameters are therefore tailored to the complexity of the respective task, while preserving a unified multi\-fidelity correction framework across all cases\.
Accordingly, the low\-fidelity neural network,𝒩LF\\mathcal\{N\}\_\{\\mathrm\{LF\}\}, takes the LF input variables and predicts a structured low\-fidelity response,yLFy\_\{\\mathrm\{LF\}\}, which captures the dominant global behavior of the turbulent burning parameters across the input space\. The LF outputyLFy\_\{\\mathrm\{LF\}\}, together with the original input vector𝐱\\mathbf\{x\}, is then supplied to the linear and nonlinear HF correction branches\. The linear branch producesyHF,Ly\_\{\\mathrm\{HF\},L\}and captures first\-order correlations between fidelity levels, while the nonlinear branch producesyHF,NLy\_\{\\mathrm\{HF\},NL\}and models higher\-order interactions and localized discrepancies\.
The final multi\-fidelity prediction is obtained as
yMF=yHF,L\+yHF,NLy\_\{\\mathrm\{MF\}\}=y\_\{\\mathrm\{HF\},L\}\+y\_\{\\mathrm\{HF\},NL\}\(4\)
whereyHF,Ly\_\{\\mathrm\{HF\},L\}andyHF,NLy\_\{\\mathrm\{HF\},NL\}denote the outputs of the linear and nonlinear correction networks, respectively\. The distinction between fidelity levels is rooted in the data construction strategy\. The high\-fidelity \(Hi\-Fi\) data are predetermined by the available experimental measurements for each flow condition\. In contrast, the construction of the low\-fidelity \(Lo\-Fi\) dataset is user\-defined and depends on the chosen simplified representation of the physical system\. In the present study, the Lo\-Fi data are synthetically generated from global trend models designed to capture the dominant structural behavior of the turbulent burning parameters\. The methodology used for Lo\-Fi data generation is described in the next section, along with further implementation details\. The trainable parameters of the MuFiNNs architecture are obtained by minimizing a compound loss function defined as
ϕ=MSELF\+MSEHF\+λLF∑wLF2\+λHF,NL∑wHF,NL2\\phi=\\text\{MSE\}\_\{LF\}\+\\text\{MSE\}\_\{HF\}\+\\lambda\_\{LF\}\\sum w\_\{LF\}^\{2\}\+\\lambda\_\{HF,NL\}\\sum w\_\{HF,NL\}^\{2\}\(5\)
where the mean squared error \(MSE\) is defined as
MSE=1N∑i=1N\(yactual,i−ypredicted,i\)2\\text\{MSE\}=\\frac\{1\}\{N\}\\sum\_\{i=1\}^\{N\}\\left\(y\_\{\\text\{actual\},i\}\-y\_\{\\text\{predicted\},i\}\\right\)^\{2\}\(6\)
and represents the deviation between predicted and actual data\. In Eq\. \([5](https://arxiv.org/html/2605.08232#S2.E5)\),MSELF\\text\{MSE\}\_\{LF\}andMSEHF\\text\{MSE\}\_\{HF\}denote the deviations between predicted and measured data for the low\- and high\-fidelity datasets, respectively\. In Eq\. \([5](https://arxiv.org/html/2605.08232#S2.E5)\), the first two terms enforce consistency with both fidelity levels, ensuring that the model remains faithful to the structured trends embedded in the Lo\-Fi data while simultaneously matching the experimentally measured Hi\-Fi observations\. The last two terms correspond toL2L\_\{2\}regularization \(weight decay\) applied to the LF and nonlinear high\-fidelity branches, respectively, whereλLF\\lambda\_\{LF\}andλHF,NL\\lambda\_\{HF,NL\}are regularization coefficients\. These regularization terms penalize large weight magnitudes, thereby preventing overfitting and improving generalization, particularly when the high\-fidelity dataset is limited\.
In this framework, training of the MuFiNNs models is performed using a hybrid optimization strategy\. In the first stage, all trainable parameters are optimized using the Adam optimizer with a warmup–cosine learning\-rate schedule and gradient clipping to ensure stable convergence of the coupled low\- and high\-fidelity loss terms\. To prevent unnecessary over\-training and improve computational efficiency, a plateau\-based stopping criterion is applied during the Adam stage\. For each case, training is terminated when the variation of the total loss over a moving window falls below a prescribed threshold, resulting in a case\-dependent number of iterations determined automatically by the loss behavior\. In the second stage, the solution is refined using the L\-BFGS algorithm implemented through TensorFlow Probability\. This quasi\-Newton method leverages curvature information to accelerate local convergence and improve minimization of the multi\-fidelity loss function\. All trainable parameters of the LF network and both the linear and nonlinear HF correction branches are optimized jointly during both stages\. This two\-step optimization strategy enhances numerical stability and improves final predictive accuracy\.
The multi\-fidelity formulation enables the model to benefit simultaneously from the accuracy of the high\-fidelity data and the abundance of the low\-fidelity data\. Because experimentally measured Hi\-Fi samples are comparatively sparse, the LF network provides global structural trends across the input space, guiding the HF correction branches during training\. This guidance stabilizes the optimization process and reduces the risk of divergence toward non\-physical solutions\. As a result, the MuFiNNs architecture achieves a balanced trade\-off between physical structure preservation, experimental accuracy, and predictive generalization\.
### 2\.2Training Datasets
The high\-fidelity \(Hi\-Fi\) dataset consists of three complementary data components: \(i\) geometric information derived from the reconstructed three\-dimensional flame surface, \(ii\) volume\-based measures of flame growth, and \(iii\) pressure\-based turbulent burning\-velocity measurements\. The geometric component is obtained from imaging\-based three\-dimensional flame reconstructions\. The wrinkled flame surface areaA3D\(t\)A\_\{3D\}\(t\)is computed directly from the triangulated 3D flame surface\. For each case, the reconstructed flame also provides the time\-resolved burnt volumeV3D\(t\)V\_\{3D\}\(t\), from which a volume\-equivalent radiusr3D\(t\)r\_\{3D\}\(t\)is obtained\. These quantities characterize the geometric evolution of the expanding turbulent flame\. In particular,r3Dr\_\{3D\}is used to define the smooth reference areaa3D\(t\)=4πr3D2a\_\{3D\}\(t\)=4\\pi r\_\{3D\}^\{2\}, enabling evaluation of the wrinkling ratioA3D/a3DA\_\{3D\}/a\_\{3D\}as mentioned earlier\. The third component of the Hi\-Fi dataset is the turbulent burning velocityutmu\_\{tm\}, which is computed from the measured pressure trace using a thermodynamic \(fractional burning\-rate\) formulation based ondP/dt\\mathrm\{d\}P/\\mathrm\{d\}t, yielding a time\-resolved signalutm\(t\)u\_\{tm\}\(t\)\. However, since the flame radius provides a geometry\-based coordinate that is directly linked to flame surface area and burning dynamics—unlike time, which depends on the specific evolution rate of each experiment—the burning velocity is re\-parameterized using the corresponding radius history to obtainutm\(r\)u\_\{tm\}\(r\)\. This transformation removes dependence on the experimental time scale and enables a geometry\-consistent representation of the burning process, allowing meaningful comparison across different operating conditions\.
In contrast, the low\-fidelity \(Lo\-Fi\) dataset is synthetically generated using a simplified physics\-inspired trend model that approximates the dominant dependence of the system on the governing variables, namely flame radius and turbulence intensity\. The Lo\-Fi representation captures the primary structural trends observed in the experiments while remaining computationally inexpensive and densely sampled over the domain\.
To rigorously evaluate generalization, the Hi\-Fi data corresponding to selected turbulence intensity levels are excluded during training\. These masked cases are reserved exclusively for testing, enabling assessment of both interpolation within the observed range and extrapolation beyond the training regime\. This strategy provides a comprehensive evaluation of the predictive capability of the MuFiNNs framework\.
The experimental database considered in this study consists of seven operating conditions, summarized in Table[1](https://arxiv.org/html/2605.08232#S2.T1), including methane \(CH4\) and hydrogen \(H2\) mixtures spanning different equivalence ratiosϕ\\phi, initial temperaturesTT, pressuresPP, and turbulence intensity rangesu′u^\{\\prime\}\. These cases provide a broad range of thermo\-chemical and flow conditions for assessing the robustness and generalization capability of the MuFiNNs framework\.
Table 1:Summary of the investigated combustion cases and operating conditions\.
## 3Results and Discussion
In this section, the procedure for generating the Lo\-Fi data for the target variables considered in this study is described for each case\. Model performance is assessed through both interpolation and extrapolation tests, including the systematic masking of selected high\-fidelity data during training\. The predictive capability of the MuFiNNs framework is evaluated by comparing reconstructed and held\-out cases, demonstrating its ability to generalize across different turbulence intensity levels\.
### 3\.1Flame Geometry: Surface Area and Radius \(A3DA\_\{3D\},r3Dr\_\{3D\}\)
The geometric evolution of the turbulent flame is first examined through the three\-dimensional surface areaA3DA\_\{3D\}and the corresponding volume\-equivalent radiusr3Dr\_\{3D\}\. These quantities provide a direct measure of flame growth and surface wrinkling, capturing the key geometric characteristics of the expanding turbulent flame\.
For the methane \(CH4\) cases \(i–iv\), the available data at each turbulence intensity level are limited, and as summarized in Table[1](https://arxiv.org/html/2605.08232#S2.T1), these cases correspond to different combinations of temperature, pressure, and equivalence ratio\. Due to this limited data availability, directly training a single model across all conditions would not reliably capture the correct trends in the evolution of the geometric quantities\. In particular, the model would struggle to disentangle the individual effects of the thermodynamic parameters on the flame behavior, limiting its ability to perform reliable interpolation and extrapolation across different operating conditions, which is the primary objective of this work\.
To address this limitation, a hierarchical multi\-fidelity modeling strategy is adopted\. In this approach, the system is trained in a structured, step\-by\-step manner, allowing it to progressively learn the influence of each parameter separately\. This hierarchical structure reduces the complexity of the learning task by decoupling the effects of different parameters, enabling more stable and physically consistent regression under limited data conditions\. A schematic representation of this procedure is shown in Fig\.[2](https://arxiv.org/html/2605.08232#S3.F2)\.
Fig\. 2:Schematic of the hierarchical construction of the low\-fidelity \(Lo\-Fi\) representation forA3DA\_\{3D\}andr3Dr\_\{3D\}, illustrating the stepwise regression of temporal evolution and subsequent parametric dependence on operating conditions\.Within this framework, the Hi\-Fi data forA3DA\_\{3D\}andr3Dr\_\{3D\}are first expressed as functions of time and operating conditions, i\.e\.,
A3D=A3D\(t,u′,ϕ,T,P\),r3D=r3D\(t,u′,ϕ,T,P\)\.A\_\{3D\}=A\_\{3D\}\(t,u^\{\\prime\},\\phi,T,P\),\\quad r\_\{3D\}=r\_\{3D\}\(t,u^\{\\prime\},\\phi,T,P\)\.\(7\)
Accordingly, the input space of the model consists of\(t,u′,ϕ,T,P\)\(t,u^\{\\prime\},\\phi,T,P\), while the outputs correspond to the geometric quantitiesA3DA\_\{3D\}andr3Dr\_\{3D\}\.
At fixed thermodynamic conditions\(ϕ,T,P\)\(\\phi,T,P\), curve fitting is performed to capture the temporal evolution of the flame geometry for each case separately\. Specifically, the evolution ofA3DA\_\{3D\}andr3Dr\_\{3D\}is well approximated by a second\-order polynomial in logarithmic time, and a second\-order polynomial regression is therefore applied in logarithmic space, which provides a compact representation of the observed nonlinear growth trends over multiple time scales, yielding
log\(A3D\)=a0\+a1logt\+a2\(logt\)2,\\log\(A\_\{3D\}\)=a\_\{0\}\+a\_\{1\}\\log t\+a\_\{2\}\(\\log t\)^\{2\},\(8\)log\(r3D\)=b0\+b1logt\+b2\(logt\)2\.\\log\(r\_\{3D\}\)=b\_\{0\}\+b\_\{1\}\\log t\+b\_\{2\}\(\\log t\)^\{2\}\.\(9\)
The resulting coefficientsaia\_\{i\}andbib\_\{i\}are then regressed as functions of turbulence intensity and equivalence ratio, i\.e\.,\(u′,ϕ\)\(u^\{\\prime\},\\phi\), to capture their parametric dependence within each thermodynamic case\.
To account for variations in thermodynamic conditions, a correction term is introduced such that the coefficients are expressed as
ai\(u′,ϕ,T,P\)=aibase\(u′,ϕ\)\+δi\(T,P\),a\_\{i\}\(u^\{\\prime\},\\phi,T,P\)=a\_\{i\}^\{\\mathrm\{base\}\}\(u^\{\\prime\},\\phi\)\+\\delta\_\{i\}\(T,P\),\(10\)bi\(u′,ϕ,T,P\)=bibase\(u′,ϕ\)\+δi\(r\)\(T,P\),b\_\{i\}\(u^\{\\prime\},\\phi,T,P\)=b\_\{i\}^\{\\mathrm\{base\}\}\(u^\{\\prime\},\\phi\)\+\\delta\_\{i\}^\{\(r\)\}\(T,P\),\(11\)
whereaibase\(u′,ϕ\)a\_\{i\}^\{\\mathrm\{base\}\}\(u^\{\\prime\},\\phi\)andbibase\(u′,ϕ\)b\_\{i\}^\{\\mathrm\{base\}\}\(u^\{\\prime\},\\phi\)denote the coefficients obtained at a reference thermodynamic condition, andδi\(T,P\)\\delta\_\{i\}\(T,P\)represents a thermodynamic correction term that captures the deviation of each case from this base trend\. This formulation enables the model to systematically account for differences between cases with varying temperature and pressure, thereby improving consistency across the dataset\.
This hierarchical construction enables the generation of a continuous and computationally efficient Lo\-Fi representation ofA3DA\_\{3D\}andr3Dr\_\{3D\}across the parameter space, which is subsequently used within the MuFiNNs framework\.
Fig\. 3:MuFiNNs predictions of the flame surface areaA3D\(t\)A\_\{3D\}\(t\)for methane \(CH4\) cases \(I–IV\) across different turbulence intensitiesu′u^\{\\prime\}\. Each column corresponds to a distinct operating condition, while the rows represent training \(top\), interpolation \(middle\), and extrapolation \(bottom\) assessments\. Light\-colored markers denote individual experimental realizations, and solid markers indicate their corresponding mean values used as high\-fidelity \(Hi\-Fi\) training targets\. Solid lines represent MuFiNNs predictions\. In the interpolation and extrapolation rows, filled markers indicate training data and hollow markers denote held\-out data, while dashed curves correspond to model predictions for unseen turbulence intensity levels\.Figure[3](https://arxiv.org/html/2605.08232#S3.F3)presents the MuFiNNs predictions for the geometric evolution of the turbulent flame, characterized by the surface areaA3D\(t\)A\_\{3D\}\(t\), across the methane \(CH4\) cases \(i–iv\)\. The results are organized into three rows corresponding to training \(top\), interpolation \(middle\), and extrapolation \(bottom\) assessments\.
In the top row, the MuFiNNs model is trained using the available high\-fidelity \(Hi\-Fi\) data\. For each operating condition, three independent experimental measurements are available, shown as light\-colored markers\. The corresponding mean values, indicated by solid markers, are used as the Hi\-Fi targets during training\. As observed, the MuFiNNs predictions closely match the training data across all turbulence intensity levels, accurately capturing both the nonlinear temporal evolution and the dependence on turbulence intensity\. The agreement between the predicted curves and the Hi\-Fi data demonstrates that the model successfully learns the underlying geometric growth behavior of the flame despite the limited number of training samples\.
To further evaluate the generalization capability of the model, an interpolation study is performed, as shown in the middle row of Fig\.[3](https://arxiv.org/html/2605.08232#S3.F3)\. In this step, selected intermediate turbulence intensity levels are excluded from the training process\. Specifically, for each case, two intermediateu′u^\{\\prime\}conditions are removed entirely, and the model is trained only on the remaining data\. The dashed curves represent the MuFiNNs predictions for these unseen conditions\. Despite the absence of these data during training, the model is able to reconstruct the corresponding flame evolution with high accuracy\. The predicted trends remain consistent with the experimental measurements, indicating that the MuFiNNs framework successfully captures the continuous dependence of flame geometry on turbulence intensity and can reliably interpolate within the observed parameter space\. However, in certain cases \(e\.g\., CH4\-I atu′=0\.5u^\{\\prime\}=0\.5m/s\), where the experimental data exhibit slight deviations from the overall trend of the dataset, the MuFiNNs predictions may not fully reproduce these local inconsistencies\. This behavior is expected, as the model is trained to learn the dominant underlying trend rather than isolated experimental fluctuations\. Nevertheless, the overall agreement remains strong, and the predicted evolution continues to follow the physically consistent trend of the system\.
In the bottom row, the extrapolation capability of the model is examined\. In particular, the model is tasked with predicting flame evolution at higher turbulence intensity levels that lie outside the range of the training data\. This regime is of particular interest, as high\-u′u^\{\\prime\}conditions are experimentally challenging due to increased flame instability, measurement uncertainty, and limitations in optical diagnostics, as also reported in previous experimental studies\. Despite these challenges, the MuFiNNs predictions remain physically consistent and follow the expected growth trends\. Although minor deviations from the experimental data are observed at the highest turbulence levels, the overall agreement remains strong, demonstrating that the model can generalize beyond the training domain while preserving the correct qualitative and quantitative behavior\.
Fig\. 4:MuFiNNs prediction of the flame surface areaA3D\(t\)A\_\{3D\}\(t\)for the excluded methane case CH4\-II across different turbulence intensitiesu′u^\{\\prime\}\. Symbols denote high\-fidelity experimental measurements, while dashed lines represent model predictions obtained without using any data from this operating condition during training\.To further assess the ability of the MuFiNNs framework to generalize across unseen thermodynamic conditions, an additional test is performed by entirely excluding one operating condition \(CH4\-II\) from the training dataset\. This case is selected deliberately, as it shares the same pressure level with CH4\-I and a similar temperature range with CH4\-III, allowing the model to be exposed to partial thermodynamic information without directly observing this specific combination\.
The model is then trained using the remaining cases and tasked with predicting the flame surface evolution for the excluded condition\. The corresponding results are shown in Fig\.[4](https://arxiv.org/html/2605.08232#S3.F4)\. As observed, although the predictions do not exactly match the experimental measurements, the model is able to capture the correct qualitative behavior, including the growth trend and the overall magnitude ofA3D\(t\)A\_\{3D\}\(t\)\.
This result highlights both the capability and the limitation of the current framework\. On one hand, the MuFiNNs model successfully learns a structured representation of the flame dynamics, enabling it to generalize across partially unseen thermodynamic combinations\. On the other hand, due to the limited number of available operating conditions in the dataset—specifically, only two distinct pressure and temperature levels—the model is constrained in its ability to fully reconstruct the precise quantitative response for completely unseen thermodynamic states\.
Consequently, while the model can reliably capture the dominant trends and provide reasonable estimates within the explored parameter space, accurate prediction outside the sampled thermodynamic range requires a denser coverage of temperature and pressure conditions\. This observation underscores the importance of incorporating a broader range of operating conditions in future datasets to further enhance the predictive capability of the multi\-fidelity framework\.
A similar analysis is performed for the hydrogen \(H2\) cases \(V and VI\), yielding comparable predictive performance\. However, since these cases vary only in equivalence ratio while sharing the same temperature and pressure, the learning problem is significantly simpler and does not require a hierarchical approach\.
Accordingly, the MuFiNNs model is trained only with respect to turbulence intensityu′u^\{\\prime\}, corresponding to the first stage of the hierarchical framework, and achieves accurate predictions across all conditions\. Therefore, the methane cases—featuring variations in equivalence ratio, temperature, and pressure—are adopted as the primary benchmark, providing a more comprehensive and challenging test of the proposed framework\.
For the final hydrogen case, corresponding to quiescent conditions \(u′=0u^\{\\prime\}=0\) with varying pressure, a simplified low\-fidelity construction is adopted\. Unlike the methane cases, where the flame evolution depends on multiple varying parameters and requires a hierarchical treatment, the present case involves only pressure variation under otherwise fixed conditions\. As a result, the Lo\-Fi representation is generated using a simple linear regression of the flame radius with time at each available pressure level, i\.e\.,r\(t\)≈a\(P\)t\+b\(P\)r\(t\)\\approx a\(P\)\\,t\+b\(P\)\. The pressure dependence of the linear coefficients is then inferred from the available training pressures and used to construct synthetic low\-fidelity trends for unseen pressure conditions\. This provides a computationally inexpensive baseline representation, upon which the multi\-fidelity correction is learned\.
To assess the predictive capability of the framework under unseen pressure conditions, three hold\-out studies are performed, as illustrated in Fig\.[5](https://arxiv.org/html/2605.08232#S3.F5)\. In the first case \[Fig\.[5](https://arxiv.org/html/2605.08232#S3.F5)\(a\)\], two intermediate pressure levels are excluded from training in order to evaluate interpolation within the available pressure range\. In the second case \[Fig\.[5](https://arxiv.org/html/2605.08232#S3.F5)\(b\)\], the lowest pressure condition \(1 bar\) is excluded together with one additional intermediate pressure level \(7 bar\), allowing simultaneous assessment of interpolation and extrapolation toward the lower\-pressure boundary of the domain\. In the third case \[Fig\.[5](https://arxiv.org/html/2605.08232#S3.F5)\(c\)\], the 3 bar and 10 bar conditions are removed, such that the model is required to extrapolate toward the highest pressure level\. This final test is of particular importance, as generating reliable experimental data at elevated pressures is considerably more challenging in practice, and the ability to accurately predict flame evolution in this regime is therefore of significant value\.
\\begin\{overpic\}\[width=433\.62pt\]\{r\_mean\_holdout\_3bar\_5bar\.png\} \\put\(1\.0,85\.0\)\{\{\(a\)\}\} \\end\{overpic\}
\\begin\{overpic\}\[width=433\.62pt\]\{r\_mean\_holdout\_1bar\_7bar\.png\} \\put\(1\.0,85\.0\)\{\{\(b\)\}\} \\end\{overpic\}
\\begin\{overpic\}\[width=433\.62pt\]\{r\_mean\_holdout\_10bar\_3bar\.png\} \\put\(1\.0,85\.0\)\{\{\(c\)\}\} \\end\{overpic\}
Fig\. 5:Multi\-fidelity predictions of the flame radius evolutionr\(t\)r\(t\)under varying chamber pressure, demonstrating the interpolation and extrapolation capability of the model\. Filled markers indicate training data, while hollow markers denote held\-out data\. Solid curves correspond to MFNN predictions for pressures included in training, and dashed curves represent predictions for unseen \(held\-out\) pressure conditions\. \(a\) Interpolation test, where intermediate pressures \(3 and 5 bar\) are excluded from training\. \(b,c\) Combined interpolation–extrapolation tests, where selected pressure levels are removed from training, requiring prediction both within and beyond the range of the available data\.As shown by the results, the proposed framework accurately captures the pressure\-dependent trends in flame\-radius evolution across all scenarios\. Both interpolation and extrapolation cases show excellent agreement with the experimental data\. In particular, the predictions at high pressure remain physically consistent and reasonably accurate despite the absence of training data in this regime\. These findings demonstrate that even with a simple regression\-based Lo\-Fi model, the MuFiNNs framework can reliably predict system behavior under experimentally challenging conditions\.
### 3\.2Turbulent Burning Velocity:utmu\_\{tm\}
The turbulent burning velocityutmu\_\{tm\}is examined as a key quantity describing the propagation of the turbulent flame\. To enable consistent comparison across different operating conditions, the burning velocity is analyzed as a function of the flame radius, i\.e\.,utm\(r\)u\_\{tm\}\(r\), for all cases\.
For the four cases CH4\-II, CH4\-III, H2\-V, and H2\-VI, the high\-fidelity \(Hi\-Fi\) turbulent mass burning velocityutmu\_\{tm\}is computed directly from the measured pressure traces using a thermodynamic fractional burning\-rate formulation\. Prior to this calculation, the raw pressure signals are denoised using a two\-stage Savitzky–Golay filter and subsequently downsampled to improve the numerical stability of the time derivativedP/dtdP/dt\. To avoid post\-peak oscillations and non\-physical artifacts, the signals are truncated at the peak pressurePfP\_\{f\}, retaining only the monotonic rising branch\.
In the implemented form,
utm=\(P0P\)1γu\{1−\(P0P\)1γu\[Pf−PPf−P0\]\}−23R03\(Pf−P0\)dPdt,u\_\{tm\}=\\left\(\\frac\{P\_\{0\}\}\{P\}\\right\)^\{\\frac\{1\}\{\\gamma\_\{u\}\}\}\\left\\\{1\-\\left\(\\frac\{P\_\{0\}\}\{P\}\\right\)^\{\\frac\{1\}\{\\gamma\_\{u\}\}\}\\left\[\\frac\{P\_\{f\}\-P\}\{P\_\{f\}\-P\_\{0\}\}\\right\]\\right\\\}^\{\-\\frac\{2\}\{3\}\}\\;\\frac\{R\_\{0\}\}\{3\(P\_\{f\}\-P\_\{0\}\)\}\\frac\{dP\}\{dt\},\(12\)
whereP0P\_\{0\}andPfP\_\{f\}denote the initial and peak pressures, respectively,γu\\gamma\_\{u\}is the specific heat ratio of the unburned gas, andR0R\_\{0\}is the combustion chamber radius\. The same formulation provides the corresponding flame radius as
rm=R0\{1−\(P0P\)1γu\[Pf−PPf−P0\]\}13\.r\_\{m\}=R\_\{0\}\\left\\\{1\-\\left\(\\frac\{P\_\{0\}\}\{P\}\\right\)^\{\\frac\{1\}\{\\gamma\_\{u\}\}\}\\left\[\\frac\{P\_\{f\}\-P\}\{P\_\{f\}\-P\_\{0\}\}\\right\]\\right\\\}^\{\\frac\{1\}\{3\}\}\.\(13\)
Thus, bothutmu\_\{tm\}andrrare obtained consistently within the same pressure\-based thermodynamic framework\. The resulting\(r,utm\)\(r,u\_\{tm\}\)pairs are sorted with respect torrand restricted to the analysis window used for training\. A second Savitzky–Golay smoothing is then applied to obtain the final Hi\-Fi target curvesutm\(r\)u\_\{tm\}\(r\)\. While both the merged raw measurements and the smoothed signals are retained for diagnostic purposes, only the smoothed curves are used as training targets in the MuFiNNs framework\.
The low\-fidelity \(Lo\-Fi\) dataset is synthetically generated using a structured global trend model inferred from the Hi\-Fi training subset via linear regression\. At each turbulence intensity levelu′u^\{\\prime\}, the dependence ofutmu\_\{tm\}on the flame radiusrris approximated by a linear relation,utm≈α\(u′\)r\+β\(u′\)u\_\{tm\}\\approx\\alpha\(u^\{\\prime\}\)\\,r\+\\beta\(u^\{\\prime\}\)\. The resulting coefficients are then regressed as smooth functions ofu′u^\{\\prime\}to capture the global dependence on turbulence intensity\.
Using these parametric representations, a dense Lo\-Fi dataset is constructed over a structured\(r,u′\)\(r,u^\{\\prime\}\)grid, yielding a smooth and computationally inexpensive approximation of the dominant trends inutm\(r,u′\)u\_\{tm\}\(r,u^\{\\prime\}\)across the parameter space\.
For turbulent burning velocity prediction, the MuFiNNs architecture takes the flame radiusrrand turbulence intensityu′u^\{\\prime\}as inputs, and predicts the corresponding turbulent burning velocityutmu\_\{tm\}\. The predictive capability of the proposed MuFiNNs framework is evaluated across all considered cases\. Representative results for Case H2\-V are presented in Fig\.[6](https://arxiv.org/html/2605.08232#S3.F6), which illustrates three training scenarios designed to assess interpolation and extrapolation performance across turbulence intensity levels\.
\\begin\{overpic\}\[width=433\.62pt\]\{H2\-V\-1\.png\} \\put\(1\.0,85\.0\)\{\{\(a\)\}\} \\end\{overpic\}
\\begin\{overpic\}\[width=433\.62pt\]\{H2\-V\-2\.png\} \\put\(1\.0,85\.0\)\{\{\(b\)\}\} \\end\{overpic\}
\\begin\{overpic\}\[width=433\.62pt\]\{H2\-V\-3\.png\} \\put\(1\.0,85\.0\)\{\{\(c\)\}\} \\end\{overpic\}
Fig\. 6:Multi\-fidelity prediction of the turbulent mass burning velocityutm\(r\)u\_\{tm\}\(r\)for Case H2\-V\. Faint scattered symbols represent the raw pressure\-derived measurements, while filled circular markers denote the processed \(smoothed\) high\-fidelity data used for supervision\. Continuous curves correspond to MuFiNNs predictions\. Open markers indicate turbulence intensity levels excluded from training \(held\-out cases\), used to assess interpolation and extrapolation performance\. Solid curves denote MuFiNNs predictions\. \(a\) Interpolation test, where two intermediate turbulence intensity levels are excluded from training\. \(b\) Combined interpolation–extrapolation test, where both an intermediate and the lowest turbulence intensity levels are held out\. \(c\) Extrapolation test, where the highest turbulence intensity level is excluded from training\.In the first configuration \[Fig\.[6](https://arxiv.org/html/2605.08232#S3.F6)\(a\)\], two intermediate turbulence intensity levels are excluded from the training dataset to evaluate interpolation capability\. In this setting, the MuFiNNs model reconstructs the held\-out conditions using only neighboring Hi\-Fi data together with the underlying low\-fidelity trend\. The predicted curves remain in strong agreement with the experimental measurements, indicating that the model effectively captures the continuous dependence ofutmu\_\{tm\}on bothrrandu′u^\{\\prime\}\.
To evaluate interpolation and extrapolation simultaneously, both an intermediate and the lowest turbulence intensity levels are excluded from the training dataset, as shown in Fig\.[6](https://arxiv.org/html/2605.08232#S3.F6)\(b\)\. In this case, the model is required to infer missing conditions both within and near the boundary of the training domain\. The predictions continue to follow the experimental trends closely, demonstrating robust generalization across partially observed regimes\.
The extrapolation capability of the model is examined in Fig\.[6](https://arxiv.org/html/2605.08232#S3.F6)\(c\) by withholding the highest turbulence intensity level during training\. This scenario is particularly important from a practical perspective, as experimental measurements at high turbulence intensities are often difficult or costly to obtain\. Despite the absence of direct supervision at highu′u^\{\\prime\}, the MuFiNNs predictions successfully capture both the overall trend and magnitude ofutmu\_\{tm\}\. Notably, even though the Hi\-Fi data exhibit increased scatter and a less well\-defined trend—particularly at smaller radii—the model remains stable and produces physically consistent predictions\. This demonstrates that the combination of Lo\-Fi guidance and nonlinear HF correction enables reliable extrapolation even in regimes with noisy or weakly structured data\.
## 4Conclusions
In this work, a hierarchical multi\-fidelity neural network \(MuFiNNs\) framework was developed for predictive modeling of turbulent premixed flames under limited high\-fidelity data availability\. By combining sparse experimental measurements with structured low\-fidelity representations, the framework captures coupled flame geometry and turbulent burning dynamics while preserving dominant physical trends and correcting nonlinear discrepancies through high\-fidelity learning\. In particular, the hierarchical formulation proved effective when the dimensionality of the parameter space exceeded what could be supported by the available experimental data, enabling stable learning across thermodynamic and turbulence\-dependent operating conditions\.
The results demonstrate accurate reconstruction of high\-fidelity measurements, reliable interpolation across unseen conditions, and meaningful extrapolation beyond the training domain, including experimentally challenging regimes involving elevated turbulence and pressure\. Importantly, the framework remained effective even when trained on noisy or weakly structured data, highlighting the potential of multi\-fidelity learning to extract predictive physical behavior from sparse experiments where purely data\-driven approaches may be unreliable\. More broadly, these findings establish hierarchical multi\-fidelity learning as a practical strategy for experimental combustion modeling in data\-limited regimes and suggest a pathway for integrating sparse experiments with structured surrogate physics in reactive\-flow prediction\. At the same time, quantitative extrapolation across entirely unseen thermodynamic combinations remains limited by the sparse coverage of the current dataset, emphasizing the need for broader pressure–temperature sampling in future studies\.
Although regression\-based approximations were used here to generate low\-fidelity representations, the framework is general and can naturally incorporate reduced\-order or physics\-based models when available, offering a route toward improved predictive fidelity and interpretability\. This generality makes the approach promising for more complex reactive\-flow configurations and broader scientific machine\-learning applications in combustion\.
## Declaration of competing interest
The authors declare no competing interests\.
## Acknowledgments
SZ and SJ would like to acknowledge support from DoD MURI Award N00014\-23\-1\-2499\. YX and JY would like to thank EPSRC \(Grant No\. EP/W002299/1\) for financial support\.
## ReferencesSimilar Articles
Uncertainty-aware Multi-fidelity Closure via Conditional Normalizing Flows
This paper proposes an uncertainty-aware multi-fidelity framework based on conditional normalizing flows to improve the predictive accuracy of reduced-order models (ROMs) for complex multiscale systems. The method learns a probabilistic mapping from low-fidelity to high-fidelity coefficients and is demonstrated on a vortex merging problem, showing improved accuracy with uncertainty quantification.
Full-field prediction for engineering-scale three-dimensional aircraft with multigrid-hierarchical learning
Proposes MHLF, a multigrid-hierarchical learning framework that accelerates engineering-scale 3D aircraft CFD simulations by 3-8x while preserving high-fidelity accuracy across subsonic, transonic, and supersonic regimes.
Finite Volume-Informed Neural Network Framework for 2D Shallow Water Equations: Rugged Loss Landscapes and the Importance of Data Guidance
This paper introduces 'Data-Guided FVM-PINN', a framework using finite-volume losses for 2D shallow water equations, demonstrating that sparse data guidance is crucial to prevent network collapse in rugged loss landscapes.
Benchmarking Machine Learning Uncertainty Quantification Methodologies for Predicting Turbine Gas Temperature Degradation
This paper benchmarks five uncertainty quantification methods for neural network predictions of turbine gas temperature, evaluating trade-offs in coverage, width, and stability to guide prognostics and health management in engines.
Improvise, Adapt, Overcome: An On-The-Fly Multifidelity Algorithm for Efficient Machine Learning
This paper introduces an adaptive on-the-fly multifidelity machine learning algorithm for quantum chemistry that autonomously determines training data composition across fidelities, reducing data generation costs by up to 30x compared to single-fidelity methods and up to 5x compared to standard multifidelity methods.