Low-Cost High-Order Singular Value Decomposition for Tensor-Based Reconstruction from Sparse Sensor Measurements: Urban Flow and Air-Quality Applications

arXiv cs.LG 06/25/26, 04:00 AM Papers
tensor-decomposition sparse-sensing urban-flow air-quality reduced-order-modeling svd hosvd
Summary
This paper introduces low-cost High-Order Singular Value Decomposition (lcHOSVD), a tensor-based method for reconstructing high-dimensional environmental fields from sparse sensor measurements. Applied to urban flow and air-quality datasets, it achieves lower reconstruction errors and greater robustness to uneven sensor distributions compared to matrix-based approaches.
arXiv:2606.24989v1 Announce Type: new Abstract: Urban flow and air-quality simulations generate high-dimensional datasets describing velocity and pollutant transport across multiple spatial, temporal, and physical-variable dimensions. Reconstructing these fields from sparse sensor measurements is a fundamental challenge in environmental monitoring, digital twins, forecasting, and data assimilation. Existing low-cost reconstruction approaches are commonly based on matrix decompositions, which require multidimensional datasets to be flattened into two-dimensional snapshot matrices, thereby discarding important structural information. This work introduces the low-cost High-Order Singular Value Decomposition (lcHOSVD), a novel tensor-based sparse-sensing reconstruction framework for high-dimensional environmental fields. To the authors' knowledge, this is the first methodology that combines sparse sensing and HOSVD for field reconstruction. Unlike matrix-based approaches, lcHOSVD preserves the natural tensor structure of the data, enabling the exploitation of correlations across spatial, temporal, and physical-variable dimensions while substantially reducing the computational requirements of conventional HOSVD. The methodology is applied to urban flow and air-quality datasets, where three-dimensional velocity and pollutant concentration fields are reconstructed using only 1-4% of the available spatial locations. While lcSVD provides larger computational speed-ups, lcHOSVD consistently achieves lower reconstruction errors in configurations characterized by strong multidimensional coupling and heterogeneous dynamics across dimensions. Additional sensor-anisotropy analyses demonstrate that the tensor formulation is significantly more robust to uneven sensor distributions, a common situation in practical environmental monitoring networks.
Original Article
View Cached Full Text
Cached at: 06/25/26, 05:09 AM
# Low-Cost High-Order Singular Value Decomposition for Tensor-Based Reconstruction from Sparse Sensor Measurements: Urban Flow and Air-Quality Applications
Source: [https://arxiv.org/html/2606.24989](https://arxiv.org/html/2606.24989)
Arindam SenguptaETSI Aeronáutica y del Espacio, Universidad Politécnica de Madrid, Plaza Cardenal Cisneros, 3, Madrid, 28040, SpainCorresponding authors:a\.sengupta@upm\.es\(Arindam Sengupta\),soledad\.leclainche@upm\.es\(Soledad Le Clainche\) Co\-authors:paul\.jeanney@arup\.com rvinuesa@umich\.edu josemiguel\.perez@upm\.esPaul JeanneyETSI Aeronáutica y del Espacio, Universidad Politécnica de Madrid, Plaza Cardenal Cisneros, 3, Madrid, 28040, SpainOve Arup, C\. de Alfonso XI, 12, Retiro, Madrid, 28014, SpainRicardo VinuesaJose Miguel PérezETSI Aeronáutica y del Espacio, Universidad Politécnica de Madrid, Plaza Cardenal Cisneros, 3, Madrid, 28040, SpainSoledad Le ClaincheETSI Aeronáutica y del Espacio, Universidad Politécnica de Madrid, Plaza Cardenal Cisneros, 3, Madrid, 28040, SpainCorresponding authors:a\.sengupta@upm\.es\(Arindam Sengupta\),soledad\.leclainche@upm\.es\(Soledad Le Clainche\) Co\-authors:paul\.jeanney@arup\.com rvinuesa@umich\.edu josemiguel\.perez@upm\.es

###### Abstract

Urban flow and air\-quality simulations generate high\-dimensional datasets describing velocity and pollutant transport across multiple spatial, temporal, and physical\-variable dimensions\. Reconstructing these fields from sparse sensor measurements is a fundamental challenge in environmental monitoring, digital twins, forecasting, and data assimilation\. Existing low\-cost reconstruction approaches are commonly based on matrix decompositions, which require multidimensional datasets to be flattened into two\-dimensional snapshot matrices, thereby discarding important structural information\. This work introduces the low\-cost High\-Order Singular Value Decomposition \(lcHOSVD\), a novel tensor\-based sparse\-sensing reconstruction framework for high\-dimensional environmental fields\. To the authors’ knowledge, this is the first methodology that combines sparse sensing and HOSVD for field reconstruction\. Unlike matrix\-based approaches, lcHOSVD preserves the natural tensor structure of the data, enabling the exploitation of correlations across spatial, temporal, and physical\-variable dimensions while substantially reducing the computational requirements of conventional HOSVD\. The methodology is applied to urban flow and air\-quality datasets, where three\-dimensional velocity and pollutant concentration fields are reconstructed using only 1–4% of the available spatial locations\. A systematic comparison with low\-cost Singular Value Decomposition \(lcSVD\) is performed to assess the trade\-offs between matrix\- and tensor\-based formulations\. While lcSVD provides larger computational speed\-ups, lcHOSVD consistently achieves lower reconstruction errors in configurations characterized by strong multidimensional coupling and heterogeneous dynamics across dimensions\. Additional sensor\-anisotropy analyses demonstrate that the tensor formulation is significantly more robust to uneven sensor distributions, a common situation in practical environmental monitoring networks\. The proposed framework extends sparse\-sensing reduced\-order modelling to tensor representations and demonstrates significant potential for environmental monitoring, forecasting, digital twins, and data\-assimilation applications, where complete environmental fields must be estimated from limited observations\.

Keywords: Modal decomposition, lcSVD, lcHOSVD, urban flows, pollutant concentration, reduced\-order modeling, sensors\.

## 1\. Introduction

Road traffic is one of the leading sources of NOxemissions in European cities, releasing nitrogen oxides directly at street level in densely populated areas\[[14](https://arxiv.org/html/2606.24989#bib.bib1),[13](https://arxiv.org/html/2606.24989#bib.bib2)\]\. Particulate matter \(PM2\.5\) from traffic and combustion sources compounds the problem, exposing 96% of the EU urban population to concentrations above WHO guideline levels\[[13](https://arxiv.org/html/2606.24989#bib.bib2)\]\. How these pollutants accumulate at street level is governed not only by emission rates but also by the surrounding urban geometry and the local wind field\. High\-fidelity computational fluid dynamics \(CFD\) can simulate this behaviour, resolving the velocity field, turbulent kinetic energy, and pollutant transport together across domains that range from individual buildings to entire urban districts\[[4](https://arxiv.org/html/2606.24989#bib.bib7),[40](https://arxiv.org/html/2606.24989#bib.bib9)\]\. The resulting datasets are no longer simple scalar snapshots, but they are multi\-variable, three\-dimensional, and, in the time\-resolved case, extremely large\. Modern fluid or urban flow databases, whether time\-resolved particle image velocimetry \(PIV\) measurements\[[15](https://arxiv.org/html/2606.24989#bib.bib3),[32](https://arxiv.org/html/2606.24989#bib.bib4)\], large\-eddy simulations \(LES\) of reactive flows\[[5](https://arxiv.org/html/2606.24989#bib.bib5),[42](https://arxiv.org/html/2606.24989#bib.bib6)\], and high\-fidelity CFD simulations\[[4](https://arxiv.org/html/2606.24989#bib.bib7),[6](https://arxiv.org/html/2606.24989#bib.bib8),[40](https://arxiv.org/html/2606.24989#bib.bib9)\], all produce datasets whose sheer size renders direct manipulation computationally expensive\. While these databases contain rich physical information about the dominant flow patterns and pollutant pathways, their dimensionality creates bottlenecks in terms of storage, computational cost, and real\-time applicability\.

Modal decomposition methods have long offered a principled response to these challenges\. Proper Orthogonal Decomposition \(POD\)/ Singular Value Decomposition \(SVD\)\[[16](https://arxiv.org/html/2606.24989#bib.bib10),[28](https://arxiv.org/html/2606.24989#bib.bib11)\]remains the most widely used technique for flow analysis, as it provides the low\-rank approximation of the data\. Its applications span turbulence analysis, flow reconstruction, predictions, and the construction of low\-dimensional dynamical models\[[3](https://arxiv.org/html/2606.24989#bib.bib12),[20](https://arxiv.org/html/2606.24989#bib.bib13),[1](https://arxiv.org/html/2606.24989#bib.bib14)\]\. In urban flow studies, Liu et al\.\[[27](https://arxiv.org/html/2606.24989#bib.bib15)\]applied POD to large\-eddy simulation data over real urban morphology and demonstrated that a small number of modes is sufficient to capture the dominant wind patterns, while Xiang et al\.\[[47](https://arxiv.org/html/2606.24989#bib.bib16)\]built a non\-intrusive reduced\-order model for urban airflow with dynamic boundary conditions by coupling POD with regression techniques, achieving good agreement with full CFD solutions at a fraction of the computational cost\. Xiao et al\.\[[48](https://arxiv.org/html/2606.24989#bib.bib23)\]developed a non\-intrusive reduced\-order model for turbulent urban flows using Gaussian process regression combined with POD, demonstrating predictions several orders of magnitude faster than the full LES solver\. Dynamic Mode Decomposition \(DMD\)\[[39](https://arxiv.org/html/2606.24989#bib.bib17)\]is another commonly used modal decomposition method, which extends modal analysis by associating each mode with a specific temporal frequency, making it particularly effective for identifying coherent flow structures in unsteady and turbulent problems\.

Modal decomposition techniques have been widely used to identify dominant flow structures and develop reduced\-order predictive models\[[10](https://arxiv.org/html/2606.24989#bib.bib22),[41](https://arxiv.org/html/2606.24989#bib.bib25),[46](https://arxiv.org/html/2606.24989#bib.bib26)\]\. Urban flow datasets typically comprise velocity and scalar fields defined over large three\-dimensional domains, where the flow behaviour is strongly influenced by building layouts and street\-canyon interactions\. Modal decomposition methods can capture the principal dynamics of these urban wind fields while significantly reducing the dimensionality of the data\[[48](https://arxiv.org/html/2606.24989#bib.bib23),[27](https://arxiv.org/html/2606.24989#bib.bib15),[47](https://arxiv.org/html/2606.24989#bib.bib16)\]\. More broadly, Masoumi\-Verki et al\.\[[31](https://arxiv.org/html/2606.24989#bib.bib24)\]reviewed recent developments in reduced\-order modelling for urban airflow and pollutant dispersion, identifying reduced\-order models as the practical alternative to CFD simulations and noting that the high computational cost of full CFD simulations prevents their use in near real\-time and long\-term applications\.

Despite their success, conventional POD and DMD approaches typically require the data to be arranged into a two\-dimensional snapshot matrix, obtained by flattening spatial dimensions and physical variables into a single vector representation\. Although this reshaping preserves the underlying information, it does not explicitly exploit the inherent multiway structure of the data\. As a result, spatial directions and physical variables are treated collectively within a single index, preventing the direct identification of independent modal bases associated with each dimension and variable\. Tensor\-based decompositions provide a direct solution by representing the data as a multiway array and decomposing it along each mode independently\. These methods preserve the directional structure of the flow and enable mode bases to be computed separately for each spatial direction and variable\. Among them, the High\-Order SVD \(HOSVD\)\[[7](https://arxiv.org/html/2606.24989#bib.bib19),[8](https://arxiv.org/html/2606.24989#bib.bib20)\]provides a multilinear generalization of the matrix SVD that extends naturally to tensors\. In the Tucker decomposition form\[[24](https://arxiv.org/html/2606.24989#bib.bib21)\], HOSVD produces a core tensor and a set of orthonormal factor matrices, one per mode, enabling independent rank selection along each axis\. These properties make HOSVD attractive for fluid dynamics datasets, where the dominant complexity may differ significantly between, say, the streamwise and wall\-normal directions\. Also, Higher\-Order Dynamic Mode Decomposition \(HODMD\) was later introduced by Le Clainche & Vega\[[26](https://arxiv.org/html/2606.24989#bib.bib18)\]by generalising the snapshot matrix to includeddconsecutive delayed time steps, enabling the identification of dynamics that a single\-step decomposition cannot resolve\.

Although tensor\-based decompositions offer a structurally richer representation of multi\-dimensional flow data, their practical use is often limited by the high computational cost of full tensor decompositions\. Computing HOSVD requires performing a large matrix SVD along each unfolding of the tensor, and as the spatial resolution or number of variables grows, this cost accumulates rapidly across modes\. For the kind of high\-resolution urban or reactive flow datasets increasingly common in research, even a single unfolding may involve matrices too large and require high\-capacity resources\.

A complementary strategy to reduce computational overhead comes from using sparse measurements\. Instead of decomposing the full dataset, the decomposition can be performed on a reduced subset of the data, and then the full solution can be reconstructed\[[19](https://arxiv.org/html/2606.24989#bib.bib27)\]\. In lcSVD, SVD is applied to a reduced snapshot matrix formed by selecting a small number of spatial locations, either randomly, equidistantly, or via optimal sensor placement\[[29](https://arxiv.org/html/2606.24989#bib.bib28),[9](https://arxiv.org/html/2606.24989#bib.bib29)\], and the full\-resolution spatial modes and temporal coefficients are then recovered by projecting back onto the original spatial domain\. Hetherington & Le Clainche\[[19](https://arxiv.org/html/2606.24989#bib.bib27)\]introduced and validated the method on a range of test cases spanning laminar and turbulent flows in two and three dimensions, reporting speed\-up factors of up to 630 times compared to classical SVD and memory reductions of approximately 37%, with reconstruction errors comfortably below 5% for laminar configurations\. A different route is taken by Nav et al\.\[[33](https://arxiv.org/html/2606.24989#bib.bib46)\], who reconstructed sparse\-sensor wind fields through a hierarchical, learning\-based pipeline\. A single sparse\-to\-fine inversion is replaced by a sequence of coarse\-to\-fine resolution upgrades\. Each learned with an LSTM in a POD coefficient space, with sensors placed by QR pivoting\. The two strategies differ mainly in how the unobserved information is restored\. lcSVD obtains the full field in one projection step from the sampled modes\. In contrast, the hierarchical method restores it progressively through trained surrogates that must be calibrated offline for each resolution transition and scenario\. lcSVD relies on a single modal basis\. In contrast, the hierarchical approach maintains a separate POD basis and a learned mapping at every level\. Recent diffusion\-based methods have also explored sparse reconstruction from a different perspective\. Diff\-SPORT\[[44](https://arxiv.org/html/2606.24989#bib.bib47)\]combines a conditional generative diffusion model for turbulent\-flow reconstruction with an explainable\-deep\-learning strategy for optimal sensor placement, where the sensor locations are identified from the features the network itself deems most informative\. This pairing of generative reconstruction with interpretable placement further highlights the growing interest in sparse sensing for urban\-flow applications\.

Pillai et al\.\[[38](https://arxiv.org/html/2606.24989#bib.bib30)\]applied lcSVD to reactive combustion databases, demonstrating its capacity to reconstruct POD modes from sparse sensors and merge heterogeneous numerical and experimental datasets\. For the laminar coflow flame case, reconstruction accuracy remained within 1% for the dominant species, while the method ran more than ten times faster than standard SVD and reduced the data volume by a factor exceeding 2000\. The results presented confirmed that the low\-cost framework holds up even under strong multivariate coupling and experimental noise\. The lcSVD framework was subsequently extended to data assimilation\. Jeanney et al\.\[[23](https://arxiv.org/html/2606.24989#bib.bib31)\]applied low\-cost Singular Value Decomposition \(lcSVD\) within a data assimilation framework for fluid dynamics\. Their study showed that reduced\-resolution computations, combined with lcSVD\-based reconstruction, can substantially reduce computational time and memory usage compared to the traditional methods\. Just like SVD, lcSVD is effective at reducing computational cost, but it does not preserve the structure of the data\. A tensor\-based low\-cost approach would address this issue by retaining the structural advantages of HOSVD and operating at a reduced cost enabled by sparse sampling\.

This paper introduces the low\-cost High\-Order SVD \(lcHOSVD\) framework to address this gap\. Building on the theoretical foundations of HOSVD and the practical advances demonstrated by lcSVD, the proposed method performs the full decomposition directly from sparse sensor observations, recovering the complete mode bases and core tensor through projection onto the original domain\. In this way, the computational cost of conventional HOSVD is substantially reduced while its representation is preserved, with independent mode bases along each spatial direction and physical variable\. To the authors’ knowledge, this is the first methodology combining sparse sensing and HOSVD for field reconstruction, and the first application of low\-cost tensor decomposition techniques to urban flow and pollutant transport datasets\. Depending on the test case and on the decay of the singular values, the method can be applied to each variable individually or to the tensor as a whole\. A systematic comparison between lcSVD and lcHOSVD is carried out to assess the trade\-offs between the matrix\- and tensor\-based formulations, identifying when the tensor structure provides a meaningful advantage in accuracy and under anisotropic sensor distributions, and extending sparse\-sensing reduced\-order modelling to tensor representations\. The framework is validated on two urban flow configurations of increasing geometric complexity: a single\-snapshot, multi\-variable dataset of the Vallecas urban district in Madrid, and a time\-resolved velocity dataset of the turbulent flow around two wall\-mounted buildings\.

The remainder of this paper is organized as follows\. Section[2](https://arxiv.org/html/2606.24989#S2)details the lcHOSVD formulation and its relationship to HOSVD and lcSVD\. Section[3](https://arxiv.org/html/2606.24989#S3)describes the datasets used for validation\. Section[4](https://arxiv.org/html/2606.24989#S4)presents reconstruction accuracy, computational performance, and a comparison between methods\. The main conclusions are drawn in Section[5](https://arxiv.org/html/2606.24989#S5)\.

## 2\. Methodology

This section describes the complete methodological pipeline adopted in the present work\. The data are first organised into a structured tensor representation that preserves the multiway nature of the flow fields\. Two low\-cost decomposition strategies are then applied, the matrix\-based lcSVD and the tensor\-based lcHOSVD introduced here\. Before decomposition, all variables are normalised \(Eq\. \([7](https://arxiv.org/html/2606.24989#S2.E7)\)\) to ensure comparability across physical quantities of different magnitudes\. The reconstruction quality and flow structure are assessed through the relative root mean square error \(RRMSE\) and the Q\-criterion\. A schematic overview of the full pipeline is shown in Fig\.[1](https://arxiv.org/html/2606.24989#S2.F1)\.

![Refer to caption](https://arxiv.org/html/2606.24989v1/method.png)Figure 1:Schematic overview of the methodology: from raw CFD data through tensor construction, normalisation, low\-cost decomposition \(lcSVD and lcHOSVD\), and reconstruction\.### 2\.1Data organisation

The datasets considered in this work consist of time\-resolved flow fields represented on structured spatial grids\. Following the classical snapshot approach, the data can be arranged as a collection of flow states,

𝑿=𝑽1K=\[𝑽1,𝑽2,…,𝑽k,𝑽k\+1,…,𝑽K\],\\boldsymbol\{X\}=\\boldsymbol\{V\}\_\{1\}^\{K\}=\[\\boldsymbol\{V\}\_\{1\},\\boldsymbol\{V\}\_\{2\},\\ldots,\\boldsymbol\{V\}\_\{k\},\\boldsymbol\{V\}\_\{k\+1\},\\ldots,\\boldsymbol\{V\}\_\{K\}\],\(1\)
where𝑽k\\boldsymbol\{V\}\_\{k\}denotes the complete flow field at time instanttkt\_\{k\}, andKKis the total number of snapshots\. In the conventional matrix formulation, each snapshot is reshaped into a column vector, producing a data matrix of dimensionsJ×KJ\\times K, whereJJdenotes the total number of spatial degrees of freedom\. For three\-dimensional flow fields containing multiple physical variables,

J=Tvar×Nx×Ny×Nz,J=T\_\{\\mathrm\{var\}\}\\,\\times N\_\{x\}\\times N\_\{y\}\\times N\_\{z\},\(2\)
whereTvarT\_\{\\mathrm\{var\}\}is the number of variables andNxN\_\{x\},NyN\_\{y\}, andNzN\_\{z\}are the numbers of grid points in the streamwise, wall\-normal, and spanwise directions, respectively\.

For three\-dimensional problems, the data are represented as a fifth\-order tensor\[[18](https://arxiv.org/html/2606.24989#bib.bib32)\]:

𝒱∈ℝTvar×Nx×Ny×Nz×K,\\mathcal\{V\}\\in\\mathbb\{R\}^\{T\_\{\\mathrm\{var\}\}\\times N\_\{x\}\\times N\_\{y\}\\times N\_\{z\}\\times K\},\(3\)
where the first index corresponds to the physical variables, the next three correspond to the spatial directions, and the final index contains the temporal snapshots\.

Each tensor entry is defined as:

𝒱i,j2,j3,j4,k=ϕi\(xj2,yj3,zj4,tk\),\\mathcal\{V\}\_\{i,j\_\{2\},j\_\{3\},j\_\{4\},k\}=\\phi\_\{i\}\\left\(x\_\{j\_\{2\}\},y\_\{j\_\{3\}\},z\_\{j\_\{4\}\},t\_\{k\}\\right\),\(4\)
wherei∈1,…,Tvari\\in\{1,\\ldots,T\_\{\\mathrm\{var\}\}\}indexes the physical variableϕi\\phi\_\{i\},j2j\_\{2\},j3j\_\{3\}, andj4j\_\{4\}denote the spatial locations in thexx,yy, andzzdirections, respectively, andkkdenotes the temporal snapshot\.

For flow datasets with three velocity components along streamwise \(uu\), normal \(vv\), and spanwise directions \(ww\), the tensor can be expressed as:

ϕ1=u,ϕ2=v,ϕ3=w,\\phi\_\{1\}=u,\\hskip 18\.49988pt\\phi\_\{2\}=v,\\hskip 18\.49988pt\\phi\_\{3\}=w,\(5\)
yielding a tensor of dimensions

𝒱∈ℝ3×Nx×Ny×Nz×K\.\\mathcal\{V\}\\in\\mathbb\{R\}^\{3\\times N\_\{x\}\\times N\_\{y\}\\times N\_\{z\}\\times K\}\.\(6\)

### 2\.2Normalisation

Urban flow databases typically contain variables with substantially different physical scales\. For example, velocity components expressed in m/s and pollutant concentrations expressed in kg/m3may differ by several orders of magnitude\. Without normalisation, variables with larger absolute values dominate the decomposition, and physically less energetic but equally important fields are effectively suppressed\. To ensure all variables contribute comparably to the modal analysis, each variableϕi\\phi\_\{i\}is normalised independently using min\-max scaling\[[37](https://arxiv.org/html/2606.24989#bib.bib33)\], given by:

ϕ~i=ϕi−min⁡\(ϕi\)max⁡\(ϕi\)−min⁡\(ϕi\),\\tilde\{\\phi\}\_\{i\}=\\frac\{\\phi\_\{i\}\-\\min\(\\phi\_\{i\}\)\}\{\\max\(\\phi\_\{i\}\)\-\\min\(\\phi\_\{i\}\)\},\(7\)
whereϕi\\phi\_\{i\}is the original field of theii\-th variable,min⁡\(ϕi\)\\min\(\\phi\_\{i\}\)andmax⁡\(ϕi\)\\max\(\\phi\_\{i\}\)are its global minimum and maximum over all spatial points and time instants, andϕ~i∈\[0,1\]\\tilde\{\\phi\}\_\{i\}\\in\[0,1\]is the resulting normalised field\. The inverse transformation is applied after reconstruction to recover the physical values\.

### 2\.3Low\-cost decomposition techniques

This section presents the two low\-cost decomposition strategies compared in this work, the matrix\-based lcSVD, introduced by Hetherington & Le Clainche\[[19](https://arxiv.org/html/2606.24989#bib.bib27)\], and the tensor\-based lcHOSVD proposed here\. Figure[2](https://arxiv.org/html/2606.24989#S2.F2)illustrates the low\-cost pipeline\.

![Refer to caption](https://arxiv.org/html/2606.24989v1/meth1.png)Figure 2:Schematic overview of the lcHOSVD and lcSVD methodology pipelines: from raw CFD data through tensor construction, normalisation, sparse sensor selection, low\-cost decomposition, and field reconstruction\.#### 2\.3\.1Low\-cost singular value decomposition \(lcSVD\)

In lcSVD, the key idea is to perform the decomposition on a spatially reduced version of the data and subsequently recover the full\-resolution spatial modes through projection\. Starting from the classical Singular Value Decomposition \(SVD\),

𝐕1K=𝐔𝚺𝐓⊤,\\mathbf\{V\}\_\{1\}^\{K\}=\\mathbf\{U\}\\,\\mathbf\{\\Sigma\}\\,\\mathbf\{T\}^\{\\top\},\(8\)
where𝐔\\mathbf\{U\}contains the spatial modes,𝚺\\mathbf\{\\Sigma\}contains the singular values, and𝐓⊤\\mathbf\{T\}^\{\\top\}contains the temporal modes\. The lcSVD methodology proposed by Hetherington & Le Clainche\[[19](https://arxiv.org/html/2606.24989#bib.bib27)\]builds on classical SVD \(Eq\. \([8](https://arxiv.org/html/2606.24989#S2.E8)\)\) by applying the decomposition to a spatially sampled representation of the dataset\. The procedure consists of the following steps:

1\. Construction of the reduced snapshot matrix:A subset ofJ¯\\bar\{J\}spatial locations is selected from the full domain, either equidistantly, randomly, or via optimal sensor placement\. The reduced snapshot matrix is then:

𝐕¯1K∈ℝJ¯×K,\\bar\{\\mathbf\{V\}\}\_\{1\}^\{K\}\\in\\mathbb\{R\}^\{\\bar\{J\}\\times K\},\(9\)
whereJ¯≪J\\bar\{J\}\\ll Jand each column contains the flow field sampled at the selected sensor locations at timetmt\_\{m\}\.

2\. SVD of the reduced matrix:The singular value decomposition is applied to the reduced matrix \(Eq\.\([9](https://arxiv.org/html/2606.24989#S2.E9)\)\):

𝐕¯1K≈𝐔¯𝚺¯𝐓¯⊤,\\bar\{\\mathbf\{V\}\}\_\{1\}^\{K\}\\approx\\bar\{\\mathbf\{U\}\}\\bar\{\\mathbf\{\\Sigma\}\}\\bar\{\\mathbf\{T\}\}^\{\\top\},\(10\)
where𝐔¯∈ℝJ¯×N¯\\bar\{\\mathbf\{U\}\}\\in\\mathbb\{R\}^\{\\bar\{J\}\\times\\bar\{N\}\}contains the reduced spatial modes in its columns,𝚺¯∈ℝN¯×N¯\\bar\{\\mathbf\{\\Sigma\}\}\\in\\mathbb\{R\}^\{\\bar\{N\}\\times\\bar\{N\}\}is a diagonal matrix of singular valuesσ1≥σ2≥⋯≥σN¯\>0\\sigma\_\{1\}\\geq\\sigma\_\{2\}\\geq\\cdots\\geq\\sigma\_\{\\bar\{N\}\}\>0, and𝐓¯∈ℝK×N¯\\bar\{\\mathbf\{T\}\}\\in\\mathbb\{R\}^\{K\\times\\bar\{N\}\}contains the temporal coefficients\. The number of retained modesN¯\\bar\{N\}is determined by the tolerance criterion:

σN¯\+1σ1≤εSVD,\\frac\{\\sigma\_\{\\bar\{N\}\+1\}\}\{\\sigma\_\{1\}\}\\leq\\varepsilon\_\{\\text\{SVD\}\},\(11\)
whereεSVD\\varepsilon\_\{\\text\{SVD\}\}is a user\-defined threshold\.

3\. Recovery of full\-resolution modes and temporal coefficients:The spatial modes and temporal coefficients of the full dataset are recovered from two semi\-reduced matrices:𝐕J,K¯∈ℝJ×K¯\\mathbf\{V\}^\{J,\\bar\{K\}\}\\in\\mathbb\{R\}^\{J\\times\\bar\{K\}\}, containing the full spatial resolution at the retained snapshot columns, and𝐕J¯,K∈ℝJ¯×K\\mathbf\{V\}^\{\\bar\{J\},K\}\\in\\mathbb\{R\}^\{\\bar\{J\}\\times K\}, containing the sensor rows at all snapshots\. Equations \([12](https://arxiv.org/html/2606.24989#S2.E12)–[13](https://arxiv.org/html/2606.24989#S2.E13)\) define these two recovery steps:

𝐔rec=𝐕J,K¯𝐓¯𝚺¯−1,\\mathbf\{U\}^\{\\mathrm\{rec\}\}=\\mathbf\{V\}^\{J,\\bar\{K\}\}\\,\\bar\{\\mathbf\{T\}\}\\,\\bar\{\\boldsymbol\{\\Sigma\}\}^\{\-1\},\(12\)𝐓rec=\(𝐕J¯,K\)⊤𝐔¯𝚺¯−1,\\mathbf\{T\}^\{\\mathrm\{rec\}\}=\\left\(\\mathbf\{V\}^\{\\bar\{J\},K\}\\right\)^\{\\\!\\top\}\\bar\{\\mathbf\{U\}\}\\,\\bar\{\\boldsymbol\{\\Sigma\}\}^\{\-1\},\(13\)where𝐔rec∈ℝJ×N¯\\mathbf\{U\}^\{\\mathrm\{rec\}\}\\in\\mathbb\{R\}^\{J\\times\\bar\{N\}\}and𝐓rec∈ℝK×N¯\\mathbf\{T\}^\{\\mathrm\{rec\}\}\\in\\mathbb\{R\}^\{K\\times\\bar\{N\}\}\.

4\. Reconstruction of the full dataset:The full snapshot matrix is reconstructed as:

𝐕1K,rec=𝐔rec𝚺¯\(𝐓rec\)⊤,\\mathbf\{V\}\_\{1\}^\{K,\\text\{rec\}\}=\\mathbf\{U\}^\{\\text\{rec\}\}\\bar\{\\mathbf\{\\Sigma\}\}\\left\(\\mathbf\{T\}^\{\\text\{rec\}\}\\right\)^\{\\top\},\(14\)
which is then reshaped back into the tensor form \(Eq\.\([3](https://arxiv.org/html/2606.24989#S2.E3)\)\) and denormalised to recover physical values\.

### 2\.4Low\-cost higher\-order singular value decomposition \(lcHOSVD\)

While lcSVD operates on the flattened snapshot matrix, the lcHOSVD framework preserves the full structure of the data tensor\. The method is based on the standard HOSVD, which decomposes the fifth\-order tensor into a core tensor multiplied by one orthonormal factor matrix per mode:

𝒱ij2j3j4k≃∑p1=1P1∑p2=1P2∑p3=1P3∑p4=1P4∑n=1N𝒮p1p2p3p4nUip1\(var\)Uj2p2\(1\)Uj3p3\(2\)Uj4p4\(3\)Tkn,\\mathcal\{V\}\_\{ij\_\{2\}j\_\{3\}j\_\{4\}k\}\\simeq\\sum\_\{p\_\{1\}=1\}^\{P\_\{1\}\}\\sum\_\{p\_\{2\}=1\}^\{P\_\{2\}\}\\sum\_\{p\_\{3\}=1\}^\{P\_\{3\}\}\\sum\_\{p\_\{4\}=1\}^\{P\_\{4\}\}\\sum\_\{n=1\}^\{N\}\\mathcal\{S\}\_\{p\_\{1\}p\_\{2\}p\_\{3\}p\_\{4\}n\}\\,U^\{\(\\mathrm\{var\}\)\}\_\{ip\_\{1\}\}\\,U^\{\(1\)\}\_\{j\_\{2\}p\_\{2\}\}\\,U^\{\(2\)\}\_\{j\_\{3\}p\_\{3\}\}\\,U^\{\(3\)\}\_\{j\_\{4\}p\_\{4\}\}\\,T\_\{kn\},\(15\)
where𝒮\\mathcal\{S\}is the core tensor,U\(1\)U^\{\(1\)\},U\(2\)U^\{\(2\)\}, andU\(3\)U^\{\(3\)\}are the spatial factor matrices, andTTcontains the temporal modes\.

In the low\-cost variant introduced here, the decomposition is computed from sparse sensor measurements along each axis, and the full\-resolution factor matrices are then recovered through projection, in direct analogy with the lcSVD procedure\.

1\. Sensor selection along each axis:A subset of sensor locations is selected equidistantly and independently along each spatial axis\. LetN¯x\\bar\{N\}\_\{x\},N¯y\\bar\{N\}\_\{y\},N¯z\\bar\{N\}\_\{z\}denote the number of retained points along thexx,yy, andzzdirections respectively, withN¯x≪Nx\\bar\{N\}\_\{x\}\\ll N\_\{x\},N¯y≪Ny\\bar\{N\}\_\{y\}\\ll N\_\{y\},N¯z≪Nz\\bar\{N\}\_\{z\}\\ll N\_\{z\}, and letIxI\_\{x\},IyI\_\{y\},IzI\_\{z\}be the corresponding sensor index sets,

Iα⊂\{1,…,Nα\},\|Iα\|=N¯α,α∈\{x,y,z\}\.I\_\{\\alpha\}\\subset\\\{1,\\dots,N\_\{\\alpha\}\\\},\\hskip 18\.49988pt\|I\_\{\\alpha\}\|=\\bar\{N\}\_\{\\alpha\},\\hskip 18\.49988pt\\alpha\\in\\\{x,y,z\\\}\.\(16\)These index sets define the rows retained in each mode unfolding in the following step\.

2\. Mode\-nnunfolding and SVD:For each spatial modeα∈\{x,y,z\}\\alpha\\in\\\{x,y,z\\\}, the mode\-α\\alphaunfolding of the data is restricted to the sensor indicesIαI\_\{\\alpha\}along that direction, giving𝐕¯\(α\)∈ℝN¯α×Mα\\bar\{\\mathbf\{V\}\}\_\{\(\\alpha\)\}\\in\\mathbb\{R\}^\{\\bar\{N\}\_\{\\alpha\}\\times M\_\{\\alpha\}\}, whereMαM\_\{\\alpha\}is the product of all remaining dimensions at full resolution\. Each column of𝐕¯\(α\)\\bar\{\\mathbf\{V\}\}\_\{\(\\alpha\)\}is a mode\-α\\alphafibre evaluated at the sensor indices\. Forα=x\\alpha=x, it is obtained by fixing the indices alongyy,zz, the variable mode, and the temporal mode, and retaining the values at theN¯x\\bar\{N\}\_\{x\}sensor positions alongxx\. SVD is then applied independently to each unfolding:

𝐕¯\(α\)≈𝐔¯α𝚺¯α𝐓¯α⊤,\\bar\{\\mathbf\{V\}\}\_\{\(\\alpha\)\}\\approx\\bar\{\\mathbf\{U\}\}\_\{\\alpha\}\\bar\{\\mathbf\{\\Sigma\}\}\_\{\\alpha\}\\bar\{\\mathbf\{T\}\}\_\{\\alpha\}^\{\\top\},\(17\)where𝐔¯α∈ℝN¯α×rα\\bar\{\\mathbf\{U\}\}\_\{\\alpha\}\\in\\mathbb\{R\}^\{\\bar\{N\}\_\{\\alpha\}\\times r\_\{\\alpha\}\}contains the left singular vectors \(reduced factor matrix for modeα\\alpha\),𝚺¯α∈ℝrα×rα\\bar\{\\mathbf\{\\Sigma\}\}\_\{\\alpha\}\\in\\mathbb\{R\}^\{r\_\{\\alpha\}\\times r\_\{\\alpha\}\}is the diagonal matrix of singular values, and𝐓¯α∈ℝMα×rα\\bar\{\\mathbf\{T\}\}\_\{\\alpha\}\\in\\mathbb\{R\}^\{M\_\{\\alpha\}\\times r\_\{\\alpha\}\}contains the right singular vectors\. Here𝐓¯α\\bar\{\\mathbf\{T\}\}\_\{\\alpha\}spans the remaining dimensionsMαM\_\{\\alpha\}at full resolution and is shared between the sensor\-restricted and full\-resolution mode\-α\\alphaunfoldings, which is the basis exploited in the lift of Eq\. \([18](https://arxiv.org/html/2606.24989#S2.E18)\)\. The rankrαr\_\{\\alpha\}along each mode is selected independently using the tolerance criterion \(Eq\. \([11](https://arxiv.org/html/2606.24989#S2.E11)\)\), or prescribed directly by the user based on singular value analysis\.

3\. Recovery of full\-resolution factor matrices:The full\-resolution factor matrix for each spatial modeα\\alphais recovered by projecting the full\-resolution mode\-α\\alphaunfolding𝐕\(α\)∈ℝNα×Mα\\mathbf\{V\}\_\{\(\\alpha\)\}\\in\\mathbb\{R\}^\{N\_\{\\alpha\}\\times M\_\{\\alpha\}\}of the original tensor onto the reduced right singular basis𝐓¯α\\bar\{\\mathbf\{T\}\}\_\{\\alpha\}obtained from the reduced unfolding \(Eq\. \([17](https://arxiv.org/html/2606.24989#S2.E17)\)\):

𝐔αrec=𝐕\(α\)𝐓¯α\(𝚺¯α\)−1,\\mathbf\{U\}\_\{\\alpha\}^\{\\text\{rec\}\}=\\mathbf\{V\}\_\{\(\\alpha\)\}\\bar\{\\mathbf\{T\}\}\_\{\\alpha\}\\left\(\\bar\{\\mathbf\{\\Sigma\}\}\_\{\\alpha\}\\right\)^\{\-1\},\(18\)where𝐔αrec∈ℝNα×rα\\mathbf\{U\}\_\{\\alpha\}^\{\\text\{rec\}\}\\in\\mathbb\{R\}^\{N\_\{\\alpha\}\\times r\_\{\\alpha\}\}is the reconstructed full\-resolution factor matrix for modeα\\alpha\. This step lifts the reduced factor matrix, computed from the sparse sensor measurements, back to the full spatial resolutionNαN\_\{\\alpha\}by exploiting the right singular vectors as a shared basis between the reduced and full unfoldings\.

4\. Core tensor computation:The core tensor𝒢∈ℝnvar×rx×ry×rz×rt\\mathcal\{G\}\\in\\mathbb\{R\}^\{n\_\{\\text\{var\}\}\\times r\_\{x\}\\times r\_\{y\}\\times r\_\{z\}\\times r\_\{t\}\}is obtained by projecting the original tensor𝒱\\mathcal\{V\}onto all five reconstructed factor matrices:

𝒢p1p2p3p4p5=∑i,j,k,l,m𝒱ijklm\(Uvarrec\)ip1\(Uxrec\)jp2\(Uyrec\)kp3\(Uzrec\)lp4\(Utrec\)mp5,\\mathcal\{G\}\_\{p\_\{1\}p\_\{2\}p\_\{3\}p\_\{4\}p\_\{5\}\}=\\sum\_\{i,j,k,l,m\}\\mathcal\{V\}\_\{ijklm\}\\,\(U\_\{\\text\{var\}\}^\{\\text\{rec\}\}\)\_\{ip\_\{1\}\}\\,\(U\_\{x\}^\{\\text\{rec\}\}\)\_\{jp\_\{2\}\}\\,\(U\_\{y\}^\{\\text\{rec\}\}\)\_\{kp\_\{3\}\}\\,\(U\_\{z\}^\{\\text\{rec\}\}\)\_\{lp\_\{4\}\}\\,\(U\_\{t\}^\{\\text\{rec\}\}\)\_\{mp\_\{5\}\},\(19\)whereii,jj,kk,ll,mmare the indices along the variable,xx,yy,zz, and temporal modes of𝒱\\mathcal\{V\}, andp1,…,p5p\_\{1\},\\ldots,p\_\{5\}are the corresponding compressed indices of𝒢\\mathcal\{G\}\. The factor matrices along the variable and temporal modes,𝐔varrec\\mathbf\{U\}\_\{\\mathrm\{var\}\}^\{\\mathrm\{rec\}\}and𝐔trec\\mathbf\{U\}\_\{t\}^\{\\mathrm\{rec\}\}, are computed directly from the standard SVD of the corresponding full unfoldings\.

5\. Reconstruction of the full tensor:The full\-resolution tensor is recovered by expanding the core tensor back through all factor matrices:

𝒱ijklmrec=∑p1,p2,p3,p4,p5𝒢p1p2p3p4p5\(Uvarrec\)ip1\(Uxrec\)jp2\(Uyrec\)kp3\(Uzrec\)lp4\(Utrec\)mp5,\\mathcal\{V\}^\{\\text\{rec\}\}\_\{ijklm\}=\\sum\_\{p\_\{1\},p\_\{2\},p\_\{3\},p\_\{4\},p\_\{5\}\}\\mathcal\{G\}\_\{p\_\{1\}p\_\{2\}p\_\{3\}p\_\{4\}p\_\{5\}\}\\,\(U\_\{\\text\{var\}\}^\{\\text\{rec\}\}\)\_\{ip\_\{1\}\}\\,\(U\_\{x\}^\{\\text\{rec\}\}\)\_\{jp\_\{2\}\}\\,\(U\_\{y\}^\{\\text\{rec\}\}\)\_\{kp\_\{3\}\}\\,\(U\_\{z\}^\{\\text\{rec\}\}\)\_\{lp\_\{4\}\}\\,\(U\_\{t\}^\{\\text\{rec\}\}\)\_\{mp\_\{5\}\},\(20\)where the indicesii,jj,kk,ll,mmnow run over the full spatial and temporal dimensionsnvarn\_\{\\text\{var\}\},NxN\_\{x\},NyN\_\{y\},NzN\_\{z\}, andKKrespectively, recovering the complete flow field at full resolution\.

![Refer to caption](https://arxiv.org/html/2606.24989v1/method1.png)Figure 3:Schematic overview of the steps in the lcHOSVD methodology\.Step\-by\-step illustration of the lcHOSVD methodology is presented in Fig\.[3](https://arxiv.org/html/2606.24989#S2.F3)\. Sensors are selected independently along each spatial axis \(red planes\)\. Each mode\-α\\alphaunfolding is restricted to the sensor rows and decomposed, and the full\-resolution factor matrices are recovered by lifting through the right singular basis\. The core tensor is obtained by projection onto all factor matrices, and the full field is reconstructed by expanding the core through the factor bases\.

This formulation also admits a direct extension to data assimilation, since the combination of sensor information and CFD data is already built into Eqs\. \([18](https://arxiv.org/html/2606.24989#S2.E18)\) and \([19](https://arxiv.org/html/2606.24989#S2.E19)\)\. The sensor measurements enter the method through the restricted unfoldings \(Eq\. \([17](https://arxiv.org/html/2606.24989#S2.E17)\)\), which provide the reduced bases𝐓¯α\\bar\{\\mathbf\{T\}\}\_\{\\alpha\}and𝚺¯α\\bar\{\\boldsymbol\{\\Sigma\}\}\_\{\\alpha\}, while the CFD data is processed through the lift \(Eq\. \([18](https://arxiv.org/html/2606.24989#S2.E18)\)\) and the core projection \(Eq\. \([19](https://arxiv.org/html/2606.24989#S2.E19)\)\), where the full\-resolution fields are combined with the sensor\-derived bases\. In the present work, the sensor values are extracted from the CFD solution itself, but the same equations apply when the restricted unfoldings are assembled from real observations in place of, or merged with, the CFD values at the sensor locations, following a hybrid\-matrix strategy\. The lift then propagates the assimilated information to the full spatial resolution through the factor matrix of each direction\. Since each spatial mode holds its own restricted unfolding, this extension preserves the tensor structure throughout the assimilation cycle and allows direction\-dependent weighting of the observational and model contributions, a possibility that the flattened formulation does not offer\.

### 2\.5Performance assessment

Three metrics are used to assess reconstruction quality and flow structure: the relative root mean square error \(RRMSE\) for quantitative accuracy, the Q\-criterion for physical assessment of the reconstructed structures, and the compression ratio \(CR\) for savings\.

#### 2\.5\.1Relative root mean square error

The RRMSE measures the pointwise agreement between the reconstructed field and the reference CFD solution\. For a given variable, it is defined as:

RRMSE=1N∑i=1N\(𝐕i−𝐕irec\)21N∑i=1N𝐕i2,\\mathrm\{RRMSE\}=\\frac\{\\sqrt\{\\dfrac\{1\}\{N\}\\displaystyle\\sum\_\{i=1\}^\{N\}\\left\(\\mathbf\{V\}\_\{i\}\-\\mathbf\{V\}^\{\\mathrm\{rec\}\}\_\{i\}\\right\)^\{2\}\}\}\{\\sqrt\{\\dfrac\{1\}\{N\}\\displaystyle\\sum\_\{i=1\}^\{N\}\\mathbf\{V\}\_\{i\}^\{2\}\}\},\(21\)where𝐕i\\mathbf\{V\}\_\{i\}represents the reference values,𝐕irec\\mathbf\{V\}\_\{i\}^\{rec\}the reconstructed values obtained from the low\-cost method, and𝑵\\boldsymbol\{N\}the total number of samples\. The RRMSE is evaluated exclusively over fluid \(non\-building\) cells, since building cells carry no physical flow information and their inclusion would artificially affect the error metric\.

#### 2\.5\.2Q\-criterion

The Q\-criterion\[[21](https://arxiv.org/html/2606.24989#bib.bib34)\]is a widely used approach for identifying vortical structures in reconstructed three\-dimensional velocity fields\. Although global error metrics provide an overall measure of reconstruction accuracy, they do not necessarily reflect the ability of the reduced\-order models to preserve physically relevant flow structures\. Therefore, a qualitative comparison based on the Q\-criterion is performed to assess the reconstruction\. It is defined as the second invariant of the velocity gradient tensor∇𝐮\\nabla\\mathbf\{u\}:

Q=12\(‖𝛀‖F2−‖𝐒‖F2\),Q=\\frac\{1\}\{2\}\\left\(\\\|\\mathbf\{\\Omega\}\\\|\_\{F\}^\{2\}\-\\\|\\mathbf\{S\}\\\|\_\{F\}^\{2\}\\right\),\(22\)
where𝛀\\mathbf\{\\Omega\}is the antisymmetric rotation rate tensor,𝐒\\mathbf\{S\}is the symmetric strain rate tensor, and∥⋅∥F\\\|\\cdot\\\|\_\{F\}denotes the Frobenius norm\. Regions whereQ\>0Q\>0indicate that rotation dominates over strain\.

#### 2\.5\.3Compression factor

The compression factor \(CF\) is defined as the fraction of spatial locations retained relative to the full spatial domain:

CF=NxNyNzNsensors,CF=\\frac\{N\_\{x\}N\_\{y\}N\_\{z\}\}\{N\_\{\\mathrm\{sensors\}\}\},\(23\)
whereNsensorsN\_\{\\mathrm\{sensors\}\}denotes the number of retained sensor locations andNxNyNzN\_\{x\}N\_\{y\}N\_\{z\}represents the total number of spatial grid points in the full domain\. The corresponding compression percentage is defined as:

𝒞\(%\)=\(1−NsensorsNxNyNz\)×100%,\\mathcal\{C\(\\%\)\}=\\left\(1\-\\frac\{N\_\{\\mathrm\{sensors\}\}\}\{N\_\{x\}N\_\{y\}N\_\{z\}\}\\right\)\\times 100\\%,\(24\)
where𝒞\\mathcal\{C\}represents the percentage reduction in spatial degrees of freedom\.

## 3\. Datasets

Two urban flow datasets of different flow natures and geometric complexities are considered in this study\. A large\-scale three\-dimensional urban CFD simulation of the Vallecas district in Madrid and a turbulent two\-building case computed with high\-fidelity LES were used to evaluate the low\-cost framework\.

### 3\.1Three\-dimensional urban CFD dataset \(Vallecas, Madrid\)

The first dataset corresponds to a large\-scale urban CFD case study in the Vallecas district of southeastern Madrid\[[22](https://arxiv.org/html/2606.24989#bib.bib48)\], selected because of its dense residential and educational land use and its proximity to the A\-3 and M\-40 motorway corridors, two of the busiest roads in the Madrid metropolitan area\. It is further motivated by the ongoing municipal redevelopment plan that includes the construction of approximately 1,400 housing units and a student residence in the area\[[11](https://arxiv.org/html/2606.24989#bib.bib36)\]\.

The three\-dimensional urban geometry was reconstructed at Level of Detail 2\.2 using the open\-source tool city4CFD\[[34](https://arxiv.org/html/2606.24989#bib.bib37)\], which fits the true roof shape of each building from the LiDAR point cloud obtained from the Madrid Geoportal\[[2](https://arxiv.org/html/2606.24989#bib.bib38)\]\. Terrain, buildings, vegetation, and water bodies are represented as distinct surface layers\. The computational domain follows the best\-practice guidelines of Blocken et al\.\[[4](https://arxiv.org/html/2606.24989#bib.bib7)\], with a horizontal extent of approximately 15 times the maximum building height and a vertical extent of 6 times the maximum building height\. This yields a domain diameter of approximately 3410 m and a vertical extent ranging between 317 m and 358 m, due to variations in terrain elevation across the domain\. The resulting mesh comprises approximately 100 million cells\.

Steady\-state incompressible RANS simulations \(Eq\.[25](https://arxiv.org/html/2606.24989#S3.E25)to[28](https://arxiv.org/html/2606.24989#S3.E28)\) were carried out with OpenFOAM\[[45](https://arxiv.org/html/2606.24989#bib.bib39)\]using the SimpleFoam solver\[[36](https://arxiv.org/html/2606.24989#bib.bib40)\], which solves the continuity and momentum equations:

∇⋅𝐔=0,\\nabla\\cdot\\mathbf\{U\}=0,\(25\)∇⋅\(𝐔𝐔\)=−∇p\+∇⋅\[\(ν\+νt\)\(∇𝐔\+∇𝐔T\)\]\.\\nabla\\cdot\\left\(\\mathbf\{U\}\\mathbf\{U\}\\right\)=\-\\nabla p\+\\nabla\\cdot\\bigl\[\(\\nu\+\\nu\_\{t\}\)\(\\nabla\\mathbf\{U\}\+\\nabla\\mathbf\{U\}^\{T\}\)\\bigr\]\.\(26\)
where𝐔\\mathbf\{U\}is the Reynolds\-averaged velocity vector,ppis the kinematic pressure,ν\\nuis the kinematic viscosity, andνt\\nu\_\{t\}is the turbulent eddy viscosity\.

Turbulence closure is provided by the standardkk–ε\\varepsilonmodel\[[25](https://arxiv.org/html/2606.24989#bib.bib41)\], with constants adjusted for a neutral atmospheric boundary layer \(ABL\)\[[17](https://arxiv.org/html/2606.24989#bib.bib43)\]and custom wall functions\[[35](https://arxiv.org/html/2606.24989#bib.bib42)\]\. Inlet profiles follow the logarithmic ABL law\[[35](https://arxiv.org/html/2606.24989#bib.bib42)\]:

U\(z\)=u∗κln⁡z\+z0z0,k=u∗2Cμ,ε=u∗3κ\(z\+z0\),U\(z\)=\\frac\{u\_\{\*\}\}\{\\kappa\}\\ln\\\!\\frac\{z\+z\_\{0\}\}\{z\_\{0\}\},\\qquad k=\\frac\{u\_\{\*\}^\{2\}\}\{\\sqrt\{C\_\{\\mu\}\}\},\\qquad\\varepsilon=\\frac\{u\_\{\*\}^\{3\}\}\{\\kappa\(z\+z\_\{0\}\)\},\(27\)whereu∗u\_\{\*\}is the friction velocity derived from the hourly reference wind speedUrefU\_\{\\mathrm\{ref\}\}measured atzref=5\.5z\_\{\\mathrm\{ref\}\}=5\.5m, andz0=0\.25z\_\{0\}=0\.25m is the aerodynamic roughness length representative of the dense urban fabric\. Distinct values are assigned to vegetated areas \(z0,veg=0\.10z\_\{0,\\mathrm\{veg\}\}=0\.10m\) and water surfaces \(z0,water=0\.002z\_\{0,\\mathrm\{water\}\}=0\.002m\)\. A zero\-gradient condition is imposed at the outlet and a symmetry condition at the top boundary\. The selected domain height is sufficient to minimize the influence of the upper boundary condition on the flow field\.

Concentrations of carbon monoxide \(CO\), nitrogen oxides \(NOx\), and particulate matter \(PM\) are modelled by a passive scalar transport equation coupled to the converged velocity field:

∇⋅\(𝐔ϕ\)=∇⋅\[\(νSc\+νtSct\)∇ϕ\]\+Sϕ,\\nabla\\cdot\(\\mathbf\{U\}\\phi\)=\\nabla\\cdot\\\!\\left\[\\left\(\\frac\{\\nu\}\{Sc\}\+\\frac\{\\nu\_\{t\}\}\{Sc\_\{t\}\}\\right\)\\nabla\\phi\\right\]\+S\_\{\\phi\},\(28\)whereScScandSctSc\_\{t\}are the molecular and turbulent Schmidt numbers, andSϕS\_\{\\phi\}is a volumetric source term derived from hourly traffic counts on the A\-3 and M\-40 corridors, disaggregated by vehicle category \(passenger cars, light\-duty vehicles, heavy\-duty vehicles, motorcycles\) from camera\-detection data \(Dirección General de Tráfico, DGT\)\. Emission factors follow the COPERT methodology\[[12](https://arxiv.org/html/2606.24989#bib.bib44)\]and are distributed as line\-source intensities over the road\-surface cells of the mesh\.

The dataset comprises hourly steady\-state simulations for the 24 hours of 1st February 2024\. This was selected because it recorded the highest daily pollutant concentrations of the year at the nearest station of the Madrid air\-quality monitoring network\. Wind conditions on that day are predominantly from the north and northeast, with a mean reference speed of 0\.7 m/s and a maximum of 1\.2 m/s, yielding a Reynolds number of approximately10710^\{7\}\. The methodology has been tested on the first hour \(00:00 to 01:00 hrs\) of data from the hourly steady\-state simulations\.

![Refer to caption](https://arxiv.org/html/2606.24989v1/gt_physical_uvw_pk.png)Figure 4:Ground\-truth velocity fields extracted at AGL \(z=5z=5m\) for the Vallecas urban\-flow dataset\. From left to right \(1st row\): streamwise, wall\-normal, and spanwise velocity components \(uu\(m/s\),vv\(m/s\), andww\(m/s\)\)\. From left to right \(2nd row\): Kinematic pressure and turbulent kinetic energy \(pp\(m2/s2\) andkk\(m2/s2\)\)\. The lines mark where terrain or buildings rise above thez=5z=5m AGL plane\.![Refer to caption](https://arxiv.org/html/2606.24989v1/gt_pollutants_co_nox_pm.png)Figure 5:Ground\-truth pollutant concentration fields extracted at AGL \(z=5z=5m\) for the Vallecas urban\-flow dataset\. From left to right: carbon monoxide, nitrogen oxides, and particulate matter \(CO \(kg/m3\), NOx\(kg/m3\), and PM \(kg/m3\)\)\.For the ROM analysis, the data are transferred to a structured Cartesian grid via a terrain\-following interpolation\. For each horizontal cell\(xi,yj\)\(x\_\{i\},y\_\{j\}\), the terrain elevationzterrain\(xi,yj\)z\_\{\\mathrm\{terrain\}\}\(x\_\{i\},y\_\{j\}\)is extracted using a KD\-tree nearest\-neighbour query\[[43](https://arxiv.org/html/2606.24989#bib.bib45)\]on the unstructured mesh, and the field variables are then sampled atNz=30N\_\{z\}=30height levels above ground level \(AGL\) from 5 m to 70 m\. This produces a five\-dimensional tensor:

𝐕∈ℝnvar×Nx×Ny×Nz×Nt,\\mathbf\{V\}\\in\\mathbb\{R\}^\{n\_\{\\mathrm\{var\}\}\\times N\_\{x\}\\times N\_\{y\}\\times N\_\{z\}\\times N\_\{t\}\},\(29\)wherenvar=8n\_\{\\mathrm\{var\}\}=8corresponds to the variables\[u,v,w,p,CO,NOx,PM,k\]\[u,\\,v,\\,w,\\,p,\\,\\mathrm\{CO\},\\,\\mathrm\{NO\}\_\{x\},\\,\\mathrm\{PM\},\\,k\],\(Nx,Ny\)=\(500,500\)\(N\_\{x\},N\_\{y\}\)=\(500,500\)are the horizontal grid dimensions,Nz=30N\_\{z\}=30the vertical levels, andNtN\_\{t\}the number of simulation hours\. Hereppdenotes the kinematic pressure,kkthe turbulent kinetic energy, andCO\\mathrm\{CO\},NOx\\mathrm\{NO\}\_\{x\},PM\\mathrm\{PM\}the concentrations of carbon monoxide, nitrogen oxides, and particulate matter, respectively\. The ground truth snapshots have been presented in Fig\.[4](https://arxiv.org/html/2606.24989#S3.F4)and[5](https://arxiv.org/html/2606.24989#S3.F5), and the pollutant concentrations are displayed scaled by the factor indicated in each colour\-bar label \(10−810^\{\-8\}kg/m3for CO and NOx, and10−1010^\{\-10\}kg/m3for PM\)\. The lines indicate the boundaries of elevated terrain and building surfaces intersecting thez=5z=5m horizontal plane, delineating regions where the ground or urban canopy rises above the selected height level\. In contrast to data\-driven surrogates that must be trained on input\-output pairs, the present framework is purely reconstructive\. The sparse input is obtained by sampling the reference field at the sensor locations, so for every case considered in this study both the input and the full\-resolution reference are known, allowing the reconstruction error to be evaluated directly\.

For datasets containing a single temporal snapshot, the rank along the temporal mode is one, which results in an exact reconstruction and provides no information about the dominant spatial structures\. To enable a meaningful modal analysis, the spatial dimensions are reshaped into a two\-dimensional snapshot matrix\. Among the possible arrangements, the\(X,Z\)×Y\(X,Z\)\\times Yconfiguration is adopted here, where the streamwise and spanwise directions are combined into a single spatial coordinate of sizeNxNzN\_\{x\}N\_\{z\}, yielding a matrix of dimensionsNxNz×NyN\_\{x\}N\_\{z\}\\times N\_\{y\}\. The\(X,Y\)×Z\(X,Y\)\\times Zarrangement was not adopted as it places the spanwise direction in the snapshot role, limiting the maximum number of extractable modes toNz=30N\_\{z\}=30\. Given the spatial complexity of the horizontal flow field across a500×500500\\times 500grid, this rank ceiling is too restrictive to capture the dominant structures adequately\. The arrangement\(Y,Z\)×X\(Y,Z\)\\times Xwas also tested and produced comparable results\.

### 3\.2Two\-building urban flow dataset

The second dataset consists of a DNS simulation of flow around two wall\-mounted rectangular obstacles, representing a simplified urban street configuration, which was obtained from the work of Á\. Martínez\-Sánchez et al\.\[[30](https://arxiv.org/html/2606.24989#bib.bib35)\]\. The governing equations and modelling approach differ from those described previously and are detailed in Ref\.\[[30](https://arxiv.org/html/2606.24989#bib.bib35)\]\. The simulations were performed using the spectral\-element code Nek5000 at a Reynolds numberReh=10,000Re\_\{h\}=10\{,\}000based on the obstacle heighthhand the free\-stream velocityU∞U\_\{\\infty\}\.

The domain consists of two identical wall\-mounted obstacles\. Each obstacle has heighthh, taken as the reference length and used to define the Reynolds numberRehRe\_\{h\}; all lengths are normalised byhh\. The streamwise length of each obstacle iswbw\_\{b\}, and its spanwise width isbb, withwb/h=b/h=0\.5w\_\{b\}/h=b/h=0\.5\. The obstacles sit in a channel of wall\-normal extentLy=3hL\_\{y\}=3hand spanwise extentLz=4hL\_\{z\}=4h, and are separated in the streamwise direction by a distanceℓ\\ell\. Three separation ratiosℓ/h=1,2,4\\ell/h=1,2,4were simulated, corresponding to the skimming\-flow \(SF\), wake\-interference \(WI\), and isolated\-roughness \(IR\) regimes\. The tensor used in this study was obtained for the SF case \(ℓ/h=1\\ell/h=1\)\.

The mesh employs an eight\-point Gauss\-Lobatto Legendre \(GLL\) quadrature within each spectral element, refined near the obstacle surfaces\. The resolution satisfies the criteria with the wall\-normal and spanwise directions held fixed across all three cases at approximately6×1066\\times 10^\{6\}grid points for the SF regime\. The inflow is located atx/h=−10x/h=\-10, with the tripping force applied atx/h=−9x/h=\-9, allowing the boundary layer to develop over the region−8≤x/h≤−1\-8\\leq x/h\\leq\-1, reaching fully turbulent conditions \(Reτ≈175Re\_\{\\tau\}\\approx 175,Reθ≈450Re\_\{\\theta\}\\approx 450\) upstream of the obstacle atx/h=−2x/h=\-2\.

![Refer to caption](https://arxiv.org/html/2606.24989v1/1.png)

![Refer to caption](https://arxiv.org/html/2606.24989v1/2.png)

![Refer to caption](https://arxiv.org/html/2606.24989v1/3.png)

Figure 6:Ground\-truth \(left\) and HOSVD\-reconstructed \(right\) snapshots for the two\-building dataset with retained modes\(19,14,15\)\(19,14,15\)\. From top to bottom: streamwise velocityu/U∞u/U\_\{\\infty\}, wall\-normal velocityv/U∞v/U\_\{\\infty\}, and spanwise velocityw/U∞w/U\_\{\\infty\}\. All quantities are non\-dimensional: velocities are scaled by the free\-stream velocityU∞U\_\{\\infty\}, and the spatial axes, shown asxxandyyin the panels, denote the coordinates normalized by the obstacle height, i\.e\.x/hx/handy/hy/h\.Figure[6](https://arxiv.org/html/2606.24989#S3.F6)shows representative ground\-truth and HOSVD\-reconstructed snapshots for the three velocity components from the SF regime\. The comparison is shown foruu,vv, andww, providing a direct visual assessment of the reconstruction quality across the different flow components\. The raw data contain a broad range of turbulent scales and small\-scale fluctuations that are not retained within a reduced\-order representation\. Since the objective of the present work is to assess the ability of the proposed low\-cost methodology to reproduce the dominant flow structures, comparisons for this test case are performed against the HOSVD reconstruction\. This provides a consistent basis for evaluation by isolating the effects of the sparse sampling from the discrepancies associated with unresolved small\-scale turbulence\. The HOSVD reconstruction represents the best approximation attainable at the chosen modes when the full data are available, so it constitutes the natural upper bound for any low\-cost variant operating at the same modes\. The difference between a low\-cost reconstruction and this reference measures only the error introduced by computing the decomposition from the sensor subset\. The HOSVD reference is computed with19,14,1519,14,15modes along thexx,yy, andzzdirections, selected from the singular\-value decay as discussed in Section[4\.2](https://arxiv.org/html/2606.24989#S4.SS2)\.

Table[1](https://arxiv.org/html/2606.24989#S3.T1)summarises the two datasets used in this study, including the tensor shape, the physical variables retained, and the number of temporal snapshots\. This generalised formulation accommodates datasets of varying composition\. In the two\-building flow case considered here,Tvar=3T\_\{\\mathrm\{var\}\}=3corresponds to the three velocity components, while for the Vallecas urban CFD case, the tensor includes velocity components, pressure, turbulent kinetic energy, and multiple pollutant concentrations, giving a largerTvarT\_\{\\mathrm\{var\}\}\. In both cases, the structure \(Eqs\. \([1](https://arxiv.org/html/2606.24989#S2.E1)\-[6](https://arxiv.org/html/2606.24989#S2.E6)\)\) remains unchanged; only the number of variable slices in the first tensor mode varies\. The tensor shape follows the convention\(nvar×Nx×Ny×Nz×Nt\)\(n\_\{\\text\{var\}\}\\times N\_\{x\}\\times N\_\{y\}\\times N\_\{z\}\\times N\_\{t\}\), wherenvarn\_\{\\text\{var\}\}is the number of physical variables,NxN\_\{x\}andNyN\_\{y\}are the number of grid points in the streamwise and wall\-normal directions,NzN\_\{z\}in the spanwise direction, andNtN\_\{t\}is the number of temporal snapshots\.

Table 1:Summary of the two datasets used in this study\.

## 4\. Results

The results of the lcSVD and lcHOSVD reconstructions are presented for both datasets\. For each case, the number of sensors is first determined, followed by a quantitative assessment of the reconstruction accuracy via the relative root\-mean\-square error \(RRMSE\), computed on fluid cells only\. The compression \(Eqs\. \([23](https://arxiv.org/html/2606.24989#S2.E23)\) and \([24](https://arxiv.org/html/2606.24989#S2.E24)\)\), representative reconstructed snapshots, and Q\-criterion isosurfaces \(Eq\. \([22](https://arxiv.org/html/2606.24989#S2.E22)\)\) are shown to assess the quality of the recovered flow structures\.

### 4\.1Mode and sensor selection

The Vallecas domain covers approximately1705×17051705\\times 1705m in the horizontal plane and 70 m in the vertical, discretised with500×500×30500\\times 500\\times 30grid points, about7\.5×1067\.5\\times 10^\{6\}spatial locations\. The retained modes are chosen from the singular\-value decay shown in Fig\.[7](https://arxiv.org/html/2606.24989#S4.F7)forkkand PM, with the remaining variables given in Appendix[A\.2](https://arxiv.org/html/2606.24989#A1.SS2)\. For the velocity components andkk, the10−210^\{\-2\}threshold is reached at approximately 35\-55 modes in the streamwise and normal directions and 11\-18 modes in the spanwise direction, so the retained counts are set torx=ry=50r\_\{x\}=r\_\{y\}=50,rz=20r\_\{z\}=20, with 30 modes for lcSVD\. The spanwise velocity and the pressure exhibit higher small\-scale features and more localized flow structures, and so the threshold requires slightly more, andrx=ry=60r\_\{x\}=r\_\{y\}=60,rz=20r\_\{z\}=20are retained, with 40 modes for lcSVD\. The pollutant spectra behave quite differently, where the10−210^\{\-2\}crossing is not reached until modes 178\-195, reflecting the fine spatial gradients of the road\-corridor emission plumes\. The retained count is nevertheless kept at 50 modes for these variables\. In terms of energy, the contribution of the additional modes is minimal, and extending the basis towards the10−210^\{\-2\}crossing increases the captured energy only marginally\. The sensor requirement, in contrast, grows with the retained modes, since the number of sensors along each axis must reach the number of modes retained along that axis\. Matching the crossing would demand close to 180 sensor planes per horizontal direction, a network beyond any realistic monitoring deployment, for a limited gain in accuracy\. Retaining 50 modes keeps the pollutants within the same configuration as the velocity fields and preserves the compression\.

The sensor networks follow directly from these counts\. A network of50×50×2050\\times 50\\times 20sensors is employed for the first group of variables \(50,000 locations, a horizontal spacing of about 34 m andCF=150×CF=150\\times\), and60×60×2060\\times 60\\times 20sensors for the second \(72,000 locations, about 29 m spacing andCF=104×CF=104\\times\)\. Both configurations use the minimum admissible budget per direction to maximise compression, retaining less than 1% of the grid points with roughly 1–2 sensors per street block\. Figure[8](https://arxiv.org/html/2606.24989#S4.F8)illustrates representative subsets of the sensor distributions for both datasets\. Only a fraction of the networks is displayed, intended to convey the equidistant arrangement rather than the full density\.

![Refer to caption](https://arxiv.org/html/2606.24989v1/k_sv_decay.png)\(a\)Turbulent kinetic energy \(kk\)
![Refer to caption](https://arxiv.org/html/2606.24989v1/PM_sv_decay.png)\(b\)Particulate matter \(PM\)

Figure 7:Singular\-value decayσk/σ0\\sigma\_\{k\}/\\sigma\_\{0\}for the turbulent kinetic energy \(kk\) and particulate matter \(PM\) fields in the Vallecas dataset\.![Refer to caption](https://arxiv.org/html/2606.24989v1/4.png)\(a\)Vallecas urban domain\.
![Refer to caption](https://arxiv.org/html/2606.24989v1/5.png)\(b\)Two\-building domain\.

Figure 8:Representative sensor distributions employed for the low\-cost decompositions\. A subset of 2,000 \(Vallecas\) and 500 \(two\-building\) sensor locations is displayed out of the actual networks of 50,000 and 25,000 measurement points, respectively\.The choice of this configuration is supported by an anisotropy analysis, which examines how the reconstruction responds when the sensor count along one direction departs from the nominal value\. In practical monitoring campaigns, the sensor distribution is rarely uniform, as access and cost constraints often limit the number of measurement planes along a given spatial direction\. To assess the response of both methods to this situation, the reconstruction is repeated with a deliberately anisotropic sensor arrangement\. The reconstruction is repeated with the sensor counts alongxxandzzfixed at their nominal values, while the number of sensor planes alongyy,ns,yn\_\{s,y\}, is reduced progressively\. Both methods receive the same total sensor budget at every point of the sweep, and for lcHOSVD only the number of modes alongyyis capped atns,yn\_\{s,y\}\. Figure[9](https://arxiv.org/html/2606.24989#S4.F9)shows the resulting RRMSE\. Adding sensors beyond the nominal configuration brings only marginal gains where increasingns,yn\_\{s,y\}from 50 to full sampling at 500 planes, a tenfold increase in the budget, reduces the lcHOSVD error from 6\.00% to 5\.73% foruu, from 11\.10% to 10\.02% forkk, and from 17\.45% to 15\.56% for NOx\. Reducing the sensors below the nominal value, in contrast, degrades the reconstruction rapidly, and far more severely for lcSVD\. Atns,y=10n\_\{s,y\}=10, its error reaches 62\.71% for NOx, 61\.07% for CO, and 62\.34% for PM, against 46\.75%, 46\.19%, and 46\.74% for lcHOSVD\. The origin of this behaviour is structural, as in the matrix formulation, theyy\-planes act as the snapshots of the unfolded matrix, so the rank of the lcSVD reconstruction cannot exceedns,yn\_\{s,y\}, and starving one direction penalises all three at once\. In the tensor formulation, each spatial direction holds its own factor matrix, and the loss of information remains confined to the y modes\. The only requirement is that the number of sensor planes reaches the y\-rank of the decomposition\. As the number of measurement planesns,yn\_\{s,y\}increases, the lcHOSVD error falls steadily and approaches the full HOSVD reference for every variable\. This convergence confirms that the larger errors seen at the sparsest sensor configurations come from the limited amount of measurement data rather than from any intrinsic limitation of lcHOSVD, which recovers the full\-data baseline once enough planes are sampled\.

For the two\-building dataset, the10−210^\{\-2\}threshold of the singular\-value decay is reached within 10\-20 modes depending on the direction and velocity component, andrx=19,ry=14,rz=15r\_\{x\}=19,r\_\{y\}=14,r\_\{z\}=15modes are retained for lcHOSVD, covering the most demanding component in each direction, and 8 modes are retained for the lcSVD formulation\. The10−110^\{\-1\}threshold is reached within 3 modes in all directions for every component, reflecting the dominance of large\-scale structures in the skimming\-flow regime\. The sensor configuration is40×25×2540\\times 25\\times 25, corresponding to 25,000 measurement locations, 4% of the 625,000 grid points \(CF=25×CF=25\\times\)\. The same anisotropic sweep is applied to this case, with 40 and 25 sensor planes fixed alongxxandzz, and the result is shown in Fig\.[10](https://arxiv.org/html/2606.24989#S4.F10)\. The lcSVD error remains nearly constant, since its rank is fixed by the retained modes rather than by the sensor layout, while the lcHOSVD error decreases withns,yn\_\{s,y\}and settles once the 14 modes alongyyare matched by the sensor planes, where it lies below the lcSVD error by 2\.7 percentage points foruu, 13\.0 forvvand up to 15\.5 forww\. For the smallest sensor counts,ns,y≤3n\_\{s,y\}\\leq 3, theyybasis of lcHOSVD also collapses, and the advantage disappears foruuandww\. Above this threshold, the tensor method degrades gracefully under anisotropic sampling\.

![Refer to caption](https://arxiv.org/html/2606.24989v1/anisotropy_all_vars.png)Figure 9:Sensor anisotropy study for the Vallecas case\. Relative root\-mean\-square error \(RRMSE\) of lcHOSVD and lcSVD as the number of sensor planes in theyydirection \(ns,yn\_\{s,y\}\) is progressively reduced while keeping the sensor counts alongxxandzzfixed\. 50,50 and 20 modes are retained in thexx,yy,zzdirections for the streamwise velocity \(uu\), normal velocity \(vv\), turbulent kinetic energy \(kk\), carbon monoxide \(CO\), nitrogen oxides \(NOx\), and particulate matter \(PM\), and 60,60,20 for the spanwise velocity \(ww\) and pressure \(pp\), while the lcSVD formulation retains 30 and 40 modes, respectively\. The top row presents, from left to right,uu,vv,ww, andpp\. The bottom row presentskk, CO, NOx, and PM\.![Refer to caption](https://arxiv.org/html/2606.24989v1/anisotropy_uvw.png)Figure 10:Sensor anisotropy study for the two\-building case\. Relative root\-mean\-square error \(RRMSE\) of lcHOSVD and lcSVD for decreasing numbers of sensor planes in theyydirection \(ns,yn\_\{s,y\}\)\. 19,14 and 15 modes are retained along thexx,yy, andzzdirections for all velocity components, while the lcSVD formulation retains 8 modes\. From left to right, streamwise velocity \(uu\), normal velocity \(vv\), and spanwise velocity \(ww\)\.This situation is frequently encountered in practical monitoring campaigns, where sensor placement is constrained by accessibility, infrastructure, and cost considerations\. As a result, measurements are often more densely distributed in some spatial directions than in others\. By preserving the tensor structure of the data and treating each dimension independently, lcHOSVD can better accommodate such anisotropies, explaining its improved reconstruction performance compared with lcSVD\.

### 4\.2Qualitative Comparison of Reconstructed Fields

The reconstructed snapshots for the Vallecas dataset are presented in Fig\.[11](https://arxiv.org/html/2606.24989#S4.F11)for thekkandPM\\mathrm\{PM\}fields, while the remaining variables are provided in Appendix[A](https://arxiv.org/html/2606.24989#A1)\. In both cases, the low\-cost input contains only a sparse representation of the original field, requiring the dominant spatial structures to be recovered through the reduced\-order reconstruction\. Both methods reconstruct the large\-scale distribution of the variables, reproducing the principal gradients and features observed in the reference solution\.

LcHOSVD captures the dominant structures ofkkandPM\\mathrm\{PM\}slightly better and produces smoother spatial fields\. Both methods exhibit reconstruction artefacts arising from the sparse spatial sampling; however, these artefacts appear less pronounced in the lcHOSVD reconstructions\. The lcSVD results display a scattered appearance, particularly in regions of strong spatial variation\. Nevertheless, the overall agreement between the two methods remains high, confirming that both approaches recover the dominant behaviour of the urban\-flow fields despite the substantial spatial compression employed\.

![Refer to caption](https://arxiv.org/html/2606.24989v1/k_row2_lowcost_lchosvd_lcsvd.png)

![Refer to caption](https://arxiv.org/html/2606.24989v1/PM_row2_lowcost_lchosvd_lcsvd.png)

Figure 11:From left to right: comparison of the low\-cost input, lcHOSVD reconstruction, and lcSVD reconstruction for the turbulent kinetic energy \(kk\) and particulate matter \(PM\) fields in the Vallecas urban\-flow dataset, corresponding to 50,000 sensor locations out of the7\.5×1067\.5\\times 10^\{6\}grid points \(CF=150×CF=150\\times\), with\(50,50,20\)\(50,50,20\)modes for lcHOSVD and 30 modes for lcSVD\. The lines mark where terrain or buildings rise above the z = 5 m AGL plane\.The corresponding results for the two\-building dataset are shown in Fig\.[12](https://arxiv.org/html/2606.24989#S4.F12)\. Foruu, both lcHOSVD and lcSVD reproduce the wake deficit and the downstream recovery of the flow with comparable accuracy\. Differences become more evident in the transverse velocity components\. In the normal velocity field, lcHOSVD provides a clearer reconstruction of the alternating flow structures generated by the interaction of the obstacle wakes, whereas the lcSVD reconstruction is more diffused\. This behaviour is even more apparent forww, where lcHOSVD preserves a greater portion of the coherent structures and sharper gradients visible in the reference field\. Both methods recover the dominant flow patterns successfully, although the preservation of the tensor structure in lcHOSVD contributes to improved reconstruction of more complex turbulent flows\.

![Refer to caption](https://arxiv.org/html/2606.24989v1/u_row2_lowcost_lchosvd_lcsvd.png)

![Refer to caption](https://arxiv.org/html/2606.24989v1/v_row2_lowcost_lchosvd_lcsvd.png)

![Refer to caption](https://arxiv.org/html/2606.24989v1/w_row2_lowcost_lchosvd_lcsvd.png)

Figure 12:From left to right: comparison of the low\-cost input, lcHOSVD reconstruction, and lcSVD reconstruction\. from top to bottom: streamwise \(uu\), wall\-normal \(vv\), and spanwise \(ww\) velocity components att=224t=224in the two\-building dataset, corresponding to 25,000 sensor locations out of the 625,000 grid points \(CF=25×CF=25\\times\), with\(19,14,15\)\(19,14,15\)modes for lcHOSVD and 8 modes for lcSVD\.
### 4\.3Quantitative Comparison of Reconstruction Accuracy and Speed\-Up

The reconstruction accuracy and computational performance of the low\-cost decompositions are reported in Tables[2](https://arxiv.org/html/2606.24989#S4.T2)and[3](https://arxiv.org/html/2606.24989#S4.T3)\. For the Vallecas dataset, the results are reported for all variables, including the standard HOSVD reference, lcHOSVD, and lcSVD\. For the two\-building dataset, the comparison is restricted to lcHOSVD and lcSVD, and the error is reported relative to the HOSVD reconstruction\. This is appropriate for the turbulent case, where the HOSVD reconstruction represents the reduced\-order reference solution\.

Table 2:Reconstruction error and timing for HOSVD, lcHOSVD, and lcSVD on the Vallecas case\.Table 3:RRMSE, computational time, and speed\-up for the two\-building dataset\.A clear trade\-off is observed between reconstruction accuracy and computational efficiency\. In both datasets, lcSVD provides substantially larger computational savings, whereas lcHOSVD consistently achieves lower reconstruction errors by preserving the multi\-dimensional structure of the original data during the decomposition process\. The difference in reconstruction accuracy for the urban flow case between the two approaches remains relatively modest\. The largest discrepancy is observed for the PM field, where the RRMSE \(Eq\. \([21](https://arxiv.org/html/2606.24989#S2.E21)\)\) increases from 17\.48% for lcHOSVD to 22\.66% for lcSVD, while for thekkfield the increase is from 11\.10% to 13\.25%\. The comparatively high RRMSE values obtained for theww,pp, and pollutant fields are a consequence of the low magnitude of these variables over large portions of the domain\. As the RRMSE is divided by zero or near\-zero values, it leads to disproportionately large relative errors, thereby inflating the overall RRMSE between the reconstructed and reference fields\. Despite these differences, lcSVD provides a substantial computational advantage\. Depending on the variable considered, the speed\-up increases from approximately 51×\\timesto 72×\\timesfor lcSVD, compared with only 3\.6×\\timesto 4\.0×\\timesfor lcHOSVD\. Given that the urban\-flow dataset contains a single temporal step, the increase in reconstruction error remains relatively small compared with the significant reduction in computational cost, making lcSVD an attractive option when rapid reconstruction is the primary objective\.

The two\-building case shows a different balance between the two methods\. With respect to the standard HOSVD reference, lcHOSVD increases the RRMSE by only 0\.28 percentage points foruu, 2\.72 forvv, and 1\.35 forww, while lcSVD adds 3\.08, 15\.73, and 16\.47 points, respectively\. In the direct comparison, lcHOSVD is therefore more accurate than lcSVD by 2\.80, 13\.01, and 15\.12 percentage points, while lcSVD remains 14\-18 times faster \(mean runtimes of 1\.42 s against 22\.62 s, corresponding to speed\-ups of 34×\\timesand 2\.2×\\timeswith respect to HOSVD\)\. The larger accuracy gap for thevvandwwcomponents indicates that the tensor formulation gains importance as the directional anisotropy of the flow increases\. Although lcSVD retains a clear computational advantage, the better reconstruction obtained with lcHOSVD forvvandwwshows that preserving the multi\-way structure of the data is beneficial when representing the wake dynamics characteristic of turbulent building flows\.

The difference between the two methods can also be quantified through the storage cost in the reduced representation\. For the Vallecas dataset, the core tensor combines the directional modes, so the entire storage spans50×50×2050\\times 50\\times 20admissible mode combinations, whereas the 30 lcSVD modes provide exactly 30 global spatial patterns\. In other words, the tensor representation retains a substantially larger set of admissible reduced\-order interactions as compared to lcSVD\. This bigger set of modal interactions allows lcHOSVD to capture more complex spatial structures and directional dependencies, which contribute to its lower reconstruction errors\. The computational cost follows the opposite logic, where lcSVD computes a single SVD of one small subsampled matrix, while lcHOSVD requires one SVD per spatial direction plus the core requirements, which is why the lcSVD speed\-ups exceed those of lcHOSVD despite its smaller parameter count\.

### 4\.4Q\-criterion

Figure[13](https://arxiv.org/html/2606.24989#S4.F13)compares the standardized Q\-criterion isosurfaces \(Eq\. \([22](https://arxiv.org/html/2606.24989#S2.E22)\)\) obtained from the reference solution, lcHOSVD reconstruction, and lcSVD reconstruction\. The Q\-criterion is standardized using the statistics of the reference field and visualized using identical thresholds and colour scales\. A zoomed view of the Vallecas domain is presented to improve visual clarity and allow a more detailed inspection of the coherent vortical structures present within the urban canopy\. The Q\-criterion comparison suggests that both reconstruction methods capture the dominant large\-scale vortical structures present in the reference solution\. However, the reconstructed fields appear to contain fewer Q\-criterion isosurfaces than the original CFD data, indicating a partial loss of smaller\-scale coherent structures\. This effect seems to be less pronounced for lcHOSVD, which preserves a larger fraction of the vortical features visible in the reference field\. Although lcHOSVD captures more features than lcSVD, both approaches provide a comparable representation of the large\-scale coherent motions present in the urban flow\.

![Refer to caption](https://arxiv.org/html/2606.24989v1/qcriterion_3way_zoom.png)
![Refer to caption](https://arxiv.org/html/2606.24989v1/qcriterion_standardized_isosurfaces.png)

Figure 13:From left to right: Comparison of standardized Q\-criterion isosurfaces for the Vallecas dataset\(top\) and the two\-building dataset \(bottom\) obtained from the ground\-truth, lcHOSVD reconstruction, and lcSVD reconstruction, respectively\.The behaviour is more clearly differentiated in the two\-building case, where the ground truth contains more complex turbulent structures\. The reconstructed Q\-criterion isosurfaces show that the lcHOSVD reconstruction preserves the shape, continuity, and spatial distribution of these vortical structures more accurately, while capturing the larger features\. The lcSVD reconstruction exhibits a greater degree of fragmentation and loss of features\. This observation agrees with the reconstruction\-error analysis of the velocity components, particularly for the transverse and vertical velocities, where the tensor\-based formulation outperformed the matrix\-based approach along the normal and spanwise directions\. The results indicate that the advantages of preserving the multidimensional structure of the data become increasingly important as the flow complexity and turbulence content increase\.

From a practical standpoint, the subdomain shown in Fig\.[15](https://arxiv.org/html/2606.24989#S4.F15), extracted from the urban domain presented in Fig\.[14](https://arxiv.org/html/2606.24989#S4.F14), suggests that an urban area of approximately 400 m × 400 m can be adequately characterized using around 9 sensors along each horizontal direction and 20 sensors along the vertical direction\. This corresponds to one measurement every 44 m in the X\-Y plane\. This sampling density is feasible in real urban deployments at a local scale\. Low\-cost air quality and wind sensors mounted on lamp posts, traffic lights, and rooftops, complemented by a small number of vertical profiles from sodars or drone\-based surveys, can supply the input fields required by the reduced\-order model without dense instrumentation\. The vertical resolution is the most demanding part of the requirement, but it can be relaxed in operational settings by placing measurements at carefully chosen heights, since the dominant flow features and pollutant gradients are concentrated near the surface and within the urban canopy\. The combination of sparse spatial sampling and modal reconstruction makes the approach suitable for city\-scale wind and air quality monitoring, where dense reference data is rarely available\.

![Refer to caption](https://arxiv.org/html/2606.24989v1/roi_location_full_domain.png)Figure 14:Location of the 400×\\times400m area \(red box\) within the Vallecas domain, centred at\(X,Y\)=\(75,150\)\(X,Y\)=\(75,150\)m\.![Refer to caption](https://arxiv.org/html/2606.24989v1/qcriterion_3way_zoom_standardized.png)Figure 15:Q\-criterion isosurfaces within the 400×\\times400m region\. From left to right: ground\-truth, lcHOSVD reconstruction, and lcSVD reconstruction\.

## 5\. Conclusions

This study introduced low\-cost variants of SVD and HOSVD for the reconstruction of high\-dimensional fluid\-flow and urban air\-quality datasets from sparse sensor measurements\. The proposed methodology performs the decomposition using spatially reduced observations and subsequently reconstructs the full\-resolution fields, enabling accurate recovery of complex flow structures while significantly reducing the computational burden associated with conventional modal decompositions\.

The approach was assessed using datasets with distinct dynamical characteristics\. For the urban\-flow and pollutant\-transport case, both lcSVD and lcHOSVD successfully reproduced the dominant flow and concentration patterns using only a small fraction of the available spatial locations\. Reconstruction differences between the two formulations remained relatively modest, with lcHOSVD providing slightly cleaner reconstructions and lcSVD achieving substantially larger computational speed\-ups\.

Different behaviour was observed for the two\-building configuration\. Although lcSVD remained considerably faster, reconstruction errors increased more noticeably for the transverse and vertical velocity components\. The advantages of the tensor\-based formulation became particularly evident in the recovery of coherent vortical structures and small\-scale turbulent dynamics, where lcHOSVD consistently produced more accurate reconstructions\. These results demonstrate that preserving the multidimensional structure of the data becomes increasingly important as the complexity, unsteadiness, and coupling of the underlying dynamics increase\.

The sensor\-anisotropy study further demonstrated that the advantages of lcHOSVD extend beyond reconstruction accuracy alone\. While lcSVD is fundamentally constrained by the most sparsely sampled direction, lcHOSVD degrades more gracefully because each spatial direction is represented independently through its own factor matrix\. This property makes the tensor formulation more robust to anisotropic sampling and non\-uniform sensor distributions, situations that commonly arise in practical monitoring networks due to physical, logistical, and economic constraints\.

The principal contribution of this work is the introduction of lcHOSVD, a novel sparse\-sensing tensor\-reconstruction framework that extends low\-cost reduced\-order modelling of multidimensional datasets beyond matrix\-based formulations\. To the authors’ knowledge, this is the first methodology capable of performing HOSVD\-based field reconstruction directly from sparse sensor observations while retaining the advantages of tensor representations\. By preserving the natural multidimensional organisation of the data, lcHOSVD exploits correlations across spatial, temporal, and physical\-variable dimensions that cannot be fully captured using matrix\-based approaches\.

This work also presents the first application of low\-cost SVD\- and HOSVD\-based reconstruction techniques to urban flow and air\-quality datasets\. The comparative analysis provides practical guidelines for selecting between matrix\- and tensor\-based formulations\. When computational speed is the primary requirement, and the underlying dynamics remain relatively simple, lcSVD offers an attractive solution\. Conversely, lcHOSVD provides a more robust framework for strongly unsteady and multidimensional problems, particularly when different spatial directions exhibit distinct levels of complexity or sensor availability\.

Beyond reconstruction, the proposed methodologies offer significant potential for data assimilation, digital twins, environmental monitoring, and forecasting applications, where complete flow and concentration fields must be estimated from limited observations\. The ability to recover accurate high\-dimensional states from sparse sensor networks opens new opportunities for integrating experimental measurements with large\-scale numerical simulations in urban and environmental systems\.

Future work will focus on incorporating the proposed framework into data\-assimilation and digital\-twin environments for urban\-flow and air\-quality forecasting\. Additional developments will investigate low\-cost tensor\-based modal decomposition techniques, including a low\-cost Higher\-Order Dynamic Mode Decomposition \(lcHODMD\), adaptive sensor\-placement strategies, and the use of low\-cost modal representations as compressed inputs for deep\-learning models\. Further applications to larger multi\-physics and environmental datasets will also be explored\.

## Conflicts of Interest

The authors declare no conflict of interest\.

## Code and Data Availability

## Acknowledgments

The authors acknowledge the MODELAIR project that has received funding from the European Union’s Horizon Europe research and innovation programme under the Marie Sklodowska\-Curie grant agreement No\. 101072559\. The results of this publication reflect only the author’s view and do not necessarily reflect those of the European Union\. The European Union cannot be held responsible for them\. S\.L\.C\. acknowledges the grant PID2023\-147790OB\-I00 funded by MCIU/AEI/10\.13039 /501100011033 /FEDER, UE\. The authors gratefully acknowledge the Universidad Politécnica de Madrid \(www\.upm\.es\) for providing computing resources on the Magerit Supercomputer\.

## Appendix AAppendix

The remaining reconstruction and singular value decay results for the Vallecas and the two\-building urban\-flow dataset are presented in the section below\.

### A\.1Additional Reconstruction Results for the Vallecas Dataset

The figures include comparisons of the low\-cost input, lcHOSVD reconstruction, and lcSVD reconstruction for the velocity components \(uu,vv, andww\), pressure \(pp\), carbon monoxide \(CO\), and nitrogen oxides \(NOx\)\. These variables exhibit trends similar to those discussed in the main text, where both low\-cost approaches recover the dominant flow and pollutant transport patterns\. The corresponding reconstruction results are shown in Fig\.[A1](https://arxiv.org/html/2606.24989#A1.F1)and[A2](https://arxiv.org/html/2606.24989#A1.F2), respectively\. The second section presents the singular value decay plots\.

![Refer to caption](https://arxiv.org/html/2606.24989v1/u_row2_lowcost_lchosvd_lcsvd1.png)

![Refer to caption](https://arxiv.org/html/2606.24989v1/v_row2_lowcost_lchosvd_lcsvd1.png)

![Refer to caption](https://arxiv.org/html/2606.24989v1/w_row2_lowcost_lchosvd_lcsvd1.png)

![Refer to caption](https://arxiv.org/html/2606.24989v1/p_row2_lowcost_lchosvd_lcsvd.png)

Figure A1:From left to right: low\-cost input, lcHOSVD reconstruction, and lcSVD reconstruction for the physical flow variables in the Vallecas dataset, corresponding to 50,000 sensor locations out of the7\.5×1067\.5\\times 10^\{6\}grid points \(CF=150×CF=150\\times\), with\(50,50,20\)\(50,50,20\)modes for lcHOSVD and 30 modes for lcSVD\. From top to bottom: streamwise velocity, normal velocity, spanwise velocity, and pressure\. The lines mark where terrain or buildings rise above thez=5z=5m AGL plane\.![Refer to caption](https://arxiv.org/html/2606.24989v1/CO_row2_lowcost_lchosvd_lcsvd.png)

![Refer to caption](https://arxiv.org/html/2606.24989v1/NOx_row2_lowcost_lchosvd_lcsvd.png)

Figure A2:From left to right: low\-cost input, lcHOSVD reconstruction, and lcSVD reconstruction for the CO and NOx pollutant fields in the Vallecas dataset, corresponding to 50,000 sensor locations out of the7\.5×1067\.5\\times 10^\{6\}grid points \(CF=150×CF=150\\times\), with\(50,50,20\)\(50,50,20\)modes for lcHOSVD and 30 modes for lcSVD\. From top to bottom: CO and NOx\.
### A\.2Singular\-value decay

The singular\-value decay curves for all remaining variables are shown in Fig\.[A3](https://arxiv.org/html/2606.24989#A1.F3)and[A4](https://arxiv.org/html/2606.24989#A1.F4)\.

![Refer to caption](https://arxiv.org/html/2606.24989v1/u_sv_decay.png)\(a\)uu
![Refer to caption](https://arxiv.org/html/2606.24989v1/v_sv_decay.png)\(b\)vv
![Refer to caption](https://arxiv.org/html/2606.24989v1/w_sv_decay.png)\(c\)ww
![Refer to caption](https://arxiv.org/html/2606.24989v1/p_sv_decay.png)\(d\)pp
![Refer to caption](https://arxiv.org/html/2606.24989v1/CO_sv_decay.png)\(e\)CO
![Refer to caption](https://arxiv.org/html/2606.24989v1/NOx_sv_decay.png)\(f\)NOx

Figure A3:Singular\-value decayσk/σ0\\sigma\_\{k\}/\\sigma\_\{0\}for the remaining variables of the Vallecas dataset \(uu,vv,ww,pp, CO, NOx\)\.![Refer to caption](https://arxiv.org/html/2606.24989v1/u_sv_decay1.png)\(a\)uu
![Refer to caption](https://arxiv.org/html/2606.24989v1/v_sv_decay2.png)\(b\)vv
![Refer to caption](https://arxiv.org/html/2606.24989v1/w_sv_decay3.png)\(c\)ww

Figure A4:Singular\-value decayσk/σ0\\sigma\_\{k\}/\\sigma\_\{0\}for the two\-building datset \(uu,vv,ww\.\)

## References

- \[1\]R\. Abadía\-Heredia, M\. López\-Martín, B\. Carro, J\. I\. Arribas, J\. M\. Pérez, and S\. Le Clainche\(2022\)A predictive hybrid reduced order model based on proper orthogonal decomposition combined with deep learning architectures\.Expert Systems with Applications187,pp\. 115910\.Cited by:[§1](https://arxiv.org/html/2606.24989#S1.p2.1)\.
- \[2\]Ayuntamiento de Madrid\(2025\)Geoportal – red de vigilancia de la calidad del aire\.Note:[https://geoportal\.madrid\.es/IDEAM\_WBGEOPORTAL/index\.iam](https://geoportal.madrid.es/IDEAM_WBGEOPORTAL/index.iam)Cited by:[§3\.1](https://arxiv.org/html/2606.24989#S3.SS1.p2.1)\.
- \[3\]G\. Berkooz, P\. Holmes, and J\. L\. Lumley\(1993\)The proper orthogonal decomposition in the analysis of turbulent flows\.Annual review of fluid mechanics25\(1\),pp\. 539–575\.Cited by:[§1](https://arxiv.org/html/2606.24989#S1.p2.1)\.
- \[4\]B\. Blocken\(2015\)Computational fluid dynamics for urban physics: importance, scales, possibilities, limitations and ten tips and tricks towards accurate and reliable simulations\.Building and environment91,pp\. 219–245\.Cited by:[§1](https://arxiv.org/html/2606.24989#S1.p1.2),[§3\.1](https://arxiv.org/html/2606.24989#S3.SS1.p2.1)\.
- \[5\]G\. A\. Brès, P\. Jordan, V\. Jaunet, M\. Le Rallic, A\. V\. Cavalieri, A\. Towne, S\. K\. Lele, T\. Colonius, and O\. T\. Schmidt\(2018\)Importance of the nozzle\-exit boundary\-layer state in subsonic turbulent jets\.Journal of Fluid Mechanics851,pp\. 83–124\.Cited by:[§1](https://arxiv.org/html/2606.24989#S1.p1.2)\.
- \[6\]S\. L\. Clainche, J\. M\. Pérez, and J\. M\. Vega\(2018\)Spatio\-temporal flow structures in the three\-dimensional wake of a circular cylinder\.Fluid Dynamics Research50\(5\),pp\. 051406\.Cited by:[§1](https://arxiv.org/html/2606.24989#S1.p1.2)\.
- \[7\]L\. De Lathauwer, B\. De Moor, and J\. Vandewalle\(2000\)A multilinear singular value decomposition\.SIAM journal on Matrix Analysis and Applications21\(4\),pp\. 1253–1278\.Cited by:[§1](https://arxiv.org/html/2606.24989#S1.p4.1)\.
- \[8\]L\. De Lathauwer, B\. De Moor, and J\. Vandewalle\(2000\)On the best rank\-1 and rank\-\(r 1, r 2,…, rn\) approximation of higher\-order tensors\.SIAM journal on Matrix Analysis and Applications21\(4\),pp\. 1324–1342\.Cited by:[§1](https://arxiv.org/html/2606.24989#S1.p4.1)\.
- \[9\]B\. M\. De Silva, K\. Manohar, E\. Clark, B\. W\. Brunton, S\. L\. Brunton, and J\. N\. Kutz\(2021\)PySensors: a python package for sparse sensor placement\.arXiv preprint arXiv:2102\.13476\.Cited by:[§1](https://arxiv.org/html/2606.24989#S1.p6.1)\.
- \[10\]S\. Ding and R\. Yang\(2021\)Reduced\-order modelling of urban wind environment and gaseous pollutants dispersion in an urban\-scale street canyon\.Journal of Safety Science and Resilience2\(4\),pp\. 238–245\.Cited by:[§1](https://arxiv.org/html/2606.24989#S1.p3.1)\.
- \[11\]E\. Economista\(2024\-10,\)La gran transformación dentro de Vallecas: de zona degradada a nuevo barrio con 1\.400 viviendas y una residencia de estudiantes\.Note:[https://www\.eleconomista\.es/vivienda\-inmobiliario/noticias/13060987/10/24/la\-gran\-transformacion\-dentro\-de\-vallecas\-de\-zona\-degradada\-a\-nuevo\-barrio\-con\-1400\-viviendas\-y\-una\-residencia\-de\-estudiantes\.html](https://www.eleconomista.es/vivienda-inmobiliario/noticias/13060987/10/24/la-gran-transformacion-dentro-de-vallecas-de-zona-degradada-a-nuevo-barrio-con-1400-viviendas-y-una-residencia-de-estudiantes.html)Cited by:[§3\.1](https://arxiv.org/html/2606.24989#S3.SS1.p1.1)\.
- \[12\]European Environment Agency\(2023\)EMEP/EEA air pollutant emission inventory guidebook 2023\.Technical reportEuropean Environment Agency,Copenhagen, Denmark\.External Links:[Link](https://www.eea.europa.eu/emep-eea-guidebook)Cited by:[§3\.1](https://arxiv.org/html/2606.24989#S3.SS1.p6.4)\.
- \[13\]European Environment Agency\(2024\)Europe’s Air Quality Status 2024\.Technical reportEuropean Environment Agency,Copenhagen, Denmark\.Note:Available at:[https://www\.eea\.europa\.eu/en/analysis/publications/europes\-air\-quality\-status\-2024](https://www.eea.europa.eu/en/analysis/publications/europes-air-quality-status-2024)Cited by:[§1](https://arxiv.org/html/2606.24989#S1.p1.2)\.
- \[14\]European Environment Agency\(2024\)Sustainability of Europe’s mobility systems\.Technical reportEuropean Environment Agency,Copenhagen, Denmark\.Note:Available at:[https://www\.eea\.europa\.eu/en/analysis/publications/sustainability\-of\-europes\-mobility\-systems](https://www.eea.europa.eu/en/analysis/publications/sustainability-of-europes-mobility-systems)Cited by:[§1](https://arxiv.org/html/2606.24989#S1.p1.2)\.
- \[15\]C\. Fang and L\. Hong\(2018\)Particle image velocimetry for combustion measurements: applications and developments\.Chinese Journal of Aeronautics31\(7\),pp\. 1407–1427\.Cited by:[§1](https://arxiv.org/html/2606.24989#S1.p1.2)\.
- \[16\]G\. H\. Golub and C\. Reinsch\(1971\)Singular value decomposition and least squares solutions\.InLinear algebra,pp\. 134–151\.Cited by:[§1](https://arxiv.org/html/2606.24989#S1.p2.1)\.
- \[17\]D\. Hargreaves and N\. G\. Wright\(2007\)On the use of the k–ε\\varepsilonmodel in commercial cfd software to model the neutral atmospheric boundary layer\.Journal of wind engineering and industrial aerodynamics95\(5\),pp\. 355–369\.Cited by:[§3\.1](https://arxiv.org/html/2606.24989#S3.SS1.p5.2)\.
- \[18\]A\. Hetherington, A\. Corrochano, R\. Abadía\-Heredia, E\. Lazpita, E\. Muñoz, P\. Díaz, E\. Maiora, M\. López\-Martín, and S\. Le Clainche\(2024\)ModelFLOWs\-app: data\-driven post\-processing and reduced order modelling tools\.Computer Physics Communications301,pp\. 109217\.Cited by:[§2\.1](https://arxiv.org/html/2606.24989#S2.SS1.p6.1)\.
- \[19\]A\. Hetherington and S\. Le Clainche\(2025\)Low\-cost singular value decomposition with optimal sensor placement\.Physics of Fluids37\(8\)\.Cited by:[§1](https://arxiv.org/html/2606.24989#S1.p6.1),[§2\.3\.1](https://arxiv.org/html/2606.24989#S2.SS3.SSS1.p3.3),[§2\.3](https://arxiv.org/html/2606.24989#S2.SS3.p1.1)\.
- \[20\]P\. Holmes\(2012\)Turbulence, coherent structures, dynamical systems and symmetry\.Cambridge university press\.Cited by:[§1](https://arxiv.org/html/2606.24989#S1.p2.1)\.
- \[21\]J\. C\. Hunt, A\. A\. Wray, and P\. Moin\(1988\)Eddies, streams, and convergence zones in turbulent flows\.Studying turbulence using numerical simulation databases, 2\. Proceedings of the 1988 summer program\.Cited by:[§2\.5\.2](https://arxiv.org/html/2606.24989#S2.SS5.SSS2.p1.1)\.
- \[22\]P\. Jeanney, C\. García\-Sánchez, S\. Saiz, J\. M\. Pérez, and S\. Le Clainche\(2026\)Large\-scale cfd analysis of urban airflow and pollutant transport over a 24\-hour period with pod analysis: the vallecas district \(madrid\) case study\.arXiv preprint\.Cited by:[§3\.1](https://arxiv.org/html/2606.24989#S3.SS1.p1.1)\.
- \[23\]P\. Jeanney, A\. Hetherington, S\. E\. Ahmed, D\. Lanceta, S\. Saiz, J\. M\. Perez, and S\. L\. Clainche\(2025\)Ensemble kalman filter for data assimilation coupled with low\-resolution computations techniques applied in fluid dynamics\.arXiv preprint arXiv:2507\.00539\.Cited by:[§1](https://arxiv.org/html/2606.24989#S1.p7.1)\.
- \[24\]T\. G\. Kolda and B\. W\. Bader\(2009\)Tensor decompositions and applications\.SIAM review51\(3\),pp\. 455–500\.Cited by:[§1](https://arxiv.org/html/2606.24989#S1.p4.1)\.
- \[25\]B\. E\. Launder and D\. B\. Spalding\(1983\)The numerical computation of turbulent flows\.InNumerical prediction of flow, heat transfer, turbulence and combustion,pp\. 96–116\.Cited by:[§3\.1](https://arxiv.org/html/2606.24989#S3.SS1.p5.2)\.
- \[26\]S\. Le Clainche and J\. M\. Vega\(2017\)Higher order dynamic mode decomposition\.SIAM Journal on Applied Dynamical Systems16\(2\),pp\. 882–925\.Cited by:[§1](https://arxiv.org/html/2606.24989#S1.p4.1)\.
- \[27\]Y\. Liu, C\. Liu, G\. P\. Brasseur, and C\. Y\. Chao\(2023\)Proper orthogonal decomposition of large\-eddy simulation data over real urban morphology\.Sustainable Cities and Society89,pp\. 104324\.Cited by:[§1](https://arxiv.org/html/2606.24989#S1.p2.1),[§1](https://arxiv.org/html/2606.24989#S1.p3.1)\.
- \[28\]J\. L\. Lumley\(1967\)The structure of inhomogeneous turbulent flows\.Atmospheric turbulence and radio wave propagation,pp\. 166–178\.Cited by:[§1](https://arxiv.org/html/2606.24989#S1.p2.1)\.
- \[29\]K\. Manohar, B\. W\. Brunton, J\. N\. Kutz, and S\. L\. Brunton\(2018\)Data\-driven sparse sensor placement for reconstruction: demonstrating the benefits of exploiting known patterns\.IEEE Control Systems Magazine38\(3\),pp\. 63–86\.Cited by:[§1](https://arxiv.org/html/2606.24989#S1.p6.1)\.
- \[30\]Á\. Martínez\-Sánchez, E\. Lazpita, A\. Corrochano, S\. Le Clainche, S\. Hoyas, and R\. Vinuesa\(2023\)Data\-driven assessment of arch vortices in simplified urban flows\.International Journal of Heat and Fluid Flow100,pp\. 109101\.Cited by:[§3\.2](https://arxiv.org/html/2606.24989#S3.SS2.p1.3)\.
- \[31\]S\. Masoumi\-Verki, F\. Haghighat, and U\. Eicker\(2022\)A review of advances towards efficient reduced\-order models \(rom\) for predicting urban airflow and pollutant dispersion\.Building and Environment216,pp\. 108966\.Cited by:[§1](https://arxiv.org/html/2606.24989#S1.p3.1)\.
- \[32\]M\. A\. Mendez, D\. Hess, B\. B\. Watz, and J\. Buchlin\(2020\)Multiscale proper orthogonal decomposition \(mpod\) of tr\-piv data—a case study on stationary and transient cylinder wake flows\.Measurement Science and Technology31\(9\),pp\. 094014\.Cited by:[§1](https://arxiv.org/html/2606.24989#S1.p1.2)\.
- \[33\]F\. M\. Nav, R\. Snaiki, and F\. Ding\(2026\)A hierarchical machine learning framework for real\-time reconstruction of urban wind fields from sparse sensor networks\.Building and Environment,pp\. 114875\.Cited by:[§1](https://arxiv.org/html/2606.24989#S1.p6.1)\.
- \[34\]I\. Pađen, C\. García\-Sánchez, and H\. Ledoux\(2022\)Towards automatic reconstruction of 3d city models tailored for urban flow simulations\.Frontiers in Built Environment8,pp\. 899332\.Cited by:[§3\.1](https://arxiv.org/html/2606.24989#S3.SS1.p2.1)\.
- \[35\]A\. Parente, C\. Gorlé, J\. van Beeck, and C\. Benocci\(2011\)A comprehensive modelling approach for the neutral atmospheric boundary layer: consistent inflow conditions, wall function and turbulence model\.Boundary\-layer meteorology140\(3\),pp\. 411–428\.Cited by:[§3\.1](https://arxiv.org/html/2606.24989#S3.SS1.p5.2)\.
- \[36\]S\. V\. Patankar and D\. B\. Spalding\(1983\)A calculation procedure for heat, mass and momentum transfer in three\-dimensional parabolic flows\.InNumerical prediction of flow, heat transfer, turbulence and combustion,pp\. 54–73\.Cited by:[§3\.1](https://arxiv.org/html/2606.24989#S3.SS1.p3.1)\.
- \[37\]S\. Patro and K\. K\. Sahu\(2015\)Normalization: a preprocessing stage\.arXiv preprint arXiv:1503\.06462\.Cited by:[§2\.2](https://arxiv.org/html/2606.24989#S2.SS2.p1.2)\.
- \[38\]P\. Pillai, A\. I\. Hetherington, L\. Saavedra, and S\. Le Clainche\(2025\)A low\-cost singular value decomposition\-based data assimilation technique for analysis of heterogeneous combustion data\.Physics of Fluids37\(8\)\.Cited by:[§1](https://arxiv.org/html/2606.24989#S1.p7.1)\.
- \[39\]P\. J\. Schmid\(2010\)Dynamic mode decomposition of numerical and experimental data\.Journal of fluid mechanics656,pp\. 5–28\.Cited by:[§1](https://arxiv.org/html/2606.24989#S1.p2.1)\.
- \[40\]Y\. Tominaga and T\. Stathopoulos\(2016\)Ten questions concerning modeling of near\-field pollutant dispersion in the built environment\.Building and Environment105,pp\. 390–402\.Cited by:[§1](https://arxiv.org/html/2606.24989#S1.p1.2)\.
- \[41\]L\. Vervecken, J\. Camps, and J\. Meyers\(2015\)Stable reduced\-order models for pollutant dispersion in the built environment\.Building and Environment92,pp\. 360–367\.Cited by:[§1](https://arxiv.org/html/2606.24989#S1.p3.1)\.
- \[42\]L\. Vervisch and T\. Poinsot\(1998\)Direct numerical simulation of non\-premixed turbulent flames\.Annual review of fluid mechanics30\(1\),pp\. 655–691\.Cited by:[§1](https://arxiv.org/html/2606.24989#S1.p1.2)\.
- \[43\]P\. Virtanen, R\. Gommers, T\. E\. Oliphant, M\. Haberland, T\. Reddy, D\. Cournapeau, E\. Burovski, P\. Peterson, W\. Weckesser, J\. Bright,et al\.\(2020\)SciPy 1\.0: fundamental algorithms for scientific computing in python\.Nature methods17\(3\),pp\. 261–272\.Cited by:[§3\.1](https://arxiv.org/html/2606.24989#S3.SS1.p8.3)\.
- \[44\]A\. Vishwasrao, S\. B\. C\. Gutha, A\. Cremades, K\. Wijk, A\. Patil, C\. Gorle, B\. J\. McKeon, H\. Azizpour, and R\. Vinuesa\(2025\)Diff\-sport: diffusion\-based sensor placement optimization and reconstruction of turbulent flows in urban environments\.arXiv preprint arXiv:2506\.00214\.Cited by:[§1](https://arxiv.org/html/2606.24989#S1.p6.1)\.
- \[45\]H\. G\. Weller, G\. Tabor, H\. Jasak, and C\. Fureby\(1998\)A tensorial approach to computational continuum mechanics using object\-oriented techniques\.Computers in physics12\(6\),pp\. 620–631\.Cited by:[§3\.1](https://arxiv.org/html/2606.24989#S3.SS1.p3.1)\.
- \[46\]P\. Wu, Z\. Qin, and Y\. Yang\(2026\)LLM\-rom: a novel framework for efficient spatiotemporal prediction of urban pollutant dispersion\.AI7\(3\),pp\. 104\.Cited by:[§1](https://arxiv.org/html/2606.24989#S1.p3.1)\.
- \[47\]S\. Xiang, X\. Fu, J\. Zhou, Y\. Wang, Y\. Zhang, X\. Hu, J\. Xu, H\. Liu, J\. Liu, J\. Ma,et al\.\(2021\)Non\-intrusive reduced order model of urban airflow with dynamic boundary conditions\.Building and Environment187,pp\. 107397\.Cited by:[§1](https://arxiv.org/html/2606.24989#S1.p2.1),[§1](https://arxiv.org/html/2606.24989#S1.p3.1)\.
- \[48\]D\. Xiao, C\. Heaney, L\. Mottet, F\. Fang, W\. Lin, I\. Navon, Y\. Guo, O\. Matar, A\. Robins, and C\. Pain\(2019\)A reduced order model for turbulent flows in the urban environment using machine learning\.Building and Environment148,pp\. 323–337\.Cited by:[§1](https://arxiv.org/html/2606.24989#S1.p2.1),[§1](https://arxiv.org/html/2606.24989#S1.p3.1)\.
Low-Cost High-Order Singular Value Decomposition for Tensor-Based Reconstruction from Sparse Sensor Measurements: Urban Flow and Air-Quality Applications

Similar Articles

Geodesic Flow Matching for Denoising High-Dimensional Structured Representations

Two-Valued Symmetric Circulant Matrices: Applications in Deep Learning

Robust Subspace-Constrained Quadratic Models for Low-Dimensional Structure Learning

Lift4D: Harmonizing Single-View 3D Estimation for 4D Reconstruction In-the-Wild

Amortized Probabilistic Retrieval of Atmospheric CO2 from OCO-2 Spectra Using Deep Learning with Laplace Approximations and Normalizing Flows

Submit Feedback

Similar Articles

Geodesic Flow Matching for Denoising High-Dimensional Structured Representations
Two-Valued Symmetric Circulant Matrices: Applications in Deep Learning
Robust Subspace-Constrained Quadratic Models for Low-Dimensional Structure Learning
Lift4D: Harmonizing Single-View 3D Estimation for 4D Reconstruction In-the-Wild
Amortized Probabilistic Retrieval of Atmospheric CO2 from OCO-2 Spectra Using Deep Learning with Laplace Approximations and Normalizing Flows