Reconstructing GRACE Terrestrial Water Storage with Spatio-Temporal Graph Neural Networks: An Application to South America

arXiv cs.LG 06/24/26, 04:00 AM Papers
Summary
This paper presents a deep learning approach using a spatio-temporal graph neural network (MTGNN) to reconstruct GRACE terrestrial water storage anomalies back to 1940 for South America, achieving high accuracy and outperforming previous methods with fewer predictors.
arXiv:2606.23833v1 Announce Type: new Abstract: Terrestrial water storage (TWS) integrates snow, soil moisture, surface water, and groundwater and is a key indicator of how climate variability and human activity reshape the global water cycle. The GRACE and GRACE-FO satellite missions provide the only direct, globally consistent observations of TWS change, but their record only begins in 2002 which is too short for many climate-scale analyses. We present a deep learning application that reconstructs monthly GRACE-like TWS anomalies (TWSA) back to 1940 by learning the relationship between daily ERA5 meteorological forcing (precipitation, evapotranspiration, runoff) and monthly GRACE observations. In contrast to prior reconstruction approaches based on grid-cell-wise regression, CNNs, or LSTMs, we adapt a multi-variate time series graph neural network (MTGNN) architecture, which was originally developed for mobility and traffic forecasting on urban sensor networks to this satellite-geodesy task. Spatial dependencies are encoded in a static, interpretable hybrid adjacency matrix that combines geodesic proximity with lagged correlations of climatic time series, capturing both local hydrological coupling and large-scale teleconnections. The reconstruction achieves a grid-cell Pearson correlation of 0.69, a basin-mean correlation of 0.94, and a near-zero bias, and it reproduces the spatial fingerprints of the 2015/16 El Ni\~no and 2020/21 La Ni\~na events. A systematic comparison with established reconstruction approaches (GTWS-MLrec, RM-REC, GRAiCE) shows that the graph-based model is statistically competitive at basin scale, reaching a correlation within 0.025 of the best baseline while using only roughly half to a tenth of the predictors the other models require and revealing characteristic weaknesses in arid regions in all models. The complete implementation is publicly available at github.com/hcu-cml/MTGNN-TWS-Reconstruction-GRACE
Original Article
View Cached Full Text
Cached at: 06/24/26, 07:48 AM
# Reconstructing GRACE Terrestrial Water Storage with Spatio-Temporal Graph Neural Networks: An Application to South America [Applications]
Source: [https://arxiv.org/html/2606.23833](https://arxiv.org/html/2606.23833)
\(2026\)

###### Abstract\.

Terrestrial water storage \(TWS\) integrates snow, soil moisture, surface water, and groundwater and is a key indicator of how climate variability and human activity reshape the global water cycle\. The GRACE and GRACE\-FO satellite missions provide the only direct, globally consistent observations of TWS change, but their record only begins in 2002 which is too short for many climate\-scale analyses\. We present a deep learning application that reconstructs monthly GRACE\-like TWS anomalies \(TWSA\) back to 1940 by learning the relationship between daily ERA5 meteorological forcing \(precipitation, evapotranspiration, runoff\) and monthly GRACE observations\. In contrast to prior reconstruction approaches based on grid\-cell\-wise regression, CNNs, or LSTMs, we adapt a multivariate time series graph neural network \(MTGNN\) architecture, which was originally developed for mobility and traffic forecasting on urban sensor networks to this satellite\-geodesy task\. Spatial dependencies are encoded in a static, interpretable hybrid adjacency matrix that combines geodesic proximity with lagged correlations of climatic time series, capturing both local hydrological coupling and large\-scale teleconnections\. Evaluated over South America against GRACE/GRACE\-FO \(2002–2023\), the reconstruction achieves a grid\-cell Pearson correlation of 0\.69, a basin\-mean correlation of 0\.94, and a near\-zero bias, and it reproduces the spatial fingerprints of the 2015/16 El Niño and 2020/21 La Niña events\. A systematic comparison with established reconstruction approaches \(GTWS\-MLrec, RM\-REC, GRAiCE\) shows that the graph\-based model is statistically competitive at basin scale, reaching a correlation within 0\.025 of the best baseline while using only roughly half to a tenth of the predictors the other models require and revealing characteristic weaknesses in arid regions in all models\. We discuss best practices and lessons learned from deploying graph deep learning in a satellite\-geodesy application, and outline extensions via additional predictors and physics\-informed constraints based on the terrestrial water balance equation\. To support reproducibility and future research, the complete implementation is publicly available at[https://github\.com/hcu\-cml/MTGNN\-TWS\-Reconstruction\-GRACE](https://github.com/hcu-cml/MTGNN-TWS-Reconstruction-GRACE)\.

Terrestrial water storage, GRACE, Graph neural networks, Spatio\-temporal modeling, ERA5, Climate reconstruction, GeoAI

††conference:The 34th ACM International Conference on Advances in Geographic Information Systems; November 3–6, 2026; Riverside, CA, USA††journalyear:2026††copyright:acmlicensed††doi:XXXXXXX\.XXXXXXX††conference:Make sure to enter the correct conference title from your rights confirmation email; June 03–05, 2018; Woodstock, NY††isbn:978\-1\-4503\-XXXX\-X/2018/06††ccs:Applied computing Earth and atmospheric sciences††ccs:Computing methodologies Supervised learning by regression††ccs:Computing methodologies Neural networks## 1\.Introduction

Understanding changes in the Earth’s water storage is crucial for quantifying the impacts of climate variability and anthropogenic activity on the global water cycle\. Terrestrial water storage \(TWS\) comprises the water stored above and below the land surface in the form of snow, ice, soil moisture, surface water, and groundwater, and acts as a key integrator of hydrological processes\. TWS is highly sensitive to climate change: it registers shifts in precipitation, temperature, and extreme weather, and its variations relate directly to the occurrence of droughts, floods, and changes in seasonal water availability\. Accurate TWS data are therefore essential for separating natural variability from human\-induced trends and for managing freshwater resources under accelerating global change\(Yinet al\.,[2023](https://arxiv.org/html/2606.23833#bib.bib47); Humphrey and Gudmundsson,[2019](https://arxiv.org/html/2606.23833#bib.bib18)\)\.

Since 2002, the Gravity Recovery and Climate Experiment\(Tapleyet al\.,[2004](https://arxiv.org/html/2606.23833#bib.bib12)\)and its successor GRACE\-FO\(Landereret al\.,[2020](https://arxiv.org/html/2606.23833#bib.bib13)\)have provided the only direct, globally consistent measurements of TWS change\. By tracking variations in the distance between two co\-orbiting satellites, the missions resolve monthly changes in the Earth’s gravity field, from which mass redistribution can be inferred, including continental water movement, ice mass change, and groundwater depletion\(Jensenet al\.,[2020](https://arxiv.org/html/2606.23833#bib.bib20); Tapleyet al\.,[2019](https://arxiv.org/html/2606.23833#bib.bib40)\)\. Unlike sparse ground\-based monitoring networks, GRACE captures subsurface and large\-scale hydrological processes with homogeneous accuracy, making it a benchmark dataset of Earth system science, particularly in regions where in\-situ networks are weak or absent\(Jensenet al\.,[2020](https://arxiv.org/html/2606.23833#bib.bib20)\)\.

The observational record, however, spans barely more than two decades\. This is a severe limitation for climate science, where multi\-decadal time series are required to identify trends, detect tipping points, and attribute variability to large\-scale drivers such as the El Niño–Southern Oscillation \(ENSO\)\(Humphrey and Gudmundsson,[2019](https://arxiv.org/html/2606.23833#bib.bib18)\)\. To cover the pre\-GRACE era, a growing body of work therefore reconstructs GRACE\-like signals from meteorological data using statistical and machine learning methods \(Section[3](https://arxiv.org/html/2606.23833#S3)\)\. Yet most existing approaches, which are grid\-cell\-wise regressions, CNNs, and LSTMs, treat grid cells as spatially independent or only implicitly model spatial context\. They thereby ignore the inherently networked character of the water cycle, in which hydrological connectivity and atmospheric teleconnections couple distant regions\.

This paper reports on the application of spatio\-temporal*graph*deep learning to this reconstruction problem\. Notably, the architecture we deploy was not designed for the Earth sciences at all: the multivariate time series graph neural network \(MTGNN\) ofWuet al\.\([2020](https://arxiv.org/html/2606.23833#bib.bib45)\)originates in the mobility domain, where it was developed for forecasting traffic conditions on urban road\-sensor networks\. The structural analogy that motivates the transfer is straightforward\. In traffic forecasting, sensors form the nodes of a graph, congestion propagates along the road network, and the model must capture how a disturbance at one location influences readings elsewhere with a delay\. In continental hydrology, grid cells form the nodes, water and atmospheric moisture propagate along river systems and circulation patterns, and a precipitation anomaly in one region influences storage elsewhere with a lag of days to months\. Both are multivariate time series regression problems on a fixed set of spatially embedded nodes whose mutual influence is structured, directional in time, and only partially explained by geometric proximity\. We adapt MTGNN to the gravimetric setting where1∘1^\{\\circ\}grid cells become graph nodes carrying daily ERA5 forcing variables, and a static, interpretable adjacency matrix, constructed from geodesic distance and lagged climatic correlations, replaces both the road network and MTGNN’s learned graph, encoding local hydrological coupling as well as long\-range teleconnections\. Trained on the GRACE/GRACE\-FO period, the model reconstructs monthly TWS anomalies \(TWSA\) back to 1940, the start of the ERA5 reanalysis\. The implementation presented here focuses on South America \(1,120 land nodes\) as a test region within a globally designed workflow\.

Our main contributions are practical and methodological:

- •We demonstrate that a graph architecture from urban mobility forecasting transfers to gravimetric satellite data with adaptation, and we document the design decisions, most notably replacing the learned graph structure with a static, domain\-informed hybrid adjacency matrix, that made the application robust and interpretable\.
- •We provide a multi\-stage evaluation against GRACE/GRACE\-FO \(2002–2023\), covering quantitative metrics, seasonal and spatial error structure, interannual residuals, and the spatial fingerprints of the 2015/16 El Niño and 2020/21 La Niña events\.
- •We benchmark the reconstruction against three existing approaches spanning the methodological spectrum \(GTWS\-MLrec\(Yinet al\.,[2023](https://arxiv.org/html/2606.23833#bib.bib47)\), RM\-REC\(Liet al\.,[2021](https://arxiv.org/html/2606.23833#bib.bib26)\), GRAiCE\(Palazzoliet al\.,[2025](https://arxiv.org/html/2606.23833#bib.bib31)\)\) and show that a graph model is statistically competitive at basin scale \(correlation 0\.94 with GRACE\)*using only three input variables*, where the better\-scoring baselines consume from roughly six to over twenty predictors, while sharing the community\-wide weakness in arid, human\-influenced regions\.
- •We report lessons learned and two concrete extension paths\. Input parameter expansion \(a temperature pilot already raises grid\-cell correlation from 0\.69 to 0\.71\) and physics\-informed training via the terrestrial water balance equation\.

## 2\.Background and Problem Definition

This section provides the domain context needed to follow the application without a background in geodesy or hydrology, and then states the learning problem formally\.

### 2\.1\.Why Satellite Gravimetry with GRACE?

GRACE, launched in 2002 as a collaboration between the National Aeronautics and Space Administration \(NASA\) and Deutsches Zentrum für Luft\- und Raumfahrt \(DLR\) \(eng\.*German Aerospace Center*\), consists of two satellites flying on the same orbit, one trailing the other\. When the leading satellite passes over a region of slightly higher mass, such as a mountain range or an aquifer after a wet season, it is accelerated marginally earlier than its follower, and the inter\-satellite distance changes\. A K\-band microwave ranging system measures these distance variations continuously\. In addition to that GRACE\-FO \(launched 2018 after GRACE ended in 2017\) adds a laser ranging interferometer of even higher precision\(Tapleyet al\.,[2019](https://arxiv.org/html/2606.23833#bib.bib40); Chenet al\.,[2022](https://arxiv.org/html/2606.23833#bib.bib3)\)\. Over the course of a month the satellite pair samples the entire globe, and the accumulated ranging data are inverted into a monthly model of the Earth’s gravity field\.

These monthly gravity fields are expressed as*spherical harmonic coefficients*—a frequency\-domain representation of a function on the sphere, analogous to a 2D Fourier transform, where low degrees encode planetary\-scale structures and higher degrees encode finer spatial details\. The solutions used here resolve up to degree 96 and after the necessary spatial filtering have an effective spatial resolution of roughly 300 km\)\(Kurtenbachet al\.,[2012](https://arxiv.org/html/2606.23833#bib.bib25); Chenet al\.,[2022](https://arxiv.org/html/2606.23833#bib.bib3)\)\. Subtracting a static mean field isolates the*time\-variable*component, which over land is dominated by water mass redistribution\(Sunet al\.,[2017](https://arxiv.org/html/2606.23833#bib.bib39)\)\. After standard corrections \(Section[4\.1](https://arxiv.org/html/2606.23833#S4.SS1)\), the coefficients are transformed onto a geographic grid and expressed as*equivalent water height*\(EWH\), representing the thickness of a hypothetical water layer that would produce the observed gravity change\. A value of−0\.2\-0\.2m EWH at a grid cell thus means the cell has lost mass equivalent to 20 cm of water spread over its area, regardless of whether that loss occurred in soil moisture, surface water, or groundwater\. Values are reported as*anomalies*\(TWSA\) relative to a 2004–2009 baseline mean, so the quantity of interest is the deviation from a reference state, not absolute storage\. This integrated view is what makes GRACE unique, as groundwater depletion at depths of hundreds of meters is invisible to optical and radar satellites but detectable by gravimetry\.

### 2\.2\.Reanalysis Data

The model input comes from ERA5\(Hersbachet al\.,[2020](https://arxiv.org/html/2606.23833#bib.bib16)\), an atmospheric*reanalysis*\. A reanalysis is best understood as a globally consistent, physics\-constrained interpolation of the historical observation record: a numerical weather model is run over the past, and at every assimilation step the model state is optimally corrected toward all available observations \(satellites, weather stations, radiosondes, ships, aircraft\)\(Hersbachet al\.,[2023](https://arxiv.org/html/2606.23833#bib.bib5)\)\. The result is a gap\-free, gridded estimate of atmospheric and land\-surface variables that is dynamically consistent across space, time, and variables, unlike raw station data, which are sparse and heterogeneous\. Crucial for this application, ERA5 extends back to 1940, more than six decades before the first GRACE observation, with global hourly coverage\. Its quality is not uniform over time \(fewer observations constrain the early decades, and the satellite era begins around 1979\), a caveat that applies to every reconstruction built on it\.

### 2\.3\.Problem Definition

LetV=\{v1,…,vN\}V=\\\{v\_\{1\},\\dots,v\_\{N\}\\\}be the set ofN=1,120N=1\{,\}120land grid cells of the1∘1^\{\\circ\}South American domain, embedded in a weighted undirected graphG=\(V,E,A\)G=\(V,E,A\)with adjacency matrixA∈ℝN×NA\\in\\mathbb\{R\}^\{N\\times N\}\(Section[5\.4](https://arxiv.org/html/2606.23833#S5.SS4)\)\. For each calendar monthmm, the input is a window ofT=30T=30daily data ofD=3D=3ERA5 flux variables \(precipitation, evapotranspiration, runoff\) at every node,Xm∈ℝT×N×DX\_\{m\}\\in\\mathbb\{R\}^\{T\\times N\\times D\}, and the target is the GRACE TWSA of that month at every node,ym∈ℝNy\_\{m\}\\in\\mathbb\{R\}^\{N\}\. The task is to learn a function

\(1\)fθ:\(Xm,G\)⟼y^m,f\_\{\\theta\}:\\bigl\(X\_\{m\},\\,G\\bigr\)\\longmapsto\\hat\{y\}\_\{m\},i\.e\., a node\-level regression conditioned on the graph structure, by supervised training on the months for which GRACE observations exist \(April 2002 – December 2023\)\. Reconstruction then consists of applying the trainedfθf\_\{\\theta\}to the ERA5 record outside the supervision period \(1940–2002\), where no gravimetric ground truth exists\. Two properties make this a spatial\-computing problem rather than a generic regression: the physical processes linking input and target are spatially networked \(water moves between cells; atmospheric teleconnections couple distant regions\), and the supervision signal lives at a coarser*temporal*resolution \(monthly\) than the input \(daily\), so the model must learn both spatial aggregation over the graph and temporal aggregation over the window\. The implicit assumption underlying any such reconstruction, shared by all related work in Section[3](https://arxiv.org/html/2606.23833#S3), is that the mapping from meteorological forcing to storage response learned in the satellite era is stationary enough to be applied to earlier decades\.

## 3\.Related Work

Approaches for extending TWSA beyond the GRACE/GRACE\-FO period fall into three categories: statistical methods, physically based hydrological models, and, increasingly, machine learning\. We summarize the data\-driven reconstructions that serve as methodological context and, later, as comparison baselines\.

Humphrey and Gudmundsson \([2019](https://arxiv.org/html/2606.23833#bib.bib18)\)present GRACE\-REC, a statistical reconstruction of climate\-driven TWS from 1901 to 2019\. A linear reservoir model with seasonally varying, temperature\-parameterized residence time is updated daily from precipitation and temperature, calibrated per GRACE mascon on de\-trended and de\-seasonalized GRACE data\. An ensemble of 100 simulations quantifies parameter and residual uncertainty\. Despite its simplicity, GRACE\-REC matches or outperforms complex hydrological models in reproducing interannual variability\.

Liet al\.\([2021](https://arxiv.org/html/2606.23833#bib.bib26)\)develop a global reconstruction \(1979–2020,0\.5∘0\.5^\{\\circ\}; referred to here as RM\-REC\) combining machine learning \(ANN, ARX, and MLR\) with statistical mode decomposition and time series decomposition\. Rather than fitting each grid cell independently, it reconstructs a few leading spatial modes of the GRACE field over regions of varying size \(continents, multi\-basins, and basins\), which allows it to assimilate predictors located outside the study area, such as sea surface temperature and climate indices, alongside precipitation, temperature, soil moisture, runoff, and evaporation\. Trained on 2002–2017 GRACE mascons and validated against GRACE\-FO, satellite laser ranging, and global mean sea level, it reproduces strong El Niño signals well\.

Yinet al\.\([2023](https://arxiv.org/html/2606.23833#bib.bib47)\)present GTWS\-MLrec, a global reconstruction \(1940–2022\) from an ensemble of five machine learning and statistical models trained pixel by pixel with locally selected predictors from meteorological, hydrological, land use, and vegetation categories\. Per\-cell model selection over eight input schemes yields several consistent global products tied to different mascon solutions\.

Palazzoliet al\.\([2025](https://arxiv.org/html/2606.23833#bib.bib31)\)introduce GRAiCE \(1984–2021,0\.5∘0\.5^\{\\circ\}\), training LSTM and BiLSTM models per grid cell with lags of up to 24 months and Optuna\-based hyperparameter search\. The five meteorological predictors are total precipitation, snow depth water equivalent, surface net solar radiation, surface air temperature, and relative humidity, with solar\-induced fluorescence added as an optional sixth predictor\. The reconstructions achieve global Pearson correlations above 0\.9 against GRACE/GRACE\-FO and capture ENSO extremes well, but accuracy declines in arid, human\-influenced regions\.

Gentneret al\.\([2026](https://arxiv.org/html/2606.23833#bib.bib11)\)propose DeepRec \(1941–2023\), a CNN encoder over17\.5∘17\.5^\{\\circ\}patches followed by an LSTM, fed by 16 ERA5 variables together with sea surface temperature, the ENSO index, ISIMIP land\-use and lake fractions, and engineered geographic/temporal features, and targeting the*full*TWS signal including human\-influenced trends, with validation against global mean sea level and satellite laser ranging\.

Table[1](https://arxiv.org/html/2606.23833#S3.T1)contrasts these approaches\. Common to all is that spatial structure is treated implicitly \(per\-cell models\) or through regular convolution on grids\. Graph neural networks, in contrast, operate directly on irregular neighborhood structures and have proven effective in climate and geospatial flow applications, e\.g\., ENSO forecasting with graph convolutions\(Cachayet al\.,[2020](https://arxiv.org/html/2606.23833#bib.bib34)\), spatially explicit GeoAI on networks\(Sunet al\.,[2021](https://arxiv.org/html/2606.23833#bib.bib38); Zhouet al\.,[2020](https://arxiv.org/html/2606.23833#bib.bib48)\), place characterization\(Wardet al\.,[2022](https://arxiv.org/html/2606.23833#bib.bib41)\), or semantic segmentation of historical urban plans\(Arzoumanidiset al\.,[2026](https://arxiv.org/html/2606.23833#bib.bib44)\)\. Within SIGSPATIAL, GNNs have recently been applied to inter\-county food flow prediction\(Zhanget al\.,[2025](https://arxiv.org/html/2606.23833#bib.bib24)\)and related spatial\-network problems, underscoring their ability to generalize across spatial scales from topological regularities\. To our knowledge, this work is the first to apply spatio\-temporal graph deep learning to TWS reconstruction, explicitly encoding hydroclimatic teleconnections in the graph topology\.

Table 1\.Published GRACE reconstruction approaches used for context and comparison\. Spatial resolution is given in degree\-sized patches\.
## 4\.Data

The application links two physically related datasets through the terrestrial water balance: GRACE/GRACE\-FO observations of integrated storage change serve as the prediction target, and ERA5 reanalysis fluxes serve as model input\.

### 4\.1\.GRACE/GRACE\-FO Terrestrial Water Storage

We use monthly gravity field solutions from the ITSG\-Grace2018 series \(GRACE\) and ITSG\-Grace\_operational \(GRACE\-FO\) provided by Graz University of Technology\(Mayer\-Gürret al\.,[2018](https://arxiv.org/html/2606.23833#bib.bib30)\), processed with the GROOPS software\(Mayer\-Gürret al\.,[2021](https://arxiv.org/html/2606.23833#bib.bib29)\)\. Standard corrections are applied: degree\-1 \(geocenter\) coefficients are restored followingChenget al\.\([2013](https://arxiv.org/html/2606.23833#bib.bib4)\); Sunet al\.\([2017](https://arxiv.org/html/2606.23833#bib.bib39)\); theC20C\_\{20\}coefficient is replaced by satellite laser ranging estimates\(Chenet al\.,[2022](https://arxiv.org/html/2606.23833#bib.bib3)\); glacial isostatic adjustment is removed with a GIA model; and a DDK3 decorrelation filter suppresses striping errors and high\-frequency noise\(Qianet al\.,[2022](https://arxiv.org/html/2606.23833#bib.bib32)\)\. The corrected spherical harmonic coefficients \(up to degree 96\) are converted to equivalent water heights \(EWH\)\(Guoet al\.,[2014](https://arxiv.org/html/2606.23833#bib.bib14)\)and expressed as anomalies relative to the 2004–2009 GRACE baseline\. Gaps in the monthly record, including the 2017/18 inter\-mission gap, are interpolated, and a leakage mask removes coastal cells affected by ocean signal leakage\(Eickeret al\.,[2020](https://arxiv.org/html/2606.23833#bib.bib7)\)\. The resulting target dataset covers April 2002 to December 2023 on a global geographical1∘×1∘1^\{\\circ\}\\times 1^\{\\circ\}grid with the South American subset, used in this work, containing 1,120 grid cells\.

### 4\.2\.ERA5 Meteorological Forcing

ERA5, the fifth\-generation atmospheric reanalysis of the European Centre for Medium\-Range Weather Forecasts \(ECMWF\)111[https://www\.ecmwf\.int/](https://www.ecmwf.int/), assimilates decades of observations into a globally consistent estimate of the atmosphere from 1940 to the present\(Hersbachet al\.,[2020](https://arxiv.org/html/2606.23833#bib.bib16),[2023](https://arxiv.org/html/2606.23833#bib.bib5)\)\. We use three single\-level variables that constitute the terrestrial water balance\(Hersbachet al\.,[2023](https://arxiv.org/html/2606.23833#bib.bib5)\), namely, total precipitation \(PP\), surface latent heat flux converted to evapotranspiration \(EE\), and total runoff \(RR\)\. Hourly fields are aggregated to daily values, re\-gridded to the same1∘1^\{\\circ\}grid, expressed in EWH via a spherical harmonic expansion to degree 96, and provided as NetCDF ensuring full spatial consistency with the GRACE prediction target\.

### 4\.3\.Physical Link: Water Balance Equation

The two datasets are physically coupled through the terrestrial water balance\(Lorenzet al\.,[2014](https://arxiv.org/html/2606.23833#bib.bib27); Eickeret al\.,[2020](https://arxiv.org/html/2606.23833#bib.bib7)\):

\(2\)P−E−R=dSdt,P\-E\-R=\\frac\{dS\}\{dt\},where the storage changedS/dtdS/dtis observed by GRACE and the flux terms are represented by ERA5\.

## 5\.Methodology

### 5\.1\.Spatio\-Temporal Graph\-Based Architecture

Conventional CNNs assume a regular grid with fixed local neighborhoods\. This assumption is problematic for global climate data because important relationships are not always local\. Regions separated by large distances may still exhibit strong interactions through river systems, ocean currents, and large\-scale atmospheric circulation patterns\(Wuet al\.,[2021](https://arxiv.org/html/2606.23833#bib.bib42); Cachayet al\.,[2020](https://arxiv.org/html/2606.23833#bib.bib34)\)\. Graph convolution generalizes the convolution operation to variable, unstructured neighborhoods and has delivered strong results in climate applications such as ENSO forecasting\(Cachayet al\.,[2020](https://arxiv.org/html/2606.23833#bib.bib34)\)\. Spatio\-temporal GNNs \(STGNNs\) combine per\-time\-step graph convolution with temporal convolution or recurrence over node sequences, processing input tensorsX∈ℝT×N×DX\\in\\mathbb\{R\}^\{T\\times N\\times D\}\(TTtime steps,NNnodes,DDfeatures\)\(Wuet al\.,[2021](https://arxiv.org/html/2606.23833#bib.bib42)\)\.

### 5\.2\.MTGNN

The architecture we deploy, MTGNN\(Wuet al\.,[2020](https://arxiv.org/html/2606.23833#bib.bib45)\), was developed and demonstrated in the mobility and traffic forecasting domain: predicting future readings of road\-sensor networks from their multivariate history, where each sensor is a graph node and the \(partly latent\) road topology governs how congestion propagates\. We selected it for this application not despite but*because*of that origin\. The traffic problem and the hydrological problem share their abstract structure which can be defined as a fixed set of spatially embedded nodes, multivariate time series per node, propagation of influence along a network with characteristic delays, and the need to capture periodicities at multiple time scales \(rush hours and weekly cycles in traffic; rainy seasons and annual cycles in hydrology\)\. MTGNN’s dilated temporal convolutions, designed for exactly such multi\-scale periodicity, and its mix\-hop graph convolutions, designed for delayed multi\-node propagation, therefore map naturally onto hydroclimatic dynamics\. Table[2](https://arxiv.org/html/2606.23833#S5.T2)summarizes the correspondence and the adaptations that were required\.

Table 2\.Transfer mapping from the original mobility setting of MTGNN\(Wuet al\.,[2020](https://arxiv.org/html/2606.23833#bib.bib45)\)to the gravimetric application\.Three adaptations deserve emphasis\. First, the prediction task changes from*forecasting*\(predicting future values of the input quantity itself\) to*cross\-variable regression with temporal aggregation*\. The model maps a 30\-day window of daily fluxes to a single monthly value of a different physical quantity, observed by a different instrument\. Second, the graph is prescribed rather than learned, for reasons detailed in Section[5\.4](https://arxiv.org/html/2606.23833#S5.SS4)\. Third, the supervision regime is far more constrained: instead of years of dense sensor data, only 183 monthly target fields exist for training, which pushed the design toward strong regularization \(dropout, weight decay\) and a deliberately moderate model capacity\.

### 5\.3\.MTGNN Adapted to Hydrology

We retain the three functional components of MTGNN\(Wuet al\.,[2020](https://arxiv.org/html/2606.23833#bib.bib45)\): a graph \(structure\) layer, graph convolution modules, and temporal convolution modules\. Graph convolution uses two mix\-hop propagation layers that aggregate information over multiple hop distances while a retention factor counteracts over\-smoothing\(Wuet al\.,[2020](https://arxiv.org/html/2606.23833#bib.bib45); Zhouet al\.,[2020](https://arxiv.org/html/2606.23833#bib.bib48)\)\. Temporal convolution uses dilated inception layers with kernel sizes1×21\\times 2,1×31\\times 3,1×61\\times 6, and1×71\\times 7and a tanh/sigmoid as activation functions for the gating mechanism, yielding exponentially growing receptive fields that capture delayed hydrological responses\. Residual and skip connections stabilize training and preserve node\-level history\(Wuet al\.,[2020](https://arxiv.org/html/2606.23833#bib.bib45)\)\. Figure[1](https://arxiv.org/html/2606.23833#S5.F1)shows the adapted architecture: nodes are1∘1^\{\\circ\}ERA5 grid cells, node features are dailyPP,EE,RRsequences, and the output is one TWSA value per node and month\.

![Refer to caption](https://arxiv.org/html/2606.23833v1/figures/fig07_mtgnn-model-architecture.png)Figure 1\.Model architecture of the adapted MTGNN \(modified fromWuet al\.\([2020](https://arxiv.org/html/2606.23833#bib.bib45)\)\)\.Enjoying the baseball game from the third\-base seats\. Ichiro Suzuki preparing to bat\.
### 5\.4\.A Static, Interpretable Hybrid Graph

The original MTGNN*learns*its adjacency matrix from randomly initialized node embeddings with subgraph sampling\(Wuet al\.,[2020](https://arxiv.org/html/2606.23833#bib.bib45)\)\. For this application we deliberately replaced the learned structure with a static graph, for reasons that we consider an important practical lesson\. First, learned graphs are hard to interpret, whereas transparency about spatial dependencies is essential in a geoscientific setting\. Second, jointly optimizing a latent graph and a deep predictive model proved prone to convergence instability on heterogeneous meteorological inputs, while random subgraph sampling can sever physically meaningful dependencies\(Huanget al\.,[2025](https://arxiv.org/html/2606.23833#bib.bib17); Kirschstein and Sun,[2024](https://arxiv.org/html/2606.23833#bib.bib22)\)\.

The static graph is undirected and homogeneous, and its weights combine two similarity terms\.*Climate similarity*captures teleconnections via the maximum\-lag Pearson correlation between node time series\(Silvaet al\.,[2021](https://arxiv.org/html/2606.23833#bib.bib35); Berman,[2016](https://arxiv.org/html/2606.23833#bib.bib2)\)\. Here,ρij\(f\)\\rho^\{\(f\)\}\_\{ij\}is the Pearson correlation coefficient, which measures linear dependence on a scale from−1\-1to11, between the time series of variableffat nodesiiandjj\. It is evaluated at the lagτ∈\[0,τmax\]\\tau\\in\[0,\\tau\_\{\\max\}\]that gives the largest absolute correlation\. Using this lagged cross\-correlation rather than the zero\-lag value captures relationships in which one region’s signal leads or trails another’s, as is typical of propagating or transport\-driven teleconnection patterns\. The per\-variable correlations are aggregated into a single similarity,

\(3\)sijclimate=∑fwf⋅\|ρij\(f\)\|\+12,s^\{\\mathrm\{climate\}\}\_\{ij\}=\\sum\_\{f\}w\_\{f\}\\cdot\\frac\{\|\\rho^\{\(f\)\}\_\{ij\}\|\+1\}\{2\},where the absolute value lets strong anti\-correlations contribute as much as strong positive ones, andwfw\_\{f\}weights each variable\.*Spatial proximity*converts the geodesic distancedijd\_\{ij\}into a similarity with a distance\-decay Gaussian kernel\(Zhu and Ghahramani,[2002](https://arxiv.org/html/2606.23833#bib.bib49)\):

\(4\)sijdistance=exp⁡\(−dijσ\),s^\{\\mathrm\{distance\}\}\_\{ij\}=\\exp\\\!\\left\(\-\\frac\{d\_\{ij\}\}\{\\sigma\}\\right\),where the length scaleσ\\sigma\(in the units ofdijd\_\{ij\}\) controls the rate of distance decay\. Atdij=σd\_\{ij\}=\\sigma, the similarity falls to1/e≈0\.371/e\\approx 0\.37of its maximum value\. Smallσ\\sigmaemphasizes local connections, while largeσ\\sigmapermits stronger links between distant cells\. The trade\-off adjacency weight is their convex combination,

\(5\)Aij=α⋅sijclimate\+\(1−α\)⋅sijdistance,A\_\{ij\}=\\alpha\\cdot s^\{\\mathrm\{climate\}\}\_\{ij\}\+\(1\-\\alpha\)\\cdot s^\{\\mathrm\{distance\}\}\_\{ij\},withα=0\.6\\alpha=0\.6andσ=3500km\\sigma=3500\\,\\mathrm\{km\}selected for the South American domain\. Figure[2](https://arxiv.org/html/2606.23833#S5.F2)shows the resulting edge weights for a single node\. In addition to nearby locations, distant regions with similar climate conditions can also receive substantial weight\. These long\-range connections capture teleconnection patterns that grid\-based models cannot represent\.

![Refer to caption](https://arxiv.org/html/2606.23833v1/figures/fig06_global-adjacency-weights-south-america.png)Figure 2\.Adjacency weights of a single node in South America under the hybrid graph construction \(Eq\.[5](https://arxiv.org/html/2606.23833#S5.E5)\)\.Enjoying the baseball game from the third\-base seats\. Ichiro Suzuki preparing to bat\.
### 5\.5\.Training Setup

Each training example pairs a 30\-day window of daily ERA5 inputs with the GRACE TWSA of the corresponding month, i\.e\.,x∈ℝ30×1120×3→y∈ℝ1×1120×1x\\in\\mathbb\{R\}^\{30\\times 1120\\times 3\}\\rightarrow y\\in\\mathbb\{R\}^\{1\\times 1120\\times 1\}\. Feature\-wise Z\-score normalization is fitted on the training partition only and applied consistently to validation and test data\(Singh and Singh,[2022](https://arxiv.org/html/2606.23833#bib.bib36)\)\. Because random splits leak information in time series, the data are split sequentially\(Reitermanová,[2010](https://arxiv.org/html/2606.23833#bib.bib33)\)\. Training covers April 2002 to June 2017 \(183 months, containing the 2004–2009 GRACE baseline and excluding the interpolated inter\-mission gap\), followed by 34 validation and 33 test months as can be seen in Figure[3](https://arxiv.org/html/2606.23833#S5.F3)\.

![Refer to caption](https://arxiv.org/html/2606.23833v1/figures/fig08_dataset-splitting-train-val-test.png)Figure 3\.Sequential train/validation/test split along the GRACE record\.Enjoying the baseball game from the third\-base seats\. Ichiro Suzuki preparing to bat\.The model is trained for 40 epochs with the Adam optimizer\(Kingma and Ba,[2015](https://arxiv.org/html/2606.23833#bib.bib21)\)\(learning rate10−310^\{\-3\}, batch size 64, dropout 0\.4, weight decay 0\.005\) and an MSE loss, which penalizes the large deviations associated with hydrological extremes more strongly than MAE\(da Silva Duarteet al\.,[2021](https://arxiv.org/html/2606.23833#bib.bib6)\)\. Architecture hyperparameters were tuned manually against validation error resulting in a graph convolution depth of 4, 32 convolution channels, 64 residual and skip channels and 128 end channels\. Training and validation losses decrease monotonically without divergence, indicating no overfitting\. The final validation metrics are MSE 0\.0117, RMSE 0\.1083, MAE 0\.0830 \(normalized units\)\. On the held\-out test months the model achieves MSE 0\.0128, RMSE 0\.1131, and MAE 0\.0862, which is consistent with validation and evidence of stable generalization\. Temporally aggregated test predictions track the observed basin\-mean signal closely, while the spatial correlation of time\-averaged fields is 0\.61, foreshadowing the regional weaknesses analyzed below\.

## 6\.Experimental Results

The evaluation proceeds in three stages: \(i\) direct validation against GRACE/GRACE\-FO over 2002–2023, \(ii\) analysis of seasonal, spatial, and interannual error structure including ENSO case studies, and \(iii\) a systematic comparison with established reconstruction approaches\.

### 6\.1\.Correlation with GRACE Observations

Table[3](https://arxiv.org/html/2606.23833#S6.T3)summarizes the correlation between predictions and observations over South America from April 2002 to December 2023\. All metrics are computed at the grid\-cell level and pooled across all land cells and months\. As a result, they measure performance at the native1∘1^\{\\circ\}resolution rather than after spatial averaging\. Given a grid\-cell dynamic range of approximately±0\.6m\\pm 0\.6\\,\\mathrm\{m\}EWH, the RMSE of0\.132m0\.132\\,\\mathrm\{m\}and MAE of0\.096m0\.096\\,\\mathrm\{m\}indicate high overall correlation\. The bias is close to zero \(−0\.004m\-0\.004\\,\\mathrm\{m\}\), suggesting no systematic over\- or underestimation\. The grid\-cell Pearson correlation of0\.690\.69indicates a strong positive relationship between predicted and observed variability\. This correlation is lower than the value obtained after spatial averaging\. This difference is expected because spatial averaging reduces local errors and emphasizes large\-scale seasonal variability\. In contrast, the grid\-cell metric preserves regional differences, including higher correlation in the humid Amazon and weaker correlation in the arid southern regions, as discussed in Sec\.[6\.2](https://arxiv.org/html/2606.23833#S6.SS2)\.

Table 3\.Model evaluation against GRACE over South America \(2002–2023\)\.Quality measureValueRoot Mean Squared Error \(RMSE\) \[m EWH\]0\.1323Mean Absolute Error \(MAE\) \[m EWH\]0\.0955Bias \[m EWH\]−0\.0044\-0\.0044Pearson correlation \(all grid cells\)0\.6931Figure[4](https://arxiv.org/html/2606.23833#S6.F4)compares predicted and observed time series for the area mean and an exemplary Amazon grid cell \(Figure[2](https://arxiv.org/html/2606.23833#S5.F2)\)\. The prediction follows the seasonal cycle closely, and the scatter plots cluster around the identity line\. Smaller deviations concentrate in extreme wet and dry phases, and are slightly more pronounced at the single cell than in the spatial mean, indicating lower consistency for local phenomena\. The exemplary cell also exhibits a clearly negative observed long\-term trend whose direction the model reproduces but whose magnitude it underestimates\.

![Refer to caption](https://arxiv.org/html/2606.23833v1/figures/fig11_timeseries-scatter-areamean-amazon_a.png)\(a\)area mean over South America
![Refer to caption](https://arxiv.org/html/2606.23833v1/figures/fig11_timeseries-scatter-areamean-amazon_b.png)\(b\)exemplary grid cell in the Amazon basin

Figure 4\.Observed vs\. predicted TWSA with scatter plots for \(a\) the area mean and \(b\) an exemplary Amazon grid cell \(Figure[2](https://arxiv.org/html/2606.23833#S5.F2)\)\.The spatial correlation map \(Figure[5](https://arxiv.org/html/2606.23833#S6.F5)\) shows where the model is strongest\. The central Amazon basin, where the seasonal storage signal is most pronounced, reaches the highest correlations, confirming that the model excels in regions dominated by strong, regular hydrological cycles\.

![Refer to caption](https://arxiv.org/html/2606.23833v1/figures/fig12_pearson-correlation-map-south-america.png)Figure 5\.Per\-grid\-cell Pearson correlation between the MTGNN reconstruction and GRACE over South America \(April 2002–December 2023\)\. Correlation is highest in the central Amazon, where the seasonal storage signal is most pronounced, and falls below 0\.5 in the arid south \(parts of Bolivia, Paraguay, and Argentina, Chile\), revealing the regional structure behind the aggregate grid\-cell correlation reported in Table[3](https://arxiv.org/html/2606.23833#S6.T3)\.Enjoying the baseball game from the third\-base seats\. Ichiro Suzuki preparing to bat\.
### 6\.2\.Seasonal and Spatial Error Structure

A monthly error analysis at the Amazon cell \(Figure[6](https://arxiv.org/html/2606.23833#S6.F6)\) reveals systematic seasonal behavior\. The TWSA is slightly underestimated from December to June and slightly overestimated in the second half of the year, with the largest error variability in the transition months between dry and rainy season\. The model thus struggles most at the onset and end of the rainy season, and occasional outliers point to difficulties with extreme events\.

![Refer to caption](https://arxiv.org/html/2606.23833v1/figures/fig13_monthly-error-boxplot-amazon.png)Figure 6\.Monthly distribution of errors \(predicted−\-observed\) at the exemplary Amazon grid cell\.Enjoying the baseball game from the third\-base seats\. Ichiro Suzuki preparing to bat\.Seasonal difference maps \(Figure[7](https://arxiv.org/html/2606.23833#S6.F7)\) highlight the strong influence of spatial information on the error analysis\. Rainy season errors \(e\.g\., February 2010\) are large but spatially coherent, whereas dry season errors \(e\.g\., July 2010\) are fragmented, suggesting that the model’s ability to*distribute*water within the region degrades at low storage levels\. The long\-term mean difference exposes a persistent regional bias with a slight overestimation in the northwestern basin and underestimation in the southeast\.

![Refer to caption](https://arxiv.org/html/2606.23833v1/figures/fig14_spatial-difference-rainy-dry-season.png)Figure 7\.Spatial difference between prediction and observation for \(a\) rainy vs\. dry season and \(b\) the full\-period mean\.Enjoying the baseball game from the third\-base seats\. Ichiro Suzuki preparing to bat\.
### 6\.3\.Interannual Signals and ENSO Case Study

To test whether the model captures more than trend and seasonality, we fit a harmonic regression with linear trend and \(semi\-\)annual components\(European Space Agency,[2020](https://arxiv.org/html/2606.23833#bib.bib8)\),

\(6\)S^\(t\)=a\+bt\+ccos⁡\(ωt\)\+dsin⁡\(ωt\)\+ecos⁡\(2ωt\)\+fsin⁡\(2ωt\),\\hat\{S\}\(t\)=a\+bt\+c\\cos\(\\omega t\)\+d\\sin\(\\omega t\)\+e\\cos\(2\\omega t\)\+f\\sin\(2\\omega t\),and analyze the residuals, which carry the interannual signal including extreme events\. At the Amazon cell \(Figure[8](https://arxiv.org/html/2606.23833#S6.F8)\), predicted and observed residuals agree in waveform, timing, and magnitude\. Additionally, peaks and troughs coincide, multi\-year excursions are followed, and amplitudes are comparable, though the prediction is somewhat smoothed at extreme outliers\.

![Refer to caption](https://arxiv.org/html/2606.23833v1/figures/fig15_residuals-prediction-observation-amazon.png)Figure 8\.Residual \(de\-trended, de\-seasonalized\) TWSA of prediction and observation at exemplary Amazon grid cell\. Note that the outlier in 2015 is known and results from poor sampling in a repeat orbit, meaning the error is due to low quality ground truth data rather than the proposed approach\.Enjoying the baseball game from the third\-base seats\. Ichiro Suzuki preparing to bat\.Figure[9](https://arxiv.org/html/2606.23833#S6.F9)examines the two strongest recent ENSO events as spatial anomaly maps relative to the climatological monthly mean\. During the 2015/16 El Niño, GRACE shows large\-scale drought \(negative anomalies\) in the Amazon and pronounced wet anomalies in the La Plata basin\. Our model reproduces this contrasting pattern with correct extent and position, though with slightly weaker intensity\. During the 2020/21 La Niña, the reversed pattern, drought in southern Brazil, wet conditions in the Amazon, is likewise captured\. The model thus identifies the characteristic spatial fingerprints of large\-scale climate anomalies while underestimating their amplitude, consistent with the smoothing seen in the residual analysis\.

![Refer to caption](https://arxiv.org/html/2606.23833v1/figures/fig16_elnino-lanina-anomaly-comparison.png)Figure 9\.Deviations from the climatological monthly mean in observation and prediction during \(a\) the 2015/16 El Niño and \(b\) the 2020/21 La Niña\.Enjoying the baseball game from the third\-base seats\. Ichiro Suzuki preparing to bat\.
### 6\.4\.Comparison with SOTA Reconstruction Approaches

We compare against GTWS\-MLrec\(Yinet al\.,[2023](https://arxiv.org/html/2606.23833#bib.bib47)\), RM\-REC\(Liet al\.,[2021](https://arxiv.org/html/2606.23833#bib.bib26)\), and GRAiCE\(Palazzoliet al\.,[2025](https://arxiv.org/html/2606.23833#bib.bib31)\), which together span statistical, ensemble\-ML, and recurrent neural approaches\. All products are harmonized to a common1∘1^\{\\circ\}grid, masked to the South American domain, and restricted to the shared overlap period 2002–2020, so that methodological differences are not confounded with technical ones\. Table[4](https://arxiv.org/html/2606.23833#S6.T4)reports metrics on the spatially averaged time series plus the correlation at the exemplary Amazon cell\.

Table 4\.Comparison of established reconstruction approaches against GRACE based on basin\-mean time series\.*Corr\. \(Amazon\)*denotes the correlation at a single exemplary grid cell in the Amazon, while*Corr\.*refers to the basin\-mean \(spatially averaged\) time series\.The results of all reconstruction approaches correlate highly with GRACE \(\>0\.93\>0\.93\) and lie within an RMSE band of 0\.026–0\.031, i\.e\., a few centimeters of water column\. GRAiCE performs best overall and matches the GRACE variability almost exactly; GTWS\-MLrec slightly dampens variability \(std\. 0\.074\), whereas MTGNN slightly overemphasizes it \(0\.088\)\. At the single Amazon cell, MTGNN is weakest \(0\.866\), suggesting that the graph model trades some fine\-scale robustness for its large\-scale structure\. Both the single\-cell correlation \(0\.866\) and the basin\-mean correlation \(0\.939\) differ from the all\-cells correlation of 0\.69 reported in Table[3](https://arxiv.org/html/2606.23833#S6.T3)\. The all\-cells metric includes every grid cell in the domain, including regions in the arid south where correlations are relatively low\. In contrast, the single\-cell value represents one location in the Amazon with strong correlation, while the basin\-mean value is computed from spatially averaged time series\. The Taylor diagram in Figure[10](https://arxiv.org/html/2606.23833#S6.F10)condenses these relationships\.

These numbers must be read against the predictor budget each model consumes, shown in the*Inputs*column of Table[4](https://arxiv.org/html/2606.23833#S6.T4)\. While MTGNN is driven by only precipitation, evapotranspiration, and runoff, the better\-scoring baselines rely on substantially richer feature sets\. GRAiCE uses five meteorological variables \(total precipitation, snow depth water equivalent, surface net solar radiation, surface air temperature, and relative humidity\), adding solar\-induced fluorescence in its full variant\(Palazzoliet al\.,[2025](https://arxiv.org/html/2606.23833#bib.bib31)\)\. RM\-REC combines precipitation, temperature, sea surface temperature, soil moisture, runoff, and evaporation with 17 climate indices\(Liet al\.,[2021](https://arxiv.org/html/2606.23833#bib.bib26)\)\. GTWS\-MLrec ingests four whole categories of meteorological, hydrological, land\-use, and vegetation predictors \(16 variables in total\) with per\-cell feature selection\(Yinet al\.,[2023](https://arxiv.org/html/2606.23833#bib.bib47)\)while DeepRec combines 16 ERA5 variables with sea surface temperature, an ENSO index, land\-use and lake fractions, and engineered geographic and temporal features\(Gentneret al\.,[2026](https://arxiv.org/html/2606.23833#bib.bib11)\)\. Seen this way, the graph model reaches a basin\-mean correlation of 0\.939, within 0\.025 of GRAiCE and statistically indistinguishable from GTWS\-MLrec, using roughly half to a tenth of the inputs that the better\-scoring baselines require\. The slight performance gap is therefore better interpreted as a favorable accuracy\-per\-variable trade\-off than as a deficit\. Furthermore, the topology of the spatio\-temporal graph appears to substitute for part of the information that competing models must supply through additional covariates\. This parsimony is not merely aesthetic\. It lowers data\-acquisition and preprocessing cost, reduces exposure to predictors that are themselves poorly constrained in the pre\-satellite era, and makes the learned mapping easier to attribute to physical drivers, which is an advantage we expand on in Section[7](https://arxiv.org/html/2606.23833#S7)\.

![Refer to caption](https://arxiv.org/html/2606.23833v1/figures/fig17_taylor-diagram-reconstructions.png)Figure 10\.Taylor diagram of the reconstructions relative to GRACE, with radial axes indicating standard deviation\.Enjoying the baseball game from the third\-base seats\. Ichiro Suzuki preparing to bat\.The temporal view at the Amazon cell \(Figure[11](https://arxiv.org/html/2606.23833#S6.F11)\) adds nuance: MTGNN matches the phase of the GRACE signal well—better than GTWS\-MLrec, whose phase is slightly delayed—while underestimating peak amplitudes in parts of the record and producing a less smoothed series overall, indicating higher sensitivity to short\-term events \(a potential advantage\) at the cost of possible model noise\. GRAiCE, despite its leading global metrics, follows the reference poorly at this particular cell in the first half of the record\.

![Refer to caption](https://arxiv.org/html/2606.23833v1/figures/fig18_temporal-evolution-reconstructions-amazon.png)Figure 11\.TWSA at the exemplary Amazon grid cell for each reconstruction approach vs\. GRACE \(2002–2023\)\. The dashed blue line denotes GRACE, the orange line denotes MTGNN \(ours\), the green line denotes GTWS\-MLrec, the pink line denotes RM\-REC, and the purple line denotes GRAiCE\.Enjoying the baseball game from the third\-base seats\. Ichiro Suzuki preparing to bat\.Because GTWS\-MLrec is the only baseline extending back to the 1940s, it enables a pre\-GRACE consistency check \(Figure[12](https://arxiv.org/html/2606.23833#S6.F12)\)\. In the spatial mean, MTGNN consistently shows higher amplitude but tracks the temporal evolution closely: both products reproduce the 1948/49 decline, the prolonged late\-1960s decrease and recovery, and the decline since 2010\. At the exemplary cell, deviations alternate without systematic bias, and the major historical excursions \(1948/49, 1955, 1961, early 1970s, post\-2015\) agree\.

![Refer to caption](https://arxiv.org/html/2606.23833v1/figures/fig19_mtgnn-vs-gtws-mlrec-comparison.png)Figure 12\.MTGNN \(orange line\) vs\. GTWS\-MLrec \(green line\) from 1940 onward \(top: spatial mean; bottom: exemplary Amazon grid cell\)\.Enjoying the baseball game from the third\-base seats\. Ichiro Suzuki preparing to bat\.Indicated by the spatial correlation maps \(Figure[13](https://arxiv.org/html/2606.23833#S6.F13)\) MTGNN matches the baselines in the humid Amazon but falls below 0\.5 in parts of Bolivia, Paraguay, Argentina, and easternmost Brazil which can be characterized as arid and semi\-arid regions where hydroclimatic processes are more complex and human influence \(e\.g\., groundwater extraction\) is stronger\. The same regions are also the weakest for the baselines, but less severely so\. This community\-wide pattern echoes the limitations reported byPalazzoliet al\.\([2025](https://arxiv.org/html/2606.23833#bib.bib31)\)andYinet al\.\([2023](https://arxiv.org/html/2606.23833#bib.bib47)\)for arid, human\-influenced areas\.

![Refer to caption](https://arxiv.org/html/2606.23833v1/figures/fig20_spatial-correlations-grace-reconstructions.png)Figure 13\.Spatial correlation with GRACE for all reconstruction approaches evaluated over the overlap period\.Enjoying the baseball game from the third\-base seats\. Ichiro Suzuki preparing to bat\.In summary, the graph\-based reconstruction is statistically on par with established reconstruction approaches at basin scale \(correlation 0\.94, RMSE 0\.030\), captures phase and interannual dynamics particularly well, but exhibits specific spatial deficits in arid regions and slightly overemphasized variability\.

## 7\.Flexible Extensions and Lessons Learned

### 7\.1\.Input Parameter Expansion

The reconstruction uses only the three water balance core variables, whereas the strongest baselines integrate broader predictor sets spanning meteorological drivers, hydrological states, and land surface/anthropogenic factors\(Gentneret al\.,[2026](https://arxiv.org/html/2606.23833#bib.bib11); Yinet al\.,[2023](https://arxiv.org/html/2606.23833#bib.bib47); Palazzoliet al\.,[2025](https://arxiv.org/html/2606.23833#bib.bib31); Liet al\.,[2021](https://arxiv.org/html/2606.23833#bib.bib26)\)\. As established in Section[6\.4](https://arxiv.org/html/2606.23833#S6.SS4), the graph model nearly matches these established approaches on a fraction of their inputs\. This raises an obvious question: if so much is achieved with three variables, how much further could the model go—and, more importantly, how cheaply can the additional variables be added?

The architecture makes such expansion structurally straightforward\. Because each predictor enters as an additional channel of the per\-node feature dimensionDD\(Section[2\.3](https://arxiv.org/html/2606.23833#S2.SS3)\), adding a meteorological forcing means extending the node feature tensor fromℝT×N×D\\mathbb\{R\}^\{T\\times N\\times D\}toℝT×N×\(D\+1\)\\mathbb\{R\}^\{T\\times N\\times\(D\+1\)\}and widening the first layer\. The graph topology, the temporal modules, and the entire training pipeline are untouched\. No new model has to be trained per variable, and no per\-cell feature\-selection stage is required\. This contrasts with the comparing approaches\. While GTWS\-MLrec must rerun its ensemble and per\-pixel selection over eight input schemes when its predictor set changes\(Yinet al\.,[2023](https://arxiv.org/html/2606.23833#bib.bib47)\), the per\-grid\-cell \(Bi\)LSTMs of GRAiCE are configured and tuned cell by cell\(Palazzoliet al\.,[2025](https://arxiv.org/html/2606.23833#bib.bib31)\), and the patch\-based CNN of DeepRec fixes its feature stack in the patch construction\(Gentneret al\.,[2026](https://arxiv.org/html/2606.23833#bib.bib11)\)\. In the spatio\-temporal graph, by contrast, a new forcing variable is simply one more signal observed at every node, propagated by the same shared convolutional weights, meaning the marginal engineering cost of testing a candidate predictor is close to zero\.

As a pilot exploiting exactly this property, we appended 2 m temperature, known to be a key driver of evaporation, snowmelt, and soil water dynamics a single feature channel that is consistently available in ERA5\. The effect is measurable \(Table[5](https://arxiv.org/html/2606.23833#S7.T5)\), reducing validation errors and increasing correlation with GRACE from 0\.693 to 0\.713 \(grid\-cell\) and from 0\.945 to 0\.947 \(basin level\)\. Given that the architecture accepts arbitrary numbers of node features at this near\-zero cost, soil moisture and snow water equivalent are natural next candidates, since empirical studies link soil moisture anomalies tightly to TWS variations in semi\-arid regions\(Guoet al\.,[2023](https://arxiv.org/html/2606.23833#bib.bib15)\)\. Furthermore, anthropogenic predictors are essential where groundwater depletion dominates the signal\(Asoka and Mishra,[2020](https://arxiv.org/html/2606.23833#bib.bib1)\)directly targeting the arid\-region weakness identified above\. The favorable accuracy\-per\-variable position observed in Table[4](https://arxiv.org/html/2606.23833#S6.T4)thus reflects headroom, not a ceiling\. As a result, the cheap expansion path is precisely the one that can close the remaining gap to the input\-heavy baselines while keeping the model interpretable\.

Table 5\.Validation quality without and with temperature as input\.
### 7\.2\.Physics\-Informed Training

Physics\-informed neural networks \(PINNs\) integrate physical laws as soft constraints into the loss function, improving plausibility and generalization where data are sparse\(Luoet al\.,[2025](https://arxiv.org/html/2606.23833#bib.bib28)\)\. The water balance \(Eq\.[2](https://arxiv.org/html/2606.23833#S4.E2)\) translates directly into such a constraint\. WithTWSAθ\\mathrm\{TWSA\}\_\{\\theta\}the network prediction, the combined lossL=αLdata\+βLphysicsL=\\alpha L\_\{\\mathrm\{data\}\}\+\\beta L\_\{\\mathrm\{physics\}\}uses

\(7\)Lphysics=1N∑i=1N\(\(TWSAθ\(ti\)−TWSAθ\(ti−1\)\)−\(P¯\(ti\)−E¯\(ti\)−R¯\(ti\)\)\)2,\\begin\{split\}L\_\{\\mathrm\{physics\}\}=\\frac\{1\}\{N\}\\sum\_\{i=1\}^\{N\}\\Bigl\(&\\bigl\(\\mathrm\{TWSA\}\_\{\\theta\}\(t\_\{i\}\)\-\\mathrm\{TWSA\}\_\{\\theta\}\(t\_\{i\-1\}\)\\bigr\)\\\\ &\-\\bigl\(\\bar\{P\}\(t\_\{i\}\)\-\\bar\{E\}\(t\_\{i\}\)\-\\bar\{R\}\(t\_\{i\}\)\\bigr\)\\Bigr\)^\{2\},\\end\{split\}penalizing violations of the discrete storage recursionS\(t\)=S\(t−1\)\+P−E−RS\(t\)=S\(t\-1\)\+P\-E\-R\. WhereasGentneret al\.\([2026](https://arxiv.org/html/2606.23833#bib.bib11)\)used the water balance for post\-hoc validation, embedding it into training \(Figure[14](https://arxiv.org/html/2606.23833#S7.F14)\) turns it into a regularizer, particularly promising for arid regions and extreme events where purely data\-driven training has shown deficits\.

This constraint is the natural consequence of the input choice\. Because the three node features \(PP,EE,RR\) are exactly the flux terms of the water balance and the target \(TWSA\\mathrm\{TWSA\}\) is its storage term, inputs and output already stand in a closed physical relationship \(Eq\.[2](https://arxiv.org/html/2606.23833#S4.E2)\)\. The physics loss asks the network to respect, between consecutive months, the same equation that motivated the feature set in the first place\. As a result, no auxiliary variables, derived quantities, or separate physical model are needed to evaluate it\. This represents a structural advantage of the parsimonious design over input\-heavy baselines\. Their larger predictor sets, including vegetation indices, land\-use fractions, and radiation balances, do not naturally conform to a conservation law, making comparable constraints difficult to enforce\. Here the loss is, by construction, hydrologically interpretable which both eases tuning of the weightβ\\betaand makes residual violations diagnosable as genuine hydrological inconsistencies rather than opaque model error\. In short, choosing the minimal water\-balance feature set buys not only data efficiency but also a loss function that is naturally aligned with the governing physics\.

![Refer to caption](https://arxiv.org/html/2606.23833v1/figures/fig21_pinn-hybrid-framework.png)Figure 14\.Hybrid framework combining the data loss with a water balance physics loss\.Enjoying the baseball game from the third\-base seats\. Ichiro Suzuki preparing to bat\.

## 8\.Conclusion

In this work, we applied spatio\-temporal graph deep learning to reconstruct terrestrial water storage prior to the GRACE mission, addressing a long\-standing problem in hydrology and satellite geodesy\. An MTGNN with a static, interpretable hybrid graph, combining geodesic proximity and lagged climatic correlations, was trained on daily ERA5 fluxes against monthly GRACE TWSA and used to reconstruct South American water storage anomalies back to 1940\. Against GRACE/GRACE\-FO the reconstruction achieves a grid\-cell correlation of 0\.69, a basin\-mean correlation of 0\.94 with near\-zero bias, and correctly reproduces the spatial fingerprints of major ENSO events\. Compared with three established reconstruction methods \(GTWS\-MLrec, RM\-REC, and GRAiCE\), our approach achieves competitive basin\-scale performance using only three predictor variables, while sharing the common challenge of reduced accuracy in arid and human\-influenced regions\.

This case study demonstrates that graph architectures matured on urban mobility problems transfer to global geophysical fields once the graph encodes domain structure such as teleconnections\. Our results show that an analogy between traffic flow in road networks and water transport in the climate system is not merely conceptual but can be directly exploited in graph\-based architectures\. They also highlight that the choice between learned and prescribed graphs is a key modeling decision rather than an implementation detail\. For the field of hydrology and geodesy, the resulting reconstruction offers a new, topology\-aware perspective on historical water storage that complements existing per\-cell reconstruction approaches\. Ongoing work scales the workflow from the South American test region to the global1∘1^\{\\circ\}grid, expands the predictor set toward hydrological states and anthropogenic factors, and integrates the water balance constraint into training\.

## References

- L\. Arzoumanidis, J\. Knechtel, J\. Haunert, and Y\. Dehbi \(2026\)Semantic segmentation of historical maps using self\-constructing graph convolutional networks\.Cartography and Geographic Information Science53\(2\),pp\. 177–187\.External Links:[Document](https://dx.doi.org/10.1080/15230406.2025.2468304)Cited by:[§3](https://arxiv.org/html/2606.23833#S3.p7.1)\.
- A\. Asoka and V\. Mishra \(2020\)Anthropogenic and climate contributions on the changes in terrestrial water storage in india\.Journal of Geophysical Research: Atmospheres125\(10\),pp\. 1–21\.External Links:[Document](https://dx.doi.org/https%3A//doi.org/10.1029/2020JD032470)Cited by:[§7\.1](https://arxiv.org/html/2606.23833#S7.SS1.p3.1)\.
- J\. J\. Berman \(2016\)Chapter 4 \- understanding your data\.InData Simplification,pp\. 135–187\.External Links:ISBN 978\-0\-12\-803781\-2,[Document](https://dx.doi.org/https%3A//doi.org/10.1016/B978-0-12-803781-2.00004-7),[Link](https://www.sciencedirect.com/science/article/pii/B9780128037812000047)Cited by:[§5\.4](https://arxiv.org/html/2606.23833#S5.SS4.p2.7)\.
- S\. R\. Cachay, E\. Erickson, A\. F\. C\. Bucker, E\. Pokropek, W\. Potosnak, S\. Osei, and B\. Lütjens \(2020\)Graph neural networks for improved el niño forecasting\.InTackling Climate Change with Machine Learning Workshop at NeurIPS 2020,External Links:[Link](https://www.climatechange.ai/papers/neurips2020/86)Cited by:[§3](https://arxiv.org/html/2606.23833#S3.p7.1),[§5\.1](https://arxiv.org/html/2606.23833#S5.SS1.p1.4)\.
- J\. Chen, A\. Cazenave, C\. Dahle, W\. Llovel, I\. Panet, J\. Pfeffer, and L\. Moreira \(2022\)Applications and challenges of grace and grace follow\-on satellite gravimetry\.Surveys in Geophysics43\(1\),pp\. 305–345\.External Links:[Document](https://dx.doi.org/10.1007/s10712-021-09685-x)Cited by:[§2\.1](https://arxiv.org/html/2606.23833#S2.SS1.p1.1),[§2\.1](https://arxiv.org/html/2606.23833#S2.SS1.p2.1),[§4\.1](https://arxiv.org/html/2606.23833#S4.SS1.p1.2)\.
- M\. Cheng, B\. D\. Tapley, and J\. C\. Ries \(2013\)Deceleration in the earth’s oblateness\.Journal of Geophysical Research: Solid Earth118\(2\),pp\. 740–747\.External Links:[Document](https://dx.doi.org/https%3A//doi.org/10.1002/jgrb.50058)Cited by:[§4\.1](https://arxiv.org/html/2606.23833#S4.SS1.p1.2)\.
- G\. J\. da Silva Duarte, T\. A\. Pereira, E\. J\. F\. Nascimento, D\. P\. P\. Mesquita, and A\. H\. de Souza Jr\. \(2021\)How do loss functions impact the performance of graph neural networks?\.InCongresso Brasileiro de Inteligência Computacional \(CBIC 2021\),External Links:[Document](https://dx.doi.org/10.21528/CBIC2021-161)Cited by:[§5\.5](https://arxiv.org/html/2606.23833#S5.SS5.p2.1)\.
- A\. Eicker, L\. Jensen, V\. Wöhnke, H\. Dobslaw, A\. Kvas, T\. Mayer\-Gürr, and R\. Dill \(2020\)Daily GRACE satellite data evaluate short\-term hydro\-meteorological fluxes from global atmospheric reanalyses\.Scientific Reports10\(1\),pp\. 4504\.External Links:[Document](https://dx.doi.org/10.1038/s41598-020-61166-0)Cited by:[§4\.1](https://arxiv.org/html/2606.23833#S4.SS1.p1.2),[§4\.3](https://arxiv.org/html/2606.23833#S4.SS3.p1.2)\.
- European Space Agency \(2020\)CCI sea level budget closure: product validation and intercomparison report \(PVIR\)\.Technical reportTechnical ReportD4\.7 v1\.1,Climate Change Initiative\.External Links:[Link](https://climate.esa.int/documents/192/ESA_SLBC_cci_D4.7_v1.1.pdf)Cited by:[§6\.3](https://arxiv.org/html/2606.23833#S6.SS3.p1.1)\.
- L\. Q\. Gentner, J\. Gou, M\. J\. Tourian, L\. Börger, N\. Sneeuw, and B\. Soja \(2026\)DeepRec: global terrestrial water storage reconstruction since 1941 using spatiotemporal\-aware deep learning model\.Journal of Geophysical Research: Machine Learning and Computation3\(1\)\.External Links:[Document](https://dx.doi.org/https%3A//doi.org/10.1029/2025JH000889)Cited by:[Table 1](https://arxiv.org/html/2606.23833#S3.T1.15.15.4),[§3](https://arxiv.org/html/2606.23833#S3.p6.1),[§6\.4](https://arxiv.org/html/2606.23833#S6.SS4.p3.1),[§7\.1](https://arxiv.org/html/2606.23833#S7.SS1.p1.1),[§7\.1](https://arxiv.org/html/2606.23833#S7.SS1.p2.3),[§7\.2](https://arxiv.org/html/2606.23833#S7.SS2.p1.3)\.
- J\. Guo, D\. Mu, X\. Liu, H\. Yan, and H\. Dai \(2014\)Equivalent water height extracted from GRACE gravity field model with robust independent component analysis\.Acta Geophysica62\(4\),pp\. 953–972\.External Links:[Document](https://dx.doi.org/10.2478/s11600-014-0210-0)Cited by:[§4\.1](https://arxiv.org/html/2606.23833#S4.SS1.p1.2)\.
- Y\. Guo, N\. Xing, F\. Gan, B\. Yan, and J\. Bai \(2023\)Evaluating the hydrological components contributions to terrestrial water storage changes in inner mongolia with multiple datasets\.Sensors23\(14\)\.External Links:[Document](https://dx.doi.org/10.3390/s23146452)Cited by:[§7\.1](https://arxiv.org/html/2606.23833#S7.SS1.p3.1)\.
- H\. Hersbach, B\. Bell, P\. Berrisford, G\. Biavati, A\. Horányi, J\. Muñoz Sabater, J\. Nicolas, C\. Peubey, R\. Radu, I\. Rozum, D\. Schepers, A\. Simmons, C\. Soci, D\. Dee, and J\. Thépaut \(2023\)ERA5 hourly data on single levels from 1940 to present\.Note:Copernicus Climate Change Service \(C3S\) Climate Data Store \(CDS\)External Links:[Document](https://dx.doi.org/10.24381/cds.adbb2d47)Cited by:[§2\.2](https://arxiv.org/html/2606.23833#S2.SS2.p1.1),[§4\.2](https://arxiv.org/html/2606.23833#S4.SS2.p1.4)\.
- H\. Hersbach, B\. Bell, P\. Berrisford, S\. Hirahara, A\. Horányi, J\. Muñoz\-Sabater, J\. Nicolas, C\. Peubey, R\. Radu, D\. Schepers, A\. Simmons, C\. Soci, S\. Abdalla, X\. Abellan, G\. Balsamo, P\. Bechtold, G\. Biavati, J\. Bidlot, M\. Bonavita, G\. De Chiara, P\. Dahlgren, D\. Dee, M\. Diamantakis, R\. Dragani, J\. Flemming, R\. Forbes, M\. Fuentes, A\. Geer, L\. Haimberger, S\. Healy, R\. J\. Hogan, E\. Hólm, M\. Janisková, S\. Keeley, P\. Laloyaux, P\. Lopez, C\. Lupu, G\. Radnoti, P\. de Rosnay, I\. Rozum, F\. Vamborg, S\. Villaume, and J\. Thépaut \(2020\)The ERA5 global reanalysis\.Quarterly Journal of the Royal Meteorological Society146\(730\),pp\. 1999–2049\.External Links:[Document](https://dx.doi.org/https%3A//doi.org/10.1002/qj.3803)Cited by:[§2\.2](https://arxiv.org/html/2606.23833#S2.SS2.p1.1),[§4\.2](https://arxiv.org/html/2606.23833#S4.SS2.p1.4)\.
- R\. Huang, Y\. Long, and T\. Zia \(2025\)A physical\-enhanced spatio\-temporal graph convolutional network for river flow prediction\.Applied Sciences15\(16\)\.External Links:[Document](https://dx.doi.org/10.3390/app15169054)Cited by:[§5\.4](https://arxiv.org/html/2606.23833#S5.SS4.p1.1)\.
- V\. Humphrey and L\. Gudmundsson \(2019\)GRACE\-REC: a reconstruction of climate\-driven water storage changes over the last century\.Earth System Science Data11\(3\),pp\. 1153–1170\.External Links:[Document](https://dx.doi.org/10.5194/essd-11-1153-2019)Cited by:[§1](https://arxiv.org/html/2606.23833#S1.p1.1),[§1](https://arxiv.org/html/2606.23833#S1.p3.1),[Table 1](https://arxiv.org/html/2606.23833#S3.T1.3.3.4),[§3](https://arxiv.org/html/2606.23833#S3.p2.1)\.
- L\. Jensen, A\. Eicker, H\. Dobslaw, and R\. Pail \(2020\)Emerging changes in terrestrial water storage variability as a target for future satellite gravity missions\.Remote Sensing12\(23\)\.External Links:[Document](https://dx.doi.org/10.3390/rs12233898)Cited by:[§1](https://arxiv.org/html/2606.23833#S1.p2.1)\.
- D\. P\. Kingma and J\. Ba \(2015\)Adam: a method for stochastic optimization\.In3rd International Conference on Learning Representations \(ICLR\),External Links:[Link](https://arxiv.org/abs/1412.6980)Cited by:[§5\.5](https://arxiv.org/html/2606.23833#S5.SS5.p2.1)\.
- N\. Kirschstein and Y\. Sun \(2024\)The merit of river network topology for neural flood forecasting\.InProceedings of the 41st International Conference on Machine Learning,ICML’24\.Cited by:[§5\.4](https://arxiv.org/html/2606.23833#S5.SS4.p1.1)\.
- E\. Kurtenbach, A\. Eicker, T\. Mayer\-Gürr, M\. Holschneider, H\. Hayn, M\. Fuhrmann, and J\. Kusche \(2012\)Improved daily GRACE gravity field solutions using a Kalman smoother\.Journal of Geodynamics59–60,pp\. 39–48\.External Links:[Document](https://dx.doi.org/10.1016/j.jog.2012.02.006)Cited by:[§2\.1](https://arxiv.org/html/2606.23833#S2.SS1.p2.1)\.
- F\. W\. Landerer, F\. M\. Flechtner, H\. Save, F\. H\. Webb, T\. Bandikova, W\. I\. Bertiger, S\. V\. Bettadpur, S\. H\. Byun, C\. Dahle, H\. Dobslaw, E\. Fahnestock, N\. Harvey, Z\. Kang, G\. L\. H\. Kruizinga, B\. D\. Loomis, C\. McCullough, M\. Murböck, P\. Nagel, M\. Paik, N\. Pie, S\. Poole, D\. Strekalov, M\. E\. Tamisiea, F\. Wang, M\. M\. Watkins, H\. Wen, D\. N\. Wiese, and D\. Yuan \(2020\)Extending the global mass change data record: grace follow\-on instrument and science data performance\.Geophysical Research Letters47\(12\)\.External Links:[Document](https://dx.doi.org/https%3A//doi.org/10.1029/2020GL088306)Cited by:[§1](https://arxiv.org/html/2606.23833#S1.p2.1)\.
- F\. Li, J\. Kusche, N\. Chao, Z\. Wang, and A\. Löcher \(2021\)Long\-term \(1979\-present\) total water storage anomalies over the global land derived by reconstructing grace data\.Geophysical Research Letters48\(8\),pp\. 1–10\.External Links:[Document](https://dx.doi.org/https%3A//doi.org/10.1029/2021GL093492)Cited by:[3rd item](https://arxiv.org/html/2606.23833#S1.I1.i3.p1.1),[Table 1](https://arxiv.org/html/2606.23833#S3.T1.18.18.4),[§3](https://arxiv.org/html/2606.23833#S3.p3.1),[§6\.4](https://arxiv.org/html/2606.23833#S6.SS4.p1.1),[§6\.4](https://arxiv.org/html/2606.23833#S6.SS4.p3.1),[§7\.1](https://arxiv.org/html/2606.23833#S7.SS1.p1.1)\.
- C\. Lorenz, H\. Kunstmann, B\. Devaraju, M\. J\. Tourian, N\. Sneeuw, and J\. Riegger \(2014\)Large\-scale runoff from landmasses: a global assessment of the closure of the hydrological and atmospheric water balances\.Journal of Hydrometeorology15\(6\),pp\. 2111–2139\.External Links:[Document](https://dx.doi.org/10.1175/JHM-D-13-0157.1)Cited by:[§4\.3](https://arxiv.org/html/2606.23833#S4.SS3.p1.2)\.
- K\. Luo, J\. Zhao, Y\. Wang, J\. Li, J\. Wen, J\. Liang, H\. Soekmadji, and S\. Liao \(2025\)Physics\-informed neural networks for pde problems: a comprehensive review\.Artificial Intelligence Review58\(10\),pp\. 323\.External Links:[Document](https://dx.doi.org/10.1007/s10462-025-11322-7)Cited by:[§7\.2](https://arxiv.org/html/2606.23833#S7.SS2.p1.2)\.
- T\. Mayer\-Gürr, S\. Behzadpour, A\. Eicker, M\. Ellmer, B\. Koch, S\. Krauss, C\. Pock, D\. Rieser, S\. Strasser, B\. Süsser\-Rechberger, N\. Zehentner, and A\. Kvas \(2021\)GROOPS: a software toolkit for gravity field recovery and GNSS processing\.Computers & Geosciences155,pp\. 104864\.External Links:[Document](https://dx.doi.org/https%3A//doi.org/10.1016/j.cageo.2021.104864)Cited by:[§4\.1](https://arxiv.org/html/2606.23833#S4.SS1.p1.2)\.
- T\. Mayer\-Gürr, S\. Behzadpur, M\. Ellmer, A\. Kvas, B\. Klinger, S\. Strasser, and N\. Zehentner \(2018\)ITSG\-Grace2018 – monthly, daily and static gravity field solutions from GRACE\.External Links:[Document](https://dx.doi.org/10.5880/ICGEM.2018.003)Cited by:[§4\.1](https://arxiv.org/html/2606.23833#S4.SS1.p1.2)\.
- I\. Palazzoli, S\. Ceola, and P\. Gentine \(2025\)GRAiCE: reconstructing terrestrial water storage anomalies with recurrent neural networks\.Scientific Data12\(1\),pp\. 146\.External Links:[Document](https://dx.doi.org/10.1038/s41597-025-04403-3)Cited by:[3rd item](https://arxiv.org/html/2606.23833#S1.I1.i3.p1.1),[Table 1](https://arxiv.org/html/2606.23833#S3.T1.9.9.4),[§3](https://arxiv.org/html/2606.23833#S3.p5.1),[§6\.4](https://arxiv.org/html/2606.23833#S6.SS4.p1.1),[§6\.4](https://arxiv.org/html/2606.23833#S6.SS4.p3.1),[§6\.4](https://arxiv.org/html/2606.23833#S6.SS4.p6.1),[§7\.1](https://arxiv.org/html/2606.23833#S7.SS1.p1.1),[§7\.1](https://arxiv.org/html/2606.23833#S7.SS1.p2.3)\.
- N\. Qian, G\. Chang, P\. Ditmar, J\. Gao, and Z\. Wei \(2022\)Sparse ddk: a data\-driven decorrelation filter for grace level\-2 products\.Remote Sensing14\(12\)\.External Links:[Document](https://dx.doi.org/10.3390/rs14122810)Cited by:[§4\.1](https://arxiv.org/html/2606.23833#S4.SS1.p1.2)\.
- Z\. Reitermanová \(2010\)Data splitting\.InWDS’10 Proceedings of Contributed Papers: Part I – Mathematics and Computer Sciences,pp\. 31–36\.Cited by:[§5\.5](https://arxiv.org/html/2606.23833#S5.SS5.p1.1)\.
- F\. N\. Silva, D\. A\. Vega\-Oliveros, X\. Yan, A\. Flammini, F\. Menczer, F\. Radicchi, B\. Kravitz, and S\. Fortunato \(2021\)Detecting climate teleconnections with granger causality\.Geophysical Research Letters48\(18\)\.External Links:[Document](https://dx.doi.org/https%3A//doi.org/10.1029/2021GL094707)Cited by:[§5\.4](https://arxiv.org/html/2606.23833#S5.SS4.p2.7)\.
- D\. Singh and B\. Singh \(2022\)Feature wise normalization: an effective way of normalizing data\.Pattern Recognition122,pp\. 108307\.External Links:[Document](https://dx.doi.org/https%3A//doi.org/10.1016/j.patcog.2021.108307)Cited by:[§5\.5](https://arxiv.org/html/2606.23833#S5.SS5.p1.1)\.
- A\. Y\. Sun, P\. Jiang, M\. K\. Mudunuru, and X\. Chen \(2021\)Explore spatio\-temporal learning of large sample hydrology using graph neural networks\.Water Resources Research57\(12\)\.External Links:[Document](https://dx.doi.org/https%3A//doi.org/10.1029/2021WR030394)Cited by:[§3](https://arxiv.org/html/2606.23833#S3.p7.1)\.
- Y\. Sun, P\. Ditmar, and R\. Riva \(2017\)Statistically optimal estimation of degree\-1 and c20 coefficients based on grace data and an ocean bottom pressure model\.Geophysical Journal International210\(3\),pp\. 1305–1322\.External Links:[Document](https://dx.doi.org/10.1093/gji/ggx241)Cited by:[§2\.1](https://arxiv.org/html/2606.23833#S2.SS1.p2.1),[§4\.1](https://arxiv.org/html/2606.23833#S4.SS1.p1.2)\.
- B\. D\. Tapley, S\. Bettadpur, M\. Watkins, and C\. Reigber \(2004\)The gravity recovery and climate experiment: mission overview and early results\.Geophysical Research Letters31\(9\)\.External Links:[Document](https://dx.doi.org/https%3A//doi.org/10.1029/2004GL019920)Cited by:[§1](https://arxiv.org/html/2606.23833#S1.p2.1)\.
- B\. D\. Tapley, M\. M\. Watkins, F\. Flechtner, C\. Reigber, S\. Bettadpur, M\. Rodell, I\. Sasgen, J\. S\. Famiglietti, F\. W\. Landerer, D\. P\. Chambers, J\. T\. Reager, A\. S\. Gardner, H\. Save, E\. R\. Ivins, S\. C\. Swenson, C\. Boening, C\. Dahle, D\. N\. Wiese, H\. Dobslaw, M\. E\. Tamisiea, and I\. Velicogna \(2019\)Contributions of grace to understanding climate change\.Nature Climate Change9\(5\),pp\. 358–369\.External Links:[Document](https://dx.doi.org/10.1038/s41558-019-0456-2)Cited by:[§1](https://arxiv.org/html/2606.23833#S1.p2.1),[§2\.1](https://arxiv.org/html/2606.23833#S2.SS1.p1.1)\.
- I\. R\. Ward, J\. Joyner, C\. Lickfold, Y\. Guo, and M\. Bennamoun \(2022\)A practical tutorial on graph neural networks\.ACM Comput\. Surv\.54\(10s\)\.External Links:[Document](https://dx.doi.org/10.1145/3503043)Cited by:[§3](https://arxiv.org/html/2606.23833#S3.p7.1)\.
- Z\. Wu, S\. Pan, F\. Chen, G\. Long, C\. Zhang, and P\. S\. Yu \(2021\)A comprehensive survey on graph neural networks\.IEEE Transactions on Neural Networks and Learning Systems32\(1\),pp\. 4–24\.External Links:[Document](https://dx.doi.org/10.1109/TNNLS.2020.2978386)Cited by:[§5\.1](https://arxiv.org/html/2606.23833#S5.SS1.p1.4)\.
- Z\. Wu, S\. Pan, G\. Long, J\. Jiang, X\. Chang, and C\. Zhang \(2020\)Connecting the dots: multivariate time series forecasting with graph neural networks\.InProceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining,KDD ’20,New York, NY, USA,pp\. 753–763\.External Links:ISBN 9781450379984,[Link](https://doi.org/10.1145/3394486.3403118),[Document](https://dx.doi.org/10.1145/3394486.3403118)Cited by:[§1](https://arxiv.org/html/2606.23833#S1.p4.1),[Figure 1](https://arxiv.org/html/2606.23833#S5.F1),[Figure 1](https://arxiv.org/html/2606.23833#S5.F1.3.2),[§5\.2](https://arxiv.org/html/2606.23833#S5.SS2.p1.1),[§5\.3](https://arxiv.org/html/2606.23833#S5.SS3.p1.8),[§5\.4](https://arxiv.org/html/2606.23833#S5.SS4.p1.1),[Table 2](https://arxiv.org/html/2606.23833#S5.T2),[Table 2](https://arxiv.org/html/2606.23833#S5.T2.9.2)\.
- J\. Yin, L\. J\. Slater, A\. Khouakhi, L\. Yu, P\. Liu, F\. Li, Y\. Pokhrel, and P\. Gentine \(2023\)GTWS\-mlrec: global terrestrial water storage reconstruction by machine learning from 1940 to present\.Earth System Science Data15\(12\),pp\. 5597–5615\.External Links:[Document](https://dx.doi.org/10.5194/essd-15-5597-2023)Cited by:[3rd item](https://arxiv.org/html/2606.23833#S1.I1.i3.p1.1),[§1](https://arxiv.org/html/2606.23833#S1.p1.1),[Table 1](https://arxiv.org/html/2606.23833#S3.T1.12.12.4),[§3](https://arxiv.org/html/2606.23833#S3.p4.1),[§6\.4](https://arxiv.org/html/2606.23833#S6.SS4.p1.1),[§6\.4](https://arxiv.org/html/2606.23833#S6.SS4.p3.1),[§6\.4](https://arxiv.org/html/2606.23833#S6.SS4.p6.1),[§7\.1](https://arxiv.org/html/2606.23833#S7.SS1.p1.1),[§7\.1](https://arxiv.org/html/2606.23833#S7.SS1.p2.3)\.
- Q\. Zhang, D\. Paul, M\. Miller, A\. Morales, and S\. Gao \(2025\)Scalable inter\-county food flow prediction using graph neural networks\.InProceedings of the 33rd ACM International Conference on Advances in Geographic Information Systems,SIGSPATIAL ’25,New York, NY, USA,pp\. 985–994\.External Links:[Document](https://dx.doi.org/10.1145/3748636.3764168)Cited by:[§3](https://arxiv.org/html/2606.23833#S3.p7.1)\.
- J\. Zhou, G\. Cui, S\. Hu, Z\. Zhang, C\. Yang, Z\. Liu, L\. Wang, C\. Li, and M\. Sun \(2020\)Graph neural networks: a review of methods and applications\.AI Open1,pp\. 57–81\.External Links:[Document](https://dx.doi.org/https%3A//doi.org/10.1016/j.aiopen.2021.01.001)Cited by:[§3](https://arxiv.org/html/2606.23833#S3.p7.1),[§5\.3](https://arxiv.org/html/2606.23833#S5.SS3.p1.8)\.
- X\. Zhu and Z\. Ghahramani \(2002\)Learning from labeled and unlabeled data with label propagation\.Technical reportTechnical ReportCMU\-CALD\-02\-107,Carnegie Mellon University, School of Computer Science\.External Links:[Link](https://mlg.eng.cam.ac.uk/zoubin/papers/CMU-CALD-02-107.pdf)Cited by:[§5\.4](https://arxiv.org/html/2606.23833#S5.SS4.p2.9)\.
Reconstructing GRACE Terrestrial Water Storage with Spatio-Temporal Graph Neural Networks: An Application to South America

Similar Articles

GRACE: Gated Refinement for Accurate Causal Edge Discovery in High-Dimensional Time Series

A Global-Local Graph Attention Network for Traffic Forecasting

Towards Fast GNN Surrogates for CO2 Migration in Complex Geological Formations

A Temporally Augmented Graph Attention Network for Affordance Classification

Graph-Conditioned Mixture of Graph Neural Network Experts for Traffic Forecasting

Submit Feedback

Similar Articles

GRACE: Gated Refinement for Accurate Causal Edge Discovery in High-Dimensional Time Series
A Global-Local Graph Attention Network for Traffic Forecasting
Towards Fast GNN Surrogates for CO2 Migration in Complex Geological Formations
A Temporally Augmented Graph Attention Network for Affordance Classification
Graph-Conditioned Mixture of Graph Neural Network Experts for Traffic Forecasting