Towards Anomaly Detection on Relational Data
Summary
This paper introduces RelAD, a reconstruction-based framework for detecting anomalies in relational databases by jointly modeling attribute and relational edge reconstruction. Extensive experiments on six new benchmarks show RelAD outperforms existing methods.
View Cached Full Text
Cached at: 06/18/26, 05:44 AM
# Towards Anomaly Detection on Relational Data
Source: [https://arxiv.org/html/2606.18621](https://arxiv.org/html/2606.18621)
Shiyuan Li Griffith University shiyuan\.li@griffith\.edu\.au&Yunfeng Zhao Guangxi University yunf\.zhao@st\.gxu\.edu\.cn&Yue Tan Griffith University yue\.tan@griffith\.edu\.au&Qingfeng Chen Guangxi University qingfeng@gxu\.edu\.cn&Yixin Liu∗ Griffith University yixin\.liu@griffith\.edu\.au&Shirui Pan Griffith University s\.pan@griffith\.edu\.au
###### Abstract
Relational databases are widely used for managing structured data in real\-world systems\. Detecting anomalies from such relational data is crucial for identifying fraud, risks, and abnormal behaviors, yet remains under\-explored\. The key challenges lie in the intrinsic complexity of relational data: multi\-table attributes are high\-dimensional and heterogeneous, making sparse abnormal clues easy to overwhelm by normal or irrelevant information; and anomalies may further manifest as abnormal connection patterns across different foreign\-key relations, which existing tabular and graph anomaly detection methods are ill\-suited to capture\. To address them, we propose RelAD, a reconstruction\-based framework that captures anomalies from both attribute and relational edge reconstruction\. RelAD contains two core modules: conditional sparse\-gated attribute reconstruction, which suppresses redundant multi\-table attributes and emphasizes abnormal semantic blocks, and dual\-view multi\-relational edge reconstruction, which detects relation\-specific abnormal connections from both intrinsic and behavioral entity profiles\. The resulting attribute and relational signals are integrated through a lightweight fusion module to produce the final anomaly score\. We further construct 6 benchmark datasets with systematic anomalies, on which extensive experiments show that RelAD consistently outperforms other baselines while achieving competitive efficiency\.
## 1Introduction
Relational databases are widely used as the primary storage abstraction for structured data in real\-world applications such as financial risk control, e\-commerce, and industrial operation monitoringRobinsonet al\.\([2024a](https://arxiv.org/html/2606.18621#bib.bib1)\); Dwivediet al\.\([2025](https://arxiv.org/html/2606.18621#bib.bib2)\)\. Unlike single\-table data, a relational database organizes information across multiple tables interconnected via primary–foreign key relationships, modeling rich structural and temporal dependencies between entities\. For example, in a risk\-control system, a user can be associated not only with attributes in the target table, such as age, region, and registration time, but also with related entities such as products, devices, transactions, and visit records[Dwivediet al\.](https://arxiv.org/html/2606.18621#bib.bib17); Chenet al\.\([2025b](https://arxiv.org/html/2606.18621#bib.bib18)\)\. Since abnormal behaviors in real business systems may naturally be hidden in such multi\-table dependencies, detecting these anomalous entities from relational databases is crucial for fraud detection, risk warning, and abnormal behavior discovery \(an example is shown in Fig\.[1](https://arxiv.org/html/2606.18621#S1.F1)a\)\. Recent studies of relational deep learning \(RDL\) have developed effective paradigms for representation learning on relational databases by treating them as heterogeneous graphs and modeling entities and their relationships across interconnected tablesRobinsonet al\.\([2024a](https://arxiv.org/html/2606.18621#bib.bib1)\);[Ranjanet al\.](https://arxiv.org/html/2606.18621#bib.bib21); Wanget al\.\([2025](https://arxiv.org/html/2606.18621#bib.bib20)\)\. However, how to identify anomalous entities from relational data, a problem referred to as relational anomaly detection \(RAD\) in this paper, remains under\-explored\.
Figure 1:Sketch maps of \(a\) Anomalous entity in relational database; and the limitations of adapting \(b\) tabular anomaly detection methods or \(c\) graph anomaly detection methods to relational databases\.To address this under\-explored problem, a straightforward solution is to transform the database into a format compatible with existing anomaly detection methods and directly adapt these approaches for RAD\. Available methods mainly fall into two categories: tabular anomaly detection \(TAD\)Thimonieret al\.\([2024a](https://arxiv.org/html/2606.18621#bib.bib32)\); Yeet al\.\([2025a](https://arxiv.org/html/2606.18621#bib.bib34)\)and graph anomaly detection \(GAD\)Panet al\.\([2026b](https://arxiv.org/html/2606.18621#bib.bib53),[2025](https://arxiv.org/html/2606.18621#bib.bib48)\); Dinget al\.\([2019](https://arxiv.org/html/2606.18621#bib.bib26)\); Liuet al\.\([2021](https://arxiv.org/html/2606.18621#bib.bib28)\); Chenet al\.\([2024](https://arxiv.org/html/2606.18621#bib.bib35)\); Panet al\.\([2026a](https://arxiv.org/html/2606.18621#bib.bib6)\)\. TAD methods usually represent each sample as an independent feature vector and identify anomalies through density estimation, distance measurement, or reconstruction errorsBreuniget al\.\([2000](https://arxiv.org/html/2606.18621#bib.bib24)\); Goodgeet al\.\([2022](https://arxiv.org/html/2606.18621#bib.bib31)\); Yinet al\.\([2024](https://arxiv.org/html/2606.18621#bib.bib33)\)\. When applied to relational databases, however, TAD methods rely on flattening relational schemas into single tables through feature engineering and aggregation \(as in Fig\.[1](https://arxiv.org/html/2606.18621#S1.F1)b\)\. Although this preprocessing step makes model input convenient, it may discard the dependencies encoded by primary–foreign key relationships\. Meanwhile, because relational databases contain heterogeneous attributes distributed across multiple tables, the resulting feature matrix often contains high\-dimensional and redundant feature groups, which can dilute local abnormal signals in global objectives\. On the other hand, GAD methods jointly model node attributes and graph structuresChenet al\.\([2025a](https://arxiv.org/html/2606.18621#bib.bib52)\); Tanet al\.\([2024](https://arxiv.org/html/2606.18621#bib.bib51)\), and have achieved promising performance in scenarios such as social networks, transaction networks, and citation networksLiuet al\.\([2024](https://arxiv.org/html/2606.18621#bib.bib29)\); Zhaoet al\.\([2025](https://arxiv.org/html/2606.18621#bib.bib30)\)\. Nevertheless, most GAD methods are designed for homogeneous graphs with a single adjacency matrix, making it difficult to directly preserve multiple relation types induced by different primary–foreign key relationships\. If a relational database is converted into a homogeneous graph \(as in Fig\.[1](https://arxiv.org/html/2606.18621#S1.F1)c\), relation\-specific semantics can be mixed, and the model may fail to distinguish which relational behavior pattern causes the abnormality\.
The limitations of existing TAD and GAD methods in handling complex relational data highlight the urgent need to develop dedicated frameworks for RAD\. To achieve this goal, the key objective is to exploit both rich entity attributes and primary–foreign key relationships for accurate anomaly prediction\. Driven by this objective, we identify two key challenges\.C1\- Feature redundancy and signal dilution\.Unlike standard tabular data with carefully curated features from a single table, relational databases are inherently multi\-table and heterogeneous: a central entity can be described by attributes in the central table, attributes from child tables, and feature groups derived from different entity contexts\. This naturally leads to a high\-dimensional and redundant feature space in which only a small fraction of attributes can indicate abnormal behavior\. Consequently, the massive volume of anomaly\-irrelevant features tends to dominate the model’s learning objective, diluting or completely obscuring subtle local abnormal signals\. In this case, how to adaptively extract sparse anomaly indicators from overwhelming redundant features poses a significant challenge\.C2\- Relational heterogeneity and complex entity dependencies\.Anomalies in relational databases are often embedded in complex dependencies, where a central entity interacts with different types of neighbor entities through semantically distinct primary–foreign key relationships\. More importantly, different types of relations may contribute unequally to the abnormality of an entity, requiring the model to selectively focus on the most informative dependencies\. In this context, exploiting heterogeneous relations to capture relation\-specific abnormal connection patterns requires effective modeling of relation semantics\. In this context, how to exploit heterogeneous relations to capture relation\-specific abnormal connection patterns poses another key challenge\.
To tackle these challenges, we proposeRelationalAnomalyDetection \(RelAD\), a reconstruction\-based framework for RAD\. RelAD is specifically designed for relational databases, with the core idea of characterizing anomalies from both attribute reconstruction and relational edge reconstruction\. To addressC1, RelAD incorporates a conditional sparse\-gated attribute reconstruction module\. This module generates conditional masks for central\-table attributes and child\-table aggregated attributes according to the block structure of relational databases, enabling the model to adaptively select informative dimensions before reconstruction\. It further adopts block\-specific decoding and block\-level residuals, allowing anomaly scoring to focus on the most significant local attribute deviations instead of being diluted by global average errors\. To addressC2, RelAD proposes a dual\-view multi\-relational edge reconstruction module, which directly reconstructs relation\-specific edges over the heterogeneous graph induced by primary–foreign key relationships\. Specifically, the model encodes each central entity from both the central\-table self profile and the child\-table aggregated profile, and learns relation\-specific neighbor\-entity representations for each relation type\. In this way, RelAD measures whether an entity profile can explain its connection behaviors under different relations\. Finally, RelAD fuses three complementary signals, including attribute\-block anomalies, self\-profile relational anomalies, and child\-profile relational anomalies, to produce the final anomaly score and cover diverse anomaly sources\. In summary, this paper makes the following contributions:
- •Problem\.We, to the best of our knowledge, are the first to propose and formalize the problem of relational data anomaly detection\. Based on relational benchmarks, we further design dataset\-specific anomaly synthesis rules to enable systematic model evaluation\.
- •Methodology\.We propose a novel reconstruction\-based relational anomaly detection method RelAD, which jointly captures local attribute deviations and relation\-specific abnormal connection patterns through attribute and multi\-relational edge reconstruction\.
- •Experiments\.We conduct extensive experiments for evaluation, and the experimental results validate the effectiveness, robustness, and efficiency of RelAD on six benchmark datasets\.
## 2Preliminary
Relational Database\.Following relational deep learning, we consider a relational database as a collection of tables𝒟=\{𝒯k\}k=1K\\mathcal\{D\}=\\\{\\mathcal\{T\}^\{k\}\\\}\_\{k=1\}^\{K\}, whereKKis the number of tables\. Each table𝒯k\\mathcal\{T\}^\{k\}contains rows corresponding to entities and columns corresponding to their attributes\. Tables are connected by primary–foreign key relationships\. A primary key uniquely identifies a row in a table, while a foreign key in another table references this primary key and thereby defines a typed relationship between rows from two tables\. Thus, relational databases contain both table attributes and inter\-table entity relationships\.
Heterogeneous Graph View\.A relational database can be represented as a heterogeneous graph𝒢=\(\{𝒱k\}k=1K,\{ℰr\}r∈ℛ\)\\mathcal\{G\}=\(\\\{\\mathcal\{V\}^\{k\}\\\}\_\{k=1\}^\{K\},\\\{\\mathcal\{E\}\_\{r\}\\\}\_\{r\\in\\mathcal\{R\}\}\), where each node set𝒱k\\mathcal\{V\}^\{k\}is induced by the rows of table𝒯k\\mathcal\{T\}^\{k\}\. The setℛ\\mathcal\{R\}contains relation types derived from the database schema\. Specifically, each relation typer∈ℛr\\in\\mathcal\{R\}corresponds to a primary–foreign key constraint, or equivalently a typed link between a source table, a foreign\-key column, and a referenced table\. The associated edge setℰr\\mathcal\{E\}\_\{r\}is obtained by instantiating this constraint over the rows of the two tables\. An edge\(v,i\)∈ℰr\(v,i\)\\in\\mathcal\{E\}\_\{r\}indicates that two rows are connected under relation typerr\. Different relation types preserve different semantics, such as user–item interactions, user–device associations, or paper–citation links\. This heterogeneous graph view is only used to define the relational structure; our setting keeps relation\-specific edge sets instead of collapsing them into a single homogeneous graph\.
Relational Anomaly Detection Setting\.For RAD, we designate one table as the target, or central, table whose entities are to be detected, and refer to schema\-connected tables as child or related tables when describing entity\-centric features and neighborhoods\. We focus on identifying abnormal entities from the central table𝒯0\\mathcal\{T\}^\{0\}\. Let𝒰=\{u1,…,uN\}\\mathcal\{U\}=\\\{u\_\{1\},\\dots,u\_\{N\}\\\}denote theNNcentral entities\. Each central entityu∈𝒰u\\in\\mathcal\{U\}may be connected to heterogeneous neighbor entities through multiple relation types\. We represent the relation\-specific edge set incident to central entities as
ℰ=\{\(u,r,i\)∣u∈𝒰,r∈ℛ,i∈𝒱r\},\\mathcal\{E\}=\\\{\(u,r,i\)\\mid u\\in\\mathcal\{U\},\\;r\\in\\mathcal\{R\},\\;i\\in\\mathcal\{V\}^\{r\}\\\},\(1\)where𝒱r\\mathcal\{V\}^\{r\}denotes the neighbor\-entity table associated with relation typerr, and\(u,i\)∈ℰr\(u,i\)\\in\\mathcal\{E\}\_\{r\}indicates thatuuis connected to neighbor entityiiunder relationrr\. An anomaly label vector𝐲∈\{0,1\}N\\mathbf\{y\}\\in\\\{0,1\\\}^\{N\}is used for evaluation, where𝐲u=1\\mathbf\{y\}\_\{u\}=1indicates that target entityuuis anomalous and𝐲u=0\\mathbf\{y\}\_\{u\}=0otherwise\. The goal is to learn an anomaly scoring functionf:𝒰→ℝf:\\mathcal\{U\}\\rightarrow\\mathbb\{R\}such that anomalous entities receive larger scores than normal entities\. At a high level, anomalies may appear as abnormal attribute patterns of a target entity, abnormal relation\-specific connection patterns, or inconsistencies between entity attributes and relational connections\.
Initial Features\.For each table, raw columns can include numerical, categorical, timestamp, and text attributes\. Following the feature preprocessing protocol of RelBenchGuet al\.\([2026](https://arxiv.org/html/2606.18621#bib.bib19)\), we encode heterogeneous raw columns into unified row\-level representations, where non\-text attributes are normalized or embedded according to their types and text attributes are represented by pretrained text embeddings\. Let𝐡vk\\mathbf\{h\}\_\{v\}^\{k\}denote the initial feature vector of rowvvin table𝒯k\\mathcal\{T\}^\{k\}\. For each target entityuu, we construct an attribute vector:
𝐱u=Concat\(𝐱uself,𝐱uagg,1,…,𝐱uagg,B\)∈ℝd,\\mathbf\{x\}\_\{u\}=\\operatorname\{Concat\}\\left\(\\mathbf\{x\}\_\{u\}^\{self\},\\mathbf\{x\}\_\{u\}^\{agg,1\},\\dots,\\mathbf\{x\}\_\{u\}^\{agg,B\}\\right\)\\in\\mathbb\{R\}^\{d\},\(2\)where𝐱uself\\mathbf\{x\}\_\{u\}^\{self\}is the row feature ofuufrom the target table, and each𝐱uagg,b\\mathbf\{x\}\_\{u\}^\{agg,b\}is an aggregated feature block computed from rows in related tables\. Here,BBdenotes the number of child\-table aggregated blocks, so the complete attribute vector containsB\+1B\+1blocks including the self block\. The aggregation uses statistics such as mean, standard deviation, and count over rows connected touu\. The resulting matrix is denoted by𝐗∈ℝN×d\\mathbf\{X\}\\in\\mathbb\{R\}^\{N\\times d\}\. For relation\-specific edge reconstruction, each relation typerradditionally provides an edge listℰr\\mathcal\{E\}\_\{r\}and a neighbor feature matrix𝐇r∈ℝ\|𝒱r\|×dr\\mathbf\{H\}^\{r\}\\in\\mathbb\{R\}^\{\|\\mathcal\{V\}^\{r\}\|\\times d\_\{r\}\}, wheredrd\_\{r\}is the feature dimension of neighbor entities under relationrr\.
## 3RelAD: Reconstruction\-based Relational Anomaly Detection
Figure 2:The overall pipeline of RelAD for relational data anomaly detection\.In this section, we introduce RelAD, a reconstruction\-based framework for anomaly detection on relational data\. Since real\-world relational databases rarely provide anomaly labels and normal entities usually dominate the data, an unsupervised reconstruction model can learn prevalent normal patterns from unlabeled entities\. Entities whose attributes or relational behaviors deviate from these patterns are then expected to incur larger reconstruction residuals, making reconstruction error a natural anomaly indicator\. Based on this principle, RelAD characterizes anomalies from two complementary views, as shown in Fig\.[2](https://arxiv.org/html/2606.18621#S3.F2)\. To suppress redundant cross\-table attributes and preserve local semantic deviations, we first introduce aconditional sparse\-gated attribute reconstructionmodule in Sec\.[3\.1](https://arxiv.org/html/2606.18621#S3.SS1)\. Next, to model relation\-specific abnormal connections under heterogeneous foreign keys, we propose adual\-view multi\-relational edge reconstructionmodule in Sec\.[3\.2](https://arxiv.org/html/2606.18621#S3.SS2), which reconstructs relational edges from both the intrinsic entity profile and the child\-table behavioral profile\. Finally, we combine the most salient attribute and relational signals through ananomaly score fusionstrategy \(Sec\.[3\.3](https://arxiv.org/html/2606.18621#S3.SS3)\)\.
### 3\.1Conditional Sparse\-Gated Attribute Reconstruction
In relational databases, a central entity can be represented using heterogeneous attributes from its central table and multiple child tables\. These attributes naturally form semantic blocks with different relevance to anomaly detection\. Since abnormal signals may only appear in a few blocks or dimensions, directly reconstructing the entire attribute vector can make numerous anomaly\-irrelevant features dominate the objective\. To address this challenge, RelAD introduces a conditional sparse\-gated attribute reconstruction module, which first identifies informative relational feature blocks and then reconstructs the entity profile\.
Conditional Sparse Gating\.To filter out the anomaly\-irrelevant attributes, a key question is how to select informative ones from heterogeneous relational feature blocks\. An effective strategy is to use a gating mechanism to assign adaptive importance weights to input dimensions, so that less relevant features are suppressed before reconstruction\. However, directly applying a single global gate to the concatenated attribute vector is not ideal for relational databases, because it treats all dimensions as an unstructured feature set and ignores the block semantics induced by different tables\. Moreover, whether a child\-table aggregated block is informative often depends on the central entity profile rather than on the child block alone, highlighting the need to explicitly model such dependencies\. To fill this gap, we make the gate both block\-aware and context\-conditioned\. Specifically, we first generate a separate mask for each semantic block, so that attribute selection can respect the table from which each block originates\. Let𝐱0\\mathbf\{x\}\_\{0\}be the central\-table attribute block and𝐱t\\mathbf\{x\}\_\{t\}be thett\-th child\-table aggregated block\. Since the central block describes the target entity itself, its mask is generated from its own attributes\. In contrast, each child block is evaluated together with the central block, allowing the model to determine whether the related\-table information is informative for the central entity:
𝐦0=σ\(MLP0\(𝐱0\)\),\\mathbf\{m\}\_\{0\}=\\sigma\\left\(\\operatorname\{MLP\}\_\{0\}\(\\mathbf\{x\}\_\{0\}\)\\right\),\(3\)𝐦t=σ\(MLPt\(\[𝐱0;𝐱t\]\)\),t=1,…,B,\\mathbf\{m\}\_\{t\}=\\sigma\\left\(\\operatorname\{MLP\}\_\{t\}\(\[\\mathbf\{x\}\_\{0\};\\mathbf\{x\}\_\{t\}\]\)\\right\),\\quad t=1,\\dots,B,\(4\)whereBBis the number of child\-table aggregated blocks,𝐦0\\mathbf\{m\}\_\{0\}and𝐦t\\mathbf\{m\}\_\{t\}are block\-level masks, andσ\(⋅\)\\sigma\(\\cdot\)is the sigmoid function\. After obtaining these block\-level masks, we concatenate them to form an entity\-specific attribute mask aligned with the original feature vector for input feature filtering:
𝐦=\[𝐦0;𝐦1;…;𝐦B\],𝐱~=𝐦⊙𝐱,\\mathbf\{m\}=\[\\mathbf\{m\}\_\{0\};\\mathbf\{m\}\_\{1\};\\dots;\\mathbf\{m\}\_\{B\}\],\\qquad\\tilde\{\\mathbf\{x\}\}=\\mathbf\{m\}\\odot\\mathbf\{x\},\(5\)where𝐦\\mathbf\{m\}has the same dimensionality as𝐱\\mathbf\{x\},𝐱~\\tilde\{\\mathbf\{x\}\}is the gated attribute vector, and⊙\\odotdenotes element\-wise multiplication\. In this way, the encoder receives a compact attribute profile that emphasizes informative blocks and suppresses redundant dimensions\. The gated vector is then projected into a latent representation:
𝐳attr=Encattr\(𝐱~\),\\mathbf\{z\}\_\{attr\}=\\operatorname\{Enc\}\_\{attr\}\(\\tilde\{\\mathbf\{x\}\}\),\(6\)where𝐳attr\\mathbf\{z\}\_\{attr\}is the attribute latent representation andEncattr\(⋅\)\\operatorname\{Enc\}\_\{attr\}\(\\cdot\)is the shared attribute encoder\.
Although conditional gating can assign adaptive importance to different attributes, the learned masks may still remain dense without explicit regularization\. To further encourage sparse attribute selection, we impose a sparsity penalty on the mask values:
ℒsparse=1Nd∑u=1N∑j=1dmu,j,\\mathcal\{L\}\_\{sparse\}=\\frac\{1\}\{Nd\}\\sum\_\{u=1\}^\{N\}\\sum\_\{j=1\}^\{d\}m\_\{u,j\},\(7\)wheremu,jm\_\{u,j\}denotes the gate value of entityuuon thejj\-th attribute dimension\. With this constraint, the model is encouraged to reconstruct entity profiles using a compact subset of informative attributes, reducing the interference from redundant dimensions\.
Block\-Specific Reconstruction\.After obtaining𝐳attr\\mathbf\{z\}\_\{attr\}, a straightforward reconstruction strategy is to use a single decoder to predict the entire attribute vector\. However, such a decoder ignores that different blocks originate from different tables and carry distinct semantics\. It may also allow high\-dimensional blocks to dominate the reconstruction objective, making anomalies in small but important blocks less visible\. To preserve block\-level semantics during reconstruction, RelAD adopts block\-specific decoders, where each decoder is responsible for reconstructing one semantic block:
𝐱^t=Dect\(𝐳attr\),t=0,…,B,\\hat\{\\mathbf\{x\}\}\_\{t\}=\\operatorname\{Dec\}\_\{t\}\(\\mathbf\{z\}\_\{attr\}\),\\quad t=0,\\dots,B,\(8\)where𝐱^t\\hat\{\\mathbf\{x\}\}\_\{t\}is the reconstructed vector of blockttandDect\(⋅\)\\operatorname\{Dec\}\_\{t\}\(\\cdot\)is the decoder for that block\. The reconstructed blocks are then concatenated to form the full reconstruction𝐱^u\\hat\{\\mathbf\{x\}\}\_\{u\}for each entity\. During training, we optimize the overall reconstruction objective over the full attribute vector:
ℒrec=1Nd∑u=1N‖𝐱u−𝐱^u‖22,\\mathcal\{L\}\_\{rec\}=\\frac\{1\}\{Nd\}\\sum\_\{u=1\}^\{N\}\\left\\\|\\mathbf\{x\}\_\{u\}\-\\hat\{\\mathbf\{x\}\}\_\{u\}\\right\\\|\_\{2\}^\{2\},\(9\)where𝐱^u\\hat\{\\mathbf\{x\}\}\_\{u\}is the reconstructed full attribute vector of entityuu\. While this objective trains the attribute branch to reconstruct the complete entity profile, anomaly scoring should remain sensitive to localized deviations\. Therefore, for inference, we further compute a block\-level reconstruction error:
ℒt\(u\)=1dt‖𝐱u,t−𝐱^u,t‖22,\\mathcal\{L\}\_\{t\}\(u\)=\\frac\{1\}\{d\_\{t\}\}\\left\\\|\\mathbf\{x\}\_\{u,t\}\-\\hat\{\\mathbf\{x\}\}\_\{u,t\}\\right\\\|\_\{2\}^\{2\},\(10\)wheret=0,…,Bt=0,\\dots,B,𝐱u,t\\mathbf\{x\}\_\{u,t\}and𝐱^u,t\\hat\{\\mathbf\{x\}\}\_\{u,t\}are the original and reconstructed vectors of blocktt, anddtd\_\{t\}is the dimensionality of this block\. These block\-wise residuals reveal which semantic parts of an entity are poorly reconstructed\. Since anomalies may only appear in a few semantic blocks, aggregating all block errors uniformly may dilute the abnormal signal\. We therefore use the average of the top\-KKnormalized block errors as the local attribute anomaly score:
sattr\(u\)=1K∑t∈TopKt\(Norm\(ℒt\(u\)\)\)Norm\(ℒt\(u\)\),s\_\{attr\}\(u\)=\\frac\{1\}\{K\}\\sum\_\{t\\in\\operatorname\{TopK\}\_\{t\}\(\\operatorname\{Norm\}\(\\mathcal\{L\}\_\{t\}\(u\)\)\)\}\\operatorname\{Norm\}\(\\mathcal\{L\}\_\{t\}\(u\)\),\(11\)whereNorm\(⋅\)\\operatorname\{Norm\}\(\\cdot\)denotes score normalization,KKis a hyperparameter controlling the number of selected local signals, andTopKt\(⋅\)\\operatorname\{TopK\}\_\{t\}\(\\cdot\)selects theKKlargest block\-level scores\. As a result, the attribute branch produces an anomaly score that is less affected by redundant or weakly relevant attributes and more sensitive to localized deviations within specific semantic blocks\. This allows RelAD to distinguish entities with abnormal attribute patterns even when such deviations are sparse and would be diluted by global reconstruction errors or uniform block aggregation\.
### 3\.2Dual\-View Multi\-Relational Edge Reconstruction
Apart from attributes, relations also play a critical role in RAD, because abnormal entities may not only exhibit unusual attributes but also form abnormal connections with related records\. In relational databases, these connections are organized by multiple foreign\-key relations with distinct semantics, and each relation may reveal a different type of behavioral deviation\. From a relational perspective, the abnormality can arise from two different sources\. An edge may be inconsistent with the entity’s intrinsic profile recorded in the central table, or it may be inconsistent with the behavioral context summarized from related child tables\. In this case, collapsing these sources into a single representation for relation reconstruction may blur their distinct roles, especially when behavioral aggregations provide stronger shortcut signals for edge prediction and thereby suppress weaker but important self\-profile evidence\. Motivated by this, RelAD reconstructs the original multi\-relational edges from two complementary views\. To achieve this, we design a dual\-view multi\-relational edge reconstruction module with self\-profile and child\-profile relational branches\.
Dual\-View Entity Encoding\.To keep the two sources of relational evidence disentangled, we encode the central\-table profile and the child\-table aggregations with two independent encoders\. The self\-profile branch maps the central\-table profile into a latent space, allowing the model to examine whether the observed neighbors are compatible with the intrinsic properties of the target entity:
𝐳u,selfrel=Encself\(𝐱uself\),\\mathbf\{z\}\_\{u,self\}^\{rel\}=\\operatorname\{Enc\}\_\{self\}\(\\mathbf\{x\}\_\{u\}^\{self\}\),\(12\)where𝐳u,selfrel\\mathbf\{z\}\_\{u,self\}^\{rel\}is the relational representation from the central\-table profile andEncself\(⋅\)\\operatorname\{Enc\}\_\{self\}\(\\cdot\)is the corresponding encoder\. In parallel, the child\-profile branch encodes the aggregated child\-table profile, allowing edge reconstruction to be conditioned on the target entity’s historical behavioral context:
𝐳u,childrel=Encchild\(𝐱uchild\),\\mathbf\{z\}\_\{u,child\}^\{rel\}=\\operatorname\{Enc\}\_\{child\}\(\\mathbf\{x\}\_\{u\}^\{child\}\),\(13\)where𝐳u,childrel\\mathbf\{z\}\_\{u,child\}^\{rel\}is the relational representation from child\-table aggregations andEncchild\(⋅\)\\operatorname\{Enc\}\_\{child\}\(\\cdot\)is an independent encoder\. By keeping the two encoders separate, RelAD can produce view\-specific relational residuals rather than collapsing all evidence into a single mixed representation\.
After obtaining the two target\-entity views, the neighbor side should still preserve relation semantics, because different foreign\-key relations may connect the target entity to different types of neighbor records\. Therefore, for each relation typer∈ℛr\\in\\mathcal\{R\}, RelAD uses a relation\-specific neighbor\-entity encoder shared by both branches:
𝐞ir=NodeEncr\(𝐡ir\),\\mathbf\{e\}\_\{i\}^\{r\}=\\operatorname\{NodeEnc\}^\{r\}\(\\mathbf\{h\}\_\{i\}^\{r\}\),\(14\)where𝐡ir\\mathbf\{h\}\_\{i\}^\{r\}is the row feature of neighbor entityiiin𝐇r\\mathbf\{H\}^\{r\},NodeEncr\(⋅\)\\operatorname\{NodeEnc\}^\{r\}\(\\cdot\)is the neighbor\-entity encoder of relationrr, and𝐞ir\\mathbf\{e\}\_\{i\}^\{r\}is the shared relation\-specific neighbor\-entity embedding used by both branches\. Sharing this neighbor embedding space makes the reconstruction scores from the two views comparable within each relation, while the relation\-specific encoder prevents semantically different foreign\-key relations from being forced into a single neighbor representation\.
Multi\-Relational Edge Reconstruction\.Given an observed edge\(u,i\)∈ℰr\(u,i\)\\in\\mathcal\{E\}\_\{r\}and a randomly sampled negative neighbor entityjj, the two branches reconstruct relational edges via dot\-product scoring\. Letq∈\{self,child\}q\\in\\\{self,child\\\}denote the relational branch, with𝐳u,qrel\\mathbf\{z\}\_\{u,q\}^\{rel\}being the corresponding target\-entity representation, then the branch\-specific relational loss can be written as
ℒqr=𝔼\(u,i\)∈ℰr\[ϕ\(−𝐳u,qrel⋅𝐞ir\)\]\+𝔼\(u,j\)∉ℰr\[ϕ\(𝐳u,qrel⋅𝐞jr\)\],\\mathcal\{L\}\_\{q\}^\{r\}=\\mathbb\{E\}\_\{\(u,i\)\\in\\mathcal\{E\}\_\{r\}\}\\left\[\\phi\(\-\\mathbf\{z\}\_\{u,q\}^\{rel\}\\cdot\\mathbf\{e\}\_\{i\}^\{r\}\)\\right\]\+\\mathbb\{E\}\_\{\(u,j\)\\notin\\mathcal\{E\}\_\{r\}\}\\left\[\\phi\(\\mathbf\{z\}\_\{u,q\}^\{rel\}\\cdot\\mathbf\{e\}\_\{j\}^\{r\}\)\\right\],\(15\)whereϕ\(x\)=log\(1\+exp\(x\)\)\\phi\(x\)=\\log\(1\+\\exp\(x\)\)denotes the softplus function\.
To obtain anomaly scores during inference, for each relation type, we compute the average edge\-level negative log\-likelihood for each entity and normalize the relation\-wise scores\. The top\-KKrelation scores are then aggregated for each branch:
srel,q\(u\)=1K∑r∈ℛK,q\(u\)Norm\(ℓ¯qr\(u\)\),ℛK,q\(u\)=TopKr\{Norm\(ℓ¯qr\(u\)\)\},s\_\{rel,q\}\(u\)=\\frac\{1\}\{K\}\\sum\_\{r\\in\\mathcal\{R\}\_\{K,q\}\(u\)\}\\operatorname\{Norm\}\(\\bar\{\\ell\}\_\{q\}^\{r\}\(u\)\),\\quad\\mathcal\{R\}\_\{K,q\}\(u\)=\\operatorname\{TopK\}\_\{r\}\\left\\\{\\operatorname\{Norm\}\(\\bar\{\\ell\}\_\{q\}^\{r\}\(u\)\)\\right\\\},\(16\)whereℓ¯qr\(u\)\\bar\{\\ell\}\_\{q\}^\{r\}\(u\)denotes the average positive\-edge negative log\-likelihood under relation typerrfrom branchqq\. Equipped with dual\-view reconstruction, RelAD captures inconsistencies between relational behaviors and both central\-profile attributes and child\-table behavioral contexts\.
### 3\.3Model Training and Anomaly Scoring
To optimize RelAD, the overall objective combines the previous loss terms:
ℒ=ℒrec\+λsℒsparse\+1\|ℛact\|∑q∈\{self,child\}∑r∈ℛactℒqr,\\mathcal\{L\}=\\mathcal\{L\}\_\{rec\}\+\\lambda\_\{s\}\\mathcal\{L\}\_\{sparse\}\+\\frac\{1\}\{\|\\mathcal\{R\}\_\{act\}\|\}\\sum\_\{q\\in\\\{self,child\\\}\}\\sum\_\{r\\in\\mathcal\{R\}\_\{act\}\}\\mathcal\{L\}\_\{q\}^\{r\},\(17\)whereℛact\\mathcal\{R\}\_\{act\}is the set of active relation types with valid sampled edges in the current batch, andλs\\lambda\_\{s\}controls the sparsity regularization\.
Entity Anomaly Score Fusion\.Although the attribute and relation branches provide complementary evidence, anomalies in relational data are often localized to only a few semantic blocks or relation types\. Directly averaging all residuals may dilute these local signals, while relying on a single branch may miss anomalies from other sources\. To address this issue, RelAD adopts a two\-level fusion of three local anomaly scores computed from normalized block\- and relation\-level errors:
s\(u\)=αsattr\(u\)\+\(1−α\)\[βsrel,self\(u\)\+\(1−β\)srel,child\(u\)\],s\(u\)=\\alpha s\_\{attr\}\(u\)\+\(1\-\\alpha\)\\left\[\\beta s\_\{rel,self\}\(u\)\+\(1\-\\beta\)s\_\{rel,child\}\(u\)\\right\],\(18\)whereα∈\[0,1\]\\alpha\\in\[0,1\]controls the trade\-off between attribute\-local anomalies and relational anomalies, andβ∈\[0,1\]\\beta\\in\[0,1\]controls the trade\-off between the self\-profile and child\-profile relational branches\. This hierarchical parameterization reduces the fusion space while preserving interpretability: the attribute branch dominates when anomalies appear as local profile deviations, while the two relational branches provide complementary evidence when anomalies are reflected by inconsistent multi\-relational behaviors\. The algorithmic details and complexity analysis of RelAD are provided in Appendices[A](https://arxiv.org/html/2606.18621#A1)and[B](https://arxiv.org/html/2606.18621#A2), respectively\.
## 4Experiments
### 4\.1Experiment Setup
Datasets\.We conduct experiments on 6 public relational databases from RelbenchRobinsonet al\.\([2024a](https://arxiv.org/html/2606.18621#bib.bib1)\)and Relbench v2Guet al\.\([2026](https://arxiv.org/html/2606.18621#bib.bib19)\)across multiple domains, including 3 e\-commerce platforms \(Amazon, HM, and Avito\), an academic citation network \(ArXiv\), a Q&A community \(Stack\), and an enterprise resource planning system \(SALT[Kleinet al\.](https://arxiv.org/html/2606.18621#bib.bib37)\)\. Detailed statistics are provided in Appendix[C](https://arxiv.org/html/2606.18621#A3)\. As there are no ground\-truth anomaly labels in these benchmarks, we design dataset\-specific anomaly injection rules for evaluation\. We simulate plausible fraud scenarios grounded in each dataset’s relational structure\. For example, in Amazon we replace a subset of a user’s reviews with products from unrelated categories to simulate review manipulation, while in ArXiv we redirect citation edges to shared beacon papers to simulate citation cartels\. The anomaly ratio is set to 5% across all datasets\. Detailed injection procedures are provided in Appendix[D](https://arxiv.org/html/2606.18621#A4)\.
Baselines\.We compare RelAD with the representation GAD and TAD methods\. ❶ GAD methods include PREMPanet al\.\([2023](https://arxiv.org/html/2606.18621#bib.bib27)\), FreeGADZhaoet al\.\([2025](https://arxiv.org/html/2606.18621#bib.bib30)\)and DOMINANTDinget al\.\([2019](https://arxiv.org/html/2606.18621#bib.bib26)\)\. ❷ TAD methods include MCMTADYinet al\.\([2024](https://arxiv.org/html/2606.18621#bib.bib33)\), DRLYeet al\.\([2025a](https://arxiv.org/html/2606.18621#bib.bib34)\), KNNRamaswamyet al\.\([2000](https://arxiv.org/html/2606.18621#bib.bib25)\), LOFBreuniget al\.\([2000](https://arxiv.org/html/2606.18621#bib.bib24)\), IsolationForestLiuet al\.\([2019](https://arxiv.org/html/2606.18621#bib.bib23)\)and LUNARGoodgeet al\.\([2022](https://arxiv.org/html/2606.18621#bib.bib31)\)\.
Table 1:Anomaly detection performance in terms of AUROC and AUPRC\. OOM denotes out\-of\-memory on a 24GB GPU\. Best performance is highlighted inbold\.Evaluation and Implementation\.We report AUROC and AUPRC as the main metrics\. For all methods, we report the mean performance over 5 random seeds\. In our implementation, we optimize RelAD using Adam with a learning rate of5×10−35\\times 10^\{\-3\}, and train the model for 100 epochs with a batch size of 8192\. More comprehensive implementation details are given in Appendix[E](https://arxiv.org/html/2606.18621#A5)\.
### 4\.2Experimental Results
Main Results\.Table[1](https://arxiv.org/html/2606.18621#S4.T1)reports the comparison results in terms of AUROC and AUPRC \(more results are in Appendix[F\.1](https://arxiv.org/html/2606.18621#A6.SS1)\)\. We have the following observations\. ❶ RelAD achieves the best AUROC on all six datasets, demonstrating its consistent effectiveness for RAD across diverse relational schemas and anomaly sources\. Compared with the strongest baseline on each dataset, RelAD shows more than 10% relative performance gain on 4 datasets\. ❷ RelAD also obtains the best AUPRC on all six datasets, with particularly large gains on ArXiv, HM, and SALT\. This indicates that RelAD not only ranks anomalies higher overall, but also better identifies rare anomalous entities under the highly imbalanced detection setting\. ❸ Existing TAD methods can perform competitively on some datasets after flattening relational information into tabular features, such as DRL on Amazon and Avito and MCMTAD on HM and SALT\. However, their performance varies significantly across datasets, suggesting that flattened features cannot consistently preserve relation\-specific abnormal patterns\. ❹ GAD methods generally underperform and DOMINANT suffers from out\-of\-memory issues on most datasets\. This confirms that directly adapting homogeneous graph anomaly detection methods is insufficient for RAD, where multiple typed foreign\-key relations and heterogeneous entity attributes need to be modeled explicitly\. Overall, these results validate the advantage of jointly modeling local attribute reconstruction and dual\-view multi\-relational edge reconstruction in RelAD\.
Ablation Study\.To verify the effectiveness of each component, we construct six variants of RelAD: ❶w/o Attr\., removing attribute reconstruction; ❷w/o Relation, removing multi\-relational edge reconstruction; ❸w/o Gating, removing conditional sparse gates; ❹w/o Block Dec\., using one shared attribute decoder; ❺w/o Self View, removing the self\-profile branch; and ❻w/o Child View, removing the child\-profile branch\.
Table 2:AUROC of RelAD and its variants\.Table[2](https://arxiv.org/html/2606.18621#S4.T2)shows that all components are beneficial\. Relational reconstruction is the most critical component, confirming the importance of modeling abnormal connection patterns\. Removing the attribute branch also hurts performance, indicating that local attribute deviations provide complementary evidence\. The drops ofw/o Gatingandw/o Block Dec\.further show that conditional feature selection and block\-specific reconstruction reduce redundancy and preserve localized anomaly signals\. For the dual\-view design,w/o Child Viewgenerally suffers a larger drop thanw/o Self View, suggesting that child\-table behavioral context is especially informative\. Nevertheless, the full model consistently outperforms both single\-view variants, confirming that all designs are valid and effective\. More ablation results are available in the Appendix[F\.2](https://arxiv.org/html/2606.18621#A6.SS2)\.
Hyperparameter Analysis\.Fig\.[3](https://arxiv.org/html/2606.18621#S4.F3)shows the sensitivity of RelAD toα\\alphaandβ\\betaon Amazon and SALT \(more results are in Appendix[F\.3](https://arxiv.org/html/2606.18621#A6.SS3)\)\. The reliances on the two weights differ notably across datasets\. SALT requires a higherα\\alpha, indicating that attribute deviations play a major role in its anomaly pattern; in contrast, Amazon requires a smallerα\\alpha, relying more on multi\-relational edge reconstruction\.
\(a\)Amazon
\(b\)SALT
Figure 3:Sensitivity w\.r\.t\.α\\alphaandβ\\beta\.Forβ\\beta, most datasets prefer moderate values, confirming that both relational views provide complementary evidence\. This disparity suggests that anomalies in different relational databases may manifest primarily in attributes or in relational connections, and RelAD can flexibly adapt the fusion weights to the underlying anomaly source\.
Figure 4:Runtime comparison on the HM dataset\.Efficiency Analysis\.To assess the runtime efficiency of RelAD, we compare its total running time with representative baselines on the HM dataset, excluding methods that run OOM\. For a fair comparison, all trainable methods are evaluated under the same 5\-epoch setting\. As shown in Fig\.[4](https://arxiv.org/html/2606.18621#S4.F4), RelAD achieves competitive efficiency while maintaining strong detection performance\. In particular, RelAD is substantially faster than several costly baselines such as KNN, LOF, LUNAR, DRL, and MCMTAD, and its runtime remains close to the fastest methods\. These results indicate that RelAD introduces limited computational overhead and remains practical for relational anomaly detection on large\-scale datasets\.
## 5Related Work
In this section, we briefly review three lines of related studies\. A detailed literature review is in Appendix[G](https://arxiv.org/html/2606.18621#A7)\.
Relational Deep Learning \(RDL\)aims to learn from multi\-table relational databases by modeling entities and their primary–foreign key relationshipsRobinsonet al\.\([2024b](https://arxiv.org/html/2606.18621#bib.bib8)\)\. Mainstream studies transform relational databases into heterogeneous graphs and develop graph neural network \(GNN\)\-based or Transformer\-based frameworks for representation learning and downstream task modeling[Dwivediet al\.](https://arxiv.org/html/2606.18621#bib.bib17); Chenet al\.\([2025b](https://arxiv.org/html/2606.18621#bib.bib18)\)\. Advanced studies have also extended RDL to more general paradigms, such as LLM\-based frameworksWuet al\.\([2025](https://arxiv.org/html/2606.18621#bib.bib9)\)and foundation models[Ranjanet al\.](https://arxiv.org/html/2606.18621#bib.bib21); Wanget al\.\([2025](https://arxiv.org/html/2606.18621#bib.bib20)\); Kothapalliet al\.\([2026](https://arxiv.org/html/2606.18621#bib.bib22)\)for RDL\. Despite their success, existing works rarely investigate anomaly detection for relational data, leaving this problem largely under\-explored\.
Tabular Anomaly Detection \(TAD\)investigates the detection of anomalous instances in structured tabular dataPanget al\.\([2021](https://arxiv.org/html/2606.18621#bib.bib10)\); Borisovet al\.\([2022](https://arxiv.org/html/2606.18621#bib.bib36)\)\. Early TAD methods mainly rely on handcrafted statistical assumptions, including isolation\-, density\-, and distance\-based anomaly scoring strategiesBreuniget al\.\([2000](https://arxiv.org/html/2606.18621#bib.bib24)\); Ramaswamyet al\.\([2000](https://arxiv.org/html/2606.18621#bib.bib25)\); Liuet al\.\([2019](https://arxiv.org/html/2606.18621#bib.bib23)\)\. Recent deep learning\-based solutions, built upon techniques such as autoencoding and contrastive learning, demonstrate strong performance in TADYeet al\.\([2025a](https://arxiv.org/html/2606.18621#bib.bib34)\); Yinet al\.\([2024](https://arxiv.org/html/2606.18621#bib.bib33)\); Shenkar and Wolf \([2022](https://arxiv.org/html/2606.18621#bib.bib11)\)\. However, TAD methods cannot effectively model the relational dependencies in relational data, making them sub\-optimal for RAD\.
Graph Anomaly Detection \(GAD\)aims to identify anomalous samples from graph dataMaet al\.\([2021](https://arxiv.org/html/2606.18621#bib.bib12)\); Qiaoet al\.\([2025](https://arxiv.org/html/2606.18621#bib.bib13)\); Liuet al\.\([2026a](https://arxiv.org/html/2606.18621#bib.bib57)\); Shenet al\.\([2026](https://arxiv.org/html/2606.18621#bib.bib7)\); Zhaoet al\.\([2026](https://arxiv.org/html/2606.18621#bib.bib5)\); Liuet al\.\([2026b](https://arxiv.org/html/2606.18621#bib.bib3)\)\. As shallow methods usually struggle to handle complex graph dataPenget al\.\([2018](https://arxiv.org/html/2606.18621#bib.bib15)\); Liet al\.\([2017](https://arxiv.org/html/2606.18621#bib.bib14)\), mainstream studies primarily develop deep learning\-based approaches built upon GNNsQiaoet al\.\([2025](https://arxiv.org/html/2606.18621#bib.bib13)\)\. These deep methods mainly rely on paradigms such as reconstructionDinget al\.\([2019](https://arxiv.org/html/2606.18621#bib.bib26)\), contrastive learningLiuet al\.\([2021](https://arxiv.org/html/2606.18621#bib.bib28)\); Panet al\.\([2023](https://arxiv.org/html/2606.18621#bib.bib27)\), and affinity learningQiao and Pang \([2023](https://arxiv.org/html/2606.18621#bib.bib16)\)to characterize anomalous patterns\. Nevertheless, most GAD methods are developed for homogeneous graphs, limiting their effectiveness in RAD with heterogeneous entities and relations\.
## 6Conclusion
In this paper, we take the first step towards addressing the relational anomaly detection problem, aiming to identify abnormal entities from heterogeneous multi\-table attributes and typed foreign\-key relations\. We introduce RelAD, a reconstruction\-based RAD framework that jointly captures local attribute deviations and relation\-specific abnormal connection patterns via conditional sparse\-gated attribute reconstruction and dual\-view multi\-relational edge reconstruction\. Extensive experiments on six relational benchmark datasets demonstrate the detection prowess, robustness, and efficiency of RelAD compared to representative tabular and graph anomaly detection approaches\. Onelimitationis that the current setting does not explicitly consider the temporal dimension and its influence on anomaly formation\. A potentialfuture directionis to extend RelAD to dynamic relational databases where entities, attributes, and foreign\-key relations evolve over time\.
## References
- \[1\]R\. Bing, G\. Yuan, M\. Zhu, F\. Meng, H\. Ma, and S\. Qiao\(2023\)Heterogeneous graph neural networks analysis: a survey of techniques, evaluations and applications\.\.Artificial Intelligence Review56\(8\)\.Cited by:[§G\.3](https://arxiv.org/html/2606.18621#A7.SS3.p1.1)\.
- \[2\]V\. Borisov, T\. Leemann, K\. Seßler, J\. Haug, M\. Pawelczyk, and G\. Kasneci\(2022\)Deep neural networks and tabular data: a survey\.IEEE transactions on neural networks and learning systems35\(6\),pp\. 7499–7519\.Cited by:[§G\.2](https://arxiv.org/html/2606.18621#A7.SS2.p1.1),[§5](https://arxiv.org/html/2606.18621#S5.p3.1)\.
- \[3\]M\. M\. Breunig, H\. Kriegel, R\. T\. Ng, and J\. Sander\(2000\)LOF: identifying density\-based local outliers\.InProceedings of the 2000 ACM SIGMOD international conference on Management of data,pp\. 93–104\.Cited by:[§G\.2](https://arxiv.org/html/2606.18621#A7.SS2.p1.1),[§1](https://arxiv.org/html/2606.18621#S1.p2.1),[§4\.1](https://arxiv.org/html/2606.18621#S4.SS1.p2.1),[§5](https://arxiv.org/html/2606.18621#S5.p3.1)\.
- \[4\]C\. Chang, J\. Yoon, S\. Ö\. Arik, M\. Udell, and T\. Pfister\(2023\)Data\-efficient and interpretable tabular anomaly detection\.InProceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining,pp\. 190–201\.Cited by:[§G\.2](https://arxiv.org/html/2606.18621#A7.SS2.p1.1)\.
- \[5\]J\. Chen, G\. Zhu, C\. Yuan, and Y\. Huang\(2024\)Boosting graph anomaly detection with adaptive message passing\.InThe Twelfth International Conference on Learning Representations,Cited by:[§1](https://arxiv.org/html/2606.18621#S1.p2.1)\.
- \[6\]Q\. Chen, S\. Li, Y\. Liu, S\. Pan, G\. I\. Webb, and S\. Zhang\(2025\)Uncertainty\-aware graph neural networks: a multihop evidence fusion approach\.IEEE Transactions on Neural Networks and Learning Systems\.Cited by:[§1](https://arxiv.org/html/2606.18621#S1.p2.1)\.
- \[7\]T\. Chen, C\. Kanatsoulis, and J\. Leskovec\(2025\)RelGNN: composite message passing for relational deep learning\.InInternational Conference on Machine Learning,pp\. 8296–8312\.Cited by:[§G\.1](https://arxiv.org/html/2606.18621#A7.SS1.p1.1),[§1](https://arxiv.org/html/2606.18621#S1.p1.1),[§5](https://arxiv.org/html/2606.18621#S5.p2.1)\.
- \[8\]K\. Ding, J\. Li, R\. Bhanushali, and H\. Liu\(2019\)Deep anomaly detection on attributed networks\.InProceedings of the 2019 SIAM international conference on data mining,pp\. 594–602\.Cited by:[§G\.3](https://arxiv.org/html/2606.18621#A7.SS3.p1.1),[§1](https://arxiv.org/html/2606.18621#S1.p2.1),[§4\.1](https://arxiv.org/html/2606.18621#S4.SS1.p2.1),[§5](https://arxiv.org/html/2606.18621#S5.p4.1)\.
- \[9\]X\. Dong, X\. Zhang, Y\. Sun, L\. Chen, M\. Yuan, and S\. Wang\(2025\)Smoothgnn: smoothing\-aware gnn for unsupervised node anomaly detection\.InProceedings of the ACM on Web Conference 2025,pp\. 1225–1236\.Cited by:[§G\.3](https://arxiv.org/html/2606.18621#A7.SS3.p1.1)\.
- \[10\]V\. P\. Dwivedi, S\. Jaladi, Y\. Shen, F\. Lopez, C\. I\. Kanatsoulis, R\. Puri, M\. Fey, and J\. LeskovecRelational graph transformer\.InTemporal Graph Learning Workshop@ KDD 2025,Cited by:[§G\.1](https://arxiv.org/html/2606.18621#A7.SS1.p1.1),[§1](https://arxiv.org/html/2606.18621#S1.p1.1),[§5](https://arxiv.org/html/2606.18621#S5.p2.1)\.
- \[11\]V\. P\. Dwivedi, C\. Kanatsoulis, S\. Huang, and J\. Leskovec\(2025\)Relational deep learning: challenges, foundations and next\-generation architectures\.InProceedings of the 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining V\. 2,pp\. 5999–6009\.Cited by:[§1](https://arxiv.org/html/2606.18621#S1.p1.1)\.
- \[12\]A\. Goodge, B\. Hooi, S\. Ng, and W\. S\. Ng\(2022\)Lunar: unifying local outlier detection methods via graph neural networks\.InProceedings of the AAAI conference on artificial intelligence,Vol\.36,pp\. 6737–6745\.Cited by:[§1](https://arxiv.org/html/2606.18621#S1.p2.1),[§4\.1](https://arxiv.org/html/2606.18621#S4.SS1.p2.1)\.
- \[13\]J\. Gu, R\. Ranjan, C\. Kanatsoulis, H\. Tang, M\. Jurkovic, V\. Hudovernik, M\. Znidar, P\. Chaturvedi, P\. Shroff, F\. Li,et al\.\(2026\)RelBench v2: a large\-scale benchmark and repository for relational data\.arXiv preprint arXiv:2602\.12606\.Cited by:[§G\.1](https://arxiv.org/html/2606.18621#A7.SS1.p1.1),[§2](https://arxiv.org/html/2606.18621#S2.p4.4),[§4\.1](https://arxiv.org/html/2606.18621#S4.SS1.p1.1)\.
- \[14\]T\. K\. K\. Ho, A\. Karami, and N\. Armanfard\(2025\)Graph anomaly detection in time series: a survey\.IEEE Transactions on Pattern Analysis and Machine Intelligence\.Cited by:[§G\.3](https://arxiv.org/html/2606.18621#A7.SS3.p1.1)\.
- \[15\]T\. Klein, C\. Biehl, M\. Costa, A\. Sres, J\. Kolk, and J\. HoffartSALT: sales autocompletion linked business tables dataset\.InNeurIPS 2024 Third Table Representation Learning Workshop,Cited by:[§4\.1](https://arxiv.org/html/2606.18621#S4.SS1.p1.1)\.
- \[16\]V\. Kothapalli, R\. Ranjan, V\. Hudovernik, V\. P\. Dwivedi, J\. Hoffart, C\. Guestrin, and J\. Leskovec\(2026\)PluRel: synthetic data unlocks scaling laws for relational foundation models\.arXiv preprint arXiv:2602\.04029\.Cited by:[§G\.1](https://arxiv.org/html/2606.18621#A7.SS1.p1.1),[§5](https://arxiv.org/html/2606.18621#S5.p2.1)\.
- \[17\]J\. Li, H\. Dani, X\. Hu, and H\. Liu\(2017\)Radar: residual analysis for anomaly detection in attributed networks\.InProceedings of the 26th International Joint Conference on Artificial Intelligence,pp\. 2152–2158\.Cited by:[§G\.3](https://arxiv.org/html/2606.18621#A7.SS3.p1.1),[§5](https://arxiv.org/html/2606.18621#S5.p4.1)\.
- \[18\]S\. Li, Y\. Liu, Q\. Wen, C\. Zhang, and S\. Pan\(2026\)Assemble your crew: automatic multi\-agent communication topology design via autoregressive graph generation\.InProceedings of the AAAI Conference on Artificial Intelligence,Vol\.40,pp\. 23142–23150\.Cited by:[§G\.1](https://arxiv.org/html/2606.18621#A7.SS1.p1.1)\.
- \[19\]S\. Li, Y\. Liu, Y\. Zheng, X\. Cao, S\. Pan, and H\. T\. Shen\(2026\)Towards one\-for\-all anomaly detection for tabular data\.InForty\-third international conference on machine learning,Cited by:[§G\.2](https://arxiv.org/html/2606.18621#A7.SS2.p1.1)\.
- \[20\]S\. Li, Y\. Liu, Y\. Zheng, M\. Li, Q\. V\. H\. Nguyen, and S\. Pan\(2026\)Ofa\-mas: one\-for\-all multi\-agent system topology design based on mixture\-of\-experts graph generative models\.InProceedings of the ACM Web Conference 2026,pp\. 1333–1344\.Cited by:[§G\.1](https://arxiv.org/html/2606.18621#A7.SS1.p1.1)\.
- \[21\]Y\. Liu, Z\. Li, C\. Zhou, Y\. Jiang, J\. Sun, M\. Wang, and X\. He\(2019\)Generative adversarial active learning for unsupervised outlier detection\.IEEE Transactions on Knowledge and Data Engineering32\(8\),pp\. 1517–1528\.Cited by:[§G\.2](https://arxiv.org/html/2606.18621#A7.SS2.p1.1),[§4\.1](https://arxiv.org/html/2606.18621#S4.SS1.p2.1),[§5](https://arxiv.org/html/2606.18621#S5.p3.1)\.
- \[22\]Y\. Liu, S\. Li, Y\. Zheng, Q\. Chen, C\. Zhang, and S\. Pan\(2024\)Arc: a generalist graph anomaly detector with in\-context learning\.Advances in Neural Information Processing Systems37,pp\. 50772–50804\.Cited by:[§1](https://arxiv.org/html/2606.18621#S1.p2.1)\.
- \[23\]Y\. Liu, S\. Li, Y\. Zheng, Q\. Chen, C\. Zhang, P\. S\. Yu, and S\. Pan\(2026\)From few\-shot to zero\-shot: towards generalist graph anomaly detection\.IEEE Transactions on Knowledge and Data Engineering\.Cited by:[§5](https://arxiv.org/html/2606.18621#S5.p4.1)\.
- \[24\]Y\. Liu, Z\. Li, S\. Pan, C\. Gong, C\. Zhou, and G\. Karypis\(2021\)Anomaly detection on attributed networks via contrastive self\-supervised learning\.IEEE transactions on neural networks and learning systems33\(6\),pp\. 2378–2392\.Cited by:[§G\.3](https://arxiv.org/html/2606.18621#A7.SS3.p1.1),[§1](https://arxiv.org/html/2606.18621#S1.p2.1),[§5](https://arxiv.org/html/2606.18621#S5.p4.1)\.
- \[25\]Y\. Liu, Y\. Liu, Y\. Zheng, A\. W\. Liew, X\. Cao, and S\. Pan\(2026\)Rethinking feature alignment in generalist graph anomaly detection: a relational fingerprint\-based approach\.InInternational Conference on Machine Learning,Cited by:[§5](https://arxiv.org/html/2606.18621#S5.p4.1)\.
- \[26\]X\. Ma, J\. Wu, S\. Xue, J\. Yang, C\. Zhou, Q\. Z\. Sheng, H\. Xiong, and L\. Akoglu\(2021\)A comprehensive survey on graph anomaly detection with deep learning\.IEEE transactions on knowledge and data engineering35\(12\),pp\. 12012–12038\.Cited by:[§G\.3](https://arxiv.org/html/2606.18621#A7.SS3.p1.1),[§5](https://arxiv.org/html/2606.18621#S5.p4.1)\.
- \[27\]R\. Miao, Y\. Liu, Y\. Wang, X\. Shen, Y\. Tan, Y\. Dai, S\. Pan, and X\. Wang\(2026\)Blindguard: safeguarding llm\-based multi\-agent systems under unknown attacks\.InProceedings of the 64th Annual Meeting of the Association for Computational Linguistics,Cited by:[§G\.1](https://arxiv.org/html/2606.18621#A7.SS1.p1.1)\.
- \[28\]J\. Pan, Y\. Liu, Y\. Zheng, and S\. Pan\(2023\)PREM: a simple yet effective approach for node\-level graph anomaly detection\.In2023 IEEE International Conference on Data Mining \(ICDM\),pp\. 1253–1258\.Cited by:[§G\.3](https://arxiv.org/html/2606.18621#A7.SS3.p1.1),[§4\.1](https://arxiv.org/html/2606.18621#S4.SS1.p2.1),[§5](https://arxiv.org/html/2606.18621#S5.p4.1)\.
- \[29\]J\. Pan, Y\. Liu, Y\. Zheng, L\. Chi, A\. W\. Liew, and S\. Pan\(2026\)CAMERA: adapting to semantic camouflage in unsupervised text\-attributed graph fraud detection\.InInternational Joint Conference on Artificial Intelligence,Cited by:[§1](https://arxiv.org/html/2606.18621#S1.p2.1)\.
- \[30\]J\. Pan, Y\. Liu, C\. Zhou, F\. Xiong, A\. W\. Liew, and S\. Pan\(2026\)Correcting false alarms from unseen: adapting graph anomaly detectors at test time\.InProceedings of the AAAI Conference on Artificial Intelligence,Cited by:[§1](https://arxiv.org/html/2606.18621#S1.p2.1)\.
- \[31\]J\. Pan, Y\. Zheng, Y\. Tan, and Y\. Liu\(2025\)A survey of generalization of graph anomaly detection: from transfer learning to foundation models\.InThe 16th IEEE International Conference on Knowledge Graphs,Cited by:[§1](https://arxiv.org/html/2606.18621#S1.p2.1)\.
- \[32\]G\. Pang, C\. Shen, L\. Cao, and A\. V\. D\. Hengel\(2021\)Deep learning for anomaly detection: a review\.ACM computing surveys \(CSUR\)54\(2\),pp\. 1–38\.Cited by:[§G\.2](https://arxiv.org/html/2606.18621#A7.SS2.p1.1),[§5](https://arxiv.org/html/2606.18621#S5.p3.1)\.
- \[33\]Z\. Peng, M\. Luo, J\. Li, H\. Liu, Q\. Zheng,et al\.\(2018\)ANOMALOUS: a joint modeling approach for anomaly detection on attributed networks\.\.InIjcai,Vol\.18,pp\. 3513–3519\.Cited by:[§G\.3](https://arxiv.org/html/2606.18621#A7.SS3.p1.1),[§5](https://arxiv.org/html/2606.18621#S5.p4.1)\.
- \[34\]Y\. Qian, Y\. Tan, Y\. Liu, W\. Yu, and S\. Pan\(2026\)DynHD: hallucination detection for diffusion large language models via denoising dynamics deviation learning\.arXiv preprint arXiv:2603\.16459\.Cited by:[§G\.1](https://arxiv.org/html/2606.18621#A7.SS1.p1.1)\.
- \[35\]H\. Qiao and G\. Pang\(2023\)Truncated affinity maximization: one\-class homophily modeling for graph anomaly detection\.Advances in Neural Information Processing Systems36,pp\. 49490–49512\.Cited by:[§G\.3](https://arxiv.org/html/2606.18621#A7.SS3.p1.1),[§5](https://arxiv.org/html/2606.18621#S5.p4.1)\.
- \[36\]H\. Qiao, H\. Tong, B\. An, I\. King, C\. Aggarwal, and G\. Pang\(2025\)Deep graph anomaly detection: a survey and new perspectives\.IEEE Transactions on Knowledge and Data Engineering\.Cited by:[§G\.3](https://arxiv.org/html/2606.18621#A7.SS3.p1.1),[§5](https://arxiv.org/html/2606.18621#S5.p4.1)\.
- \[37\]S\. Ramaswamy, R\. Rastogi, and K\. Shim\(2000\)Efficient algorithms for mining outliers from large data sets\.InProceedings of the 2000 ACM SIGMOD international conference on Management of data,pp\. 427–438\.Cited by:[§G\.2](https://arxiv.org/html/2606.18621#A7.SS2.p1.1),[§4\.1](https://arxiv.org/html/2606.18621#S4.SS1.p2.1),[§5](https://arxiv.org/html/2606.18621#S5.p3.1)\.
- \[38\]R\. Ranjan, V\. Hudovernik, M\. Znidar, C\. I\. Kanatsoulis, R\. R\. Upendra, M\. Mohammadi, J\. Meyer, T\. Palczewski, C\. Guestrin, and J\. LeskovecRelational transformer: toward zero\-shot foundation models for relational data\.InEurIPS 2025 Workshop: AI for Tabular Data,Cited by:[§G\.1](https://arxiv.org/html/2606.18621#A7.SS1.p1.1),[§1](https://arxiv.org/html/2606.18621#S1.p1.1),[§5](https://arxiv.org/html/2606.18621#S5.p2.1)\.
- \[39\]J\. Robinson, R\. Ranjan, W\. Hu, K\. Huang, J\. Han, A\. Dobles, M\. Fey, J\. E\. Lenssen, Y\. Yuan, Z\. Zhang,et al\.\(2024\)Relbench: a benchmark for deep learning on relational databases\.Advances in Neural Information Processing Systems37,pp\. 21330–21341\.Cited by:[§G\.1](https://arxiv.org/html/2606.18621#A7.SS1.p1.1),[§1](https://arxiv.org/html/2606.18621#S1.p1.1),[§4\.1](https://arxiv.org/html/2606.18621#S4.SS1.p1.1)\.
- \[40\]J\. Robinson, R\. Ranjan, W\. Hu, K\. Huang, J\. Han, A\. Dobles, M\. Fey, J\. E\. Lenssen, Y\. Yuan, Z\. Zhang,et al\.\(2024\)Relational deep learning: graph representation learning on relational databases\.InNeurIPS 2024 third table representation learning workshop,Cited by:[§G\.1](https://arxiv.org/html/2606.18621#A7.SS1.p1.1),[§5](https://arxiv.org/html/2606.18621#S5.p2.1)\.
- \[41\]M\. Schlichtkrull, T\. N\. Kipf, P\. Bloem, R\. Van Den Berg, I\. Titov, and M\. Welling\(2018\)Modeling relational data with graph convolutional networks\.InEuropean semantic web conference,pp\. 593–607\.Cited by:[§G\.1](https://arxiv.org/html/2606.18621#A7.SS1.p1.1)\.
- \[42\]X\. Shen, Y\. Liu, Y\. Wang, R\. Miao, Y\. Dai, S\. Pan, Y\. Chang, and X\. Wang\(2026\)Raising the bar in graph ood generalization: invariant learning beyond explicit environment modeling\.IEEE Transactions on Pattern Analysis and Machine Intelligence\.Cited by:[§5](https://arxiv.org/html/2606.18621#S5.p4.1)\.
- \[43\]T\. Shenkar and L\. Wolf\(2022\)Anomaly detection for tabular data with internal contrastive learning\.InInternational conference on learning representations,Cited by:[§G\.2](https://arxiv.org/html/2606.18621#A7.SS2.p1.1),[§5](https://arxiv.org/html/2606.18621#S5.p3.1)\.
- \[44\]Y\. Tan, G\. Long, J\. Jiang, and C\. Zhang\(2024\)Influence\-oriented personalized federated learning\.arXiv preprint arXiv:2410\.03315\.Cited by:[§1](https://arxiv.org/html/2606.18621#S1.p2.1)\.
- \[45\]H\. Thimonier, F\. Popineau, A\. Rimmel, and B\. Doan\(2024\)Beyond individual input for deep anomaly detection on tabular data\.InInternational Conference on Machine Learning,pp\. 48097–48123\.Cited by:[§1](https://arxiv.org/html/2606.18621#S1.p2.1)\.
- \[46\]H\. Thimonier, F\. Popineau, A\. Rimmel, and B\. Doan\(2024\)Retrieval augmented deep anomaly detection for tabular data\.InProceedings of the 33rd ACM international conference on information and knowledge management,pp\. 2250–2259\.Cited by:[§G\.2](https://arxiv.org/html/2606.18621#A7.SS2.p1.1)\.
- \[47\]C\. Tsai, G\. Teng, P\. Wallis, and W\. Ding\(2025\)AnoLLM: large language models for tabular anomaly detection\.InThe Thirteenth International Conference on Learning Representations,Cited by:[§G\.2](https://arxiv.org/html/2606.18621#A7.SS2.p1.1)\.
- \[48\]Y\. Wang, X\. Wang, Q\. Gan, M\. Wang, Q\. Yang, D\. Wipf, and M\. Zhang\(2025\)Griffin: towards a graph\-centric relational database foundation model\.InInternational Conference on Machine Learning,pp\. 64604–64627\.Cited by:[§G\.1](https://arxiv.org/html/2606.18621#A7.SS1.p1.1),[§1](https://arxiv.org/html/2606.18621#S1.p1.1),[§5](https://arxiv.org/html/2606.18621#S5.p2.1)\.
- \[49\]F\. Wu, V\. P\. Dwivedi, and J\. Leskovec\(2025\)Large language models are good relational learners\.InProceedings of the 63rd Annual Meeting of the Association for Computational Linguistics \(Volume 1: Long Papers\),pp\. 7835–7854\.Cited by:[§G\.1](https://arxiv.org/html/2606.18621#A7.SS1.p1.1),[§5](https://arxiv.org/html/2606.18621#S5.p2.1)\.
- \[50\]J\. Yan, E\. Zuo, K\. Liang, M\. Liu, M\. Li, X\. Liu, X\. Lv, and K\. Lu\(2025\)Address anomalies at critical crossroads for graph anomaly detection\.IEEE Transactions on Knowledge and Data Engineering\.Cited by:[§G\.3](https://arxiv.org/html/2606.18621#A7.SS3.p1.1)\.
- \[51\]H\. Ye, H\. Zhao, W\. Fan, M\. Zhou, D\. dan Guo, and Y\. Chang\(2025\)DRL: decomposed representation learning for tabular anomaly detection\.InThe Thirteenth International Conference on Learning Representations,Cited by:[§G\.2](https://arxiv.org/html/2606.18621#A7.SS2.p1.1),[§1](https://arxiv.org/html/2606.18621#S1.p2.1),[§4\.1](https://arxiv.org/html/2606.18621#S4.SS1.p2.1),[§5](https://arxiv.org/html/2606.18621#S5.p3.1)\.
- \[52\]J\. Ye, Z\. Tan, Y\. Hu, X\. Yang, G\. Cheng, and K\. Huang\(2025\)Disentangling tabular data towards better one\-class anomaly detection\.InProceedings of the AAAI Conference on Artificial Intelligence,Vol\.39,pp\. 13061–13068\.Cited by:[§G\.2](https://arxiv.org/html/2606.18621#A7.SS2.p1.1)\.
- \[53\]J\. Yin, Y\. Qiao, Z\. Zhou, X\. Wang, and J\. Yang\(2024\)Mcm: masked cell modeling for anomaly detection in tabular data\.InThe Twelfth International Conference on Learning Representations,Cited by:[§G\.2](https://arxiv.org/html/2606.18621#A7.SS2.p1.1),[§1](https://arxiv.org/html/2606.18621#S1.p2.1),[§4\.1](https://arxiv.org/html/2606.18621#S4.SS1.p2.1),[§5](https://arxiv.org/html/2606.18621#S5.p3.1)\.
- \[54\]Y\. Zhao, Y\. Liu, Q\. Chen, S\. Li, Y\. Tan, and S\. Pan\(2026\)FedCIGAR: a personalized reconstruction approach for federated graph\-level anomaly detection\.InInternational Joint Conference on Artificial Intelligence,Cited by:[§5](https://arxiv.org/html/2606.18621#S5.p4.1)\.
- \[55\]Y\. Zhao, Y\. Liu, S\. Li, Q\. Chen, Y\. Zheng, and S\. Pan\(2025\)Freegad: a training\-free yet effective approach for graph anomaly detection\.InProceedings of the 34th ACM International Conference on Information and Knowledge Management,pp\. 4379–4389\.Cited by:[§1](https://arxiv.org/html/2606.18621#S1.p2.1),[§4\.1](https://arxiv.org/html/2606.18621#S4.SS1.p2.1)\.
- \[56\]E\. Zuo, J\. Yan, A\. Aysa, C\. Chen, C\. Chen, H\. Ma, X\. Lv, and K\. Ubul\(2023\)SUCOLA: self\-adaptive structure refinement unsupervised contrastive learning framework for food safety risk early warning\.Engineering Applications of Artificial Intelligence126,pp\. 107016\.Cited by:[§G\.2](https://arxiv.org/html/2606.18621#A7.SS2.p1.1)\.
- \[57\]E\. Zuo, J\. Zhong, C\. Chen, C\. Chen, K\. Ubul, and X\. Lv\(2025\)Rethinking unsupervised time series anomaly detection: dynamic attention based on route inverse\-masking\.Applied Soft Computing,pp\. 113971\.Cited by:[§G\.2](https://arxiv.org/html/2606.18621#A7.SS2.p1.1)\.
## Appendix AAlgorithm Description
The training and testing algorithms are given in Algorithm[1](https://arxiv.org/html/2606.18621#algorithm1)and Algorithm[2](https://arxiv.org/html/2606.18621#algorithm2), respectively\.
Input:Attribute matrix
𝐗∈ℝN×d\\mathbf\{X\}\\in\\mathbb\{R\}^\{N\\times d\}with attribute blocks
\{𝐱t\}t=0B\\\{\\mathbf\{x\}\_\{t\}\\\}\_\{t=0\}^\{B\}; relation set
ℛ\\mathcal\{R\}with edge sets
\{ℰr\}\\\{\\mathcal\{E\}\_\{r\}\\\}and neighbor features
\{𝐇r\}\\\{\\mathbf\{H\}^\{r\}\\\}; training epochs
EE; batch size
bb; sampled edges per relation
MM; learning rate
η\\eta; sparsity weight
λs\\lambda\_\{s\}\.
Output:Trained model parameters
Θ=\{MLPt,Encattr,Dect,Encself,Encchild,NodeEncr\}t=0B,r∈ℛ\\Theta=\\\{\\operatorname\{MLP\}\_\{t\},\\operatorname\{Enc\}\_\{attr\},\\operatorname\{Dec\}\_\{t\},\\operatorname\{Enc\}\_\{self\},\\operatorname\{Enc\}\_\{child\},\\operatorname\{NodeEnc\}^\{r\}\\\}\_\{t=0\}^\{B,r\\in\\mathcal\{R\}\}\.
1
2Apply feature scaling on
𝐗\\mathbf\{X\}and on each
𝐇r\\mathbf\{H\}^\{r\};
3Initialize all model parameters
Θ\\Theta;
4for*𝑒𝑝𝑜𝑐ℎ=1\\mathit\{epoch\}=1toEE*do
5Generate target\-entity permutation
π\\piover
\{1,…,N\}\\\{1,\\dots,N\\\};
//Pre\-sample at mostMMedges for each relation
6for*r∈ℛr\\in\\mathcal\{R\}*do
7
ℰ~r←\\tilde\{\\mathcal\{E\}\}\_\{r\}\\leftarrowsample
min\(M,\|ℰr\|\)\\min\(M,\|\\mathcal\{E\}\_\{r\}\|\)edges from
ℰr\\mathcal\{E\}\_\{r\}uniformly;
8
9end for
10for*each mini\-batchℬ⊂π\\mathcal\{B\}\\subset\\piof sizebb*do
//Step 1: Conditional Sparse\-Gated Attribute Reconstruction
11Compute block\-wise masks
\{𝐦t\}\\\{\\mathbf\{m\}\_\{t\}\\\}via Eq\. \([3](https://arxiv.org/html/2606.18621#S3.E3)\)–\([4](https://arxiv.org/html/2606.18621#S3.E4)\) and the gated input
𝐱~\\tilde\{\\mathbf\{x\}\}via Eq\. \([5](https://arxiv.org/html/2606.18621#S3.E5)\);
12Encode
𝐳attr\\mathbf\{z\}\_\{attr\}via Eq\. \([6](https://arxiv.org/html/2606.18621#S3.E6)\) and reconstruct
\{𝐱^t\}t=0B\\\{\\hat\{\\mathbf\{x\}\}\_\{t\}\\\}\_\{t=0\}^\{B\}via Eq\. \([8](https://arxiv.org/html/2606.18621#S3.E8)\);
13Compute
ℒrec\\mathcal\{L\}\_\{rec\}via Eq\. \([9](https://arxiv.org/html/2606.18621#S3.E9)\) and
ℒsparse\\mathcal\{L\}\_\{sparse\}via Eq\. \([7](https://arxiv.org/html/2606.18621#S3.E7)\);
14
//Step 2: Dual\-View Multi\-Relational Edge Reconstruction
15Compute
𝐳u,selfrel\\mathbf\{z\}\_\{u,self\}^\{rel\}via Eq\. \([12](https://arxiv.org/html/2606.18621#S3.E12)\) and
𝐳u,childrel\\mathbf\{z\}\_\{u,child\}^\{rel\}via Eq\. \([13](https://arxiv.org/html/2606.18621#S3.E13)\);
16Identify active relations
ℛact\\mathcal\{R\}\_\{act\}with valid sampled edges in
\{ℰ~r\}r∈ℛ\\\{\\tilde\{\\mathcal\{E\}\}\_\{r\}\\\}\_\{r\\in\\mathcal\{R\}\}connected to entities in batch
ℬ\\mathcal\{B\};
17for*r∈ℛactr\\in\\mathcal\{R\}\_\{act\}*do
18Sample a negative neighbor
j∼Uniform\(𝒱r\)j\\sim\\operatorname\{Uniform\}\(\\mathcal\{V\}^\{r\}\)for each in\-batch positive edge
\(u,i\)∈ℰ~r\(u,i\)\\in\\tilde\{\\mathcal\{E\}\}\_\{r\};
19Encode shared neighbor embeddings
𝐞ir,𝐞jr\\mathbf\{e\}\_\{i\}^\{r\},\\mathbf\{e\}\_\{j\}^\{r\}via Eq\. \([14](https://arxiv.org/html/2606.18621#S3.E14)\);
20Compute
ℒqr\\mathcal\{L\}\_\{q\}^\{r\}for
q∈\{self,child\}q\\in\\\{self,child\\\}via Eq\. \([15](https://arxiv.org/html/2606.18621#S3.E15)\);
21
22end for
23
//Step 3: Objective Computation and Parameter Update
24Compute the total objective
ℒ\\mathcal\{L\}via Eq\. \([17](https://arxiv.org/html/2606.18621#S3.E17)\);
25Update
Θ\\Thetausing Adam:
Θ←Θ−η∇Θℒ\\Theta\\leftarrow\\Theta\-\\eta\\nabla\_\{\\Theta\}\\mathcal\{L\};
26
27end for
28
29end for
30return
Θ\\Theta;
Algorithm 1The Training Algorithm of RelADInput:Attribute matrix
𝐗∈ℝN×d\\mathbf\{X\}\\in\\mathbb\{R\}^\{N\\times d\}with attribute blocks
\{𝐱t\}t=0B\\\{\\mathbf\{x\}\_\{t\}\\\}\_\{t=0\}^\{B\}; relation set
ℛ\\mathcal\{R\}with edge sets
\{ℰr\}\\\{\\mathcal\{E\}\_\{r\}\\\}and neighbor features
\{𝐇r\}\\\{\\mathbf\{H\}^\{r\}\\\}; trained model parameters
Θ\\Theta; batch size
bb; top\-
KKfor local fusion; weights
α,β∈\[0,1\]\\alpha,\\beta\\in\[0,1\]\.
Output:Final anomaly scores
\{s\(u\)\}u=1N\\\{s\(u\)\\\}\_\{u=1\}^\{N\}\.
1
//Stage 1: Forward Pass for Entity Representations and Attribute Reconstruction
2for*each mini\-batchℬ⊂\{1,…,N\}\\mathcal\{B\}\\subset\\\{1,\\dots,N\\\}of sizebb*do
3Obtain the gated input
𝐱~\\tilde\{\\mathbf\{x\}\}, attribute latent
𝐳attr\\mathbf\{z\}\_\{attr\}, and block reconstructions
\{𝐱^t\}t=0B\\\{\\hat\{\\mathbf\{x\}\}\_\{t\}\\\}\_\{t=0\}^\{B\}via Eq\. \([3](https://arxiv.org/html/2606.18621#S3.E3)\)–\([8](https://arxiv.org/html/2606.18621#S3.E8)\);
4Compute and cache the block\-level reconstruction errors
ℒt\(u\)\\mathcal\{L\}\_\{t\}\(u\)for
u∈ℬu\\in\\mathcal\{B\}via Eq\. \([10](https://arxiv.org/html/2606.18621#S3.E10)\);
5Encode and cache the relational representations
𝐳u,selfrel\\mathbf\{z\}\_\{u,self\}^\{rel\}and
𝐳u,childrel\\mathbf\{z\}\_\{u,child\}^\{rel\}via Eq\. \([12](https://arxiv.org/html/2606.18621#S3.E12)\)–\([13](https://arxiv.org/html/2606.18621#S3.E13)\);
6
7end for
8
//Stage 2: Attribute\-Level Local Anomaly Scoring
9Compute the local attribute anomaly score
sattr\(u\)s\_\{attr\}\(u\)for all entities using the normalized top\-
KKblock errors via Eq\. \([11](https://arxiv.org/html/2606.18621#S3.E11)\);
10
//Stage 3: Dual\-View Multi\-Relational Anomaly Scoring
11for*r∈ℛr\\in\\mathcal\{R\}*do
12Encode all neighbor entities
\{𝐞ir\}i∈𝒱r\\\{\\mathbf\{e\}\_\{i\}^\{r\}\\\}\_\{i\\in\\mathcal\{V\}^\{r\}\}using the shared neighbor encoder via Eq\. \([14](https://arxiv.org/html/2606.18621#S3.E14)\);
13Compute the average positive\-edge negative log\-likelihoods
ℓ¯selfr\(u\)\\bar\{\\ell\}\_\{self\}^\{r\}\(u\)and
ℓ¯childr\(u\)\\bar\{\\ell\}\_\{child\}^\{r\}\(u\)for entities connected under relation
rr;
14
15end for
16Aggregate the dual\-view relational anomaly scores
srel,self\(u\)s\_\{rel,self\}\(u\)and
srel,child\(u\)s\_\{rel,child\}\(u\)for all entities via Eq\. \([16](https://arxiv.org/html/2606.18621#S3.E16)\);
17
//Stage 4: Hierarchical Anomaly Score Fusion
18Compute the final anomaly score
s\(u\)s\(u\)via Eq\. \([18](https://arxiv.org/html/2606.18621#S3.E18)\);
19return
\{s\(u\)\}u=1N\\\{s\(u\)\\\}\_\{u=1\}^\{N\};
Algorithm 2The Inference Algorithm of RelAD
## Appendix BComplexity Analysis
In this subsection, we discuss the time complexity of RelAD in the testing phase\. The overall cost mainly consists of four stages: conditional sparse\-gated attribute reconstruction, dual\-view entity encoding, dual\-view multi\-relational scoring, and hierarchical anomaly score fusion\. The complexity of conditional sparse\-gated attribute reconstruction is𝒪\(N\(d0h\+∑t=1B\(d0\+dt\)h\+dh\+hdz\+∑t=0B\(dzh\+hdt\)\+\(B\+1\)logK\)\)\\mathcal\{O\}\\big\(N\(d\_\{0\}h\+\\sum\_\{t=1\}^\{B\}\(d\_\{0\}\+d\_\{t\}\)h\+dh\+hd\_\{z\}\+\\sum\_\{t=0\}^\{B\}\(d\_\{z\}h\+hd\_\{t\}\)\+\(B\+1\)\\log K\)\\big\), whereNNis the number of target entities,BBis the number of child\-table aggregated blocks in Eq\. \([5](https://arxiv.org/html/2606.18621#S3.E5)\),dtd\_\{t\}is the dimension of thett\-th block, andd=∑t=0Bdtd=\\sum\_\{t=0\}^\{B\}d\_\{t\}\. This term covers block\-wise gate generation, attribute encoding, block\-specific decoding, and top\-KKblock selection in Eq\. \([11](https://arxiv.org/html/2606.18621#S3.E11)\)\. The complexity of dual\-view entity encoding is𝒪\(N\(d0h\+hdrel\+dchildh\+hdrel\)\)\\mathcal\{O\}\\big\(N\(d\_\{0\}h\+hd\_\{rel\}\+d\_\{child\}h\+hd\_\{rel\}\)\\big\), wheredchildd\_\{child\}is the dimension of𝐱uchild\\mathbf\{x\}\_\{u\}^\{child\}\. The complexity of dual\-view multi\-relational scoring is𝒪\(∑r∈ℛ\|𝒱r\|\(drh\+hdrel\)\+drel∑r∈ℛ\|ℰr\|\+N\|ℛ\|logK\)\\mathcal\{O\}\\big\(\\sum\_\{r\\in\\mathcal\{R\}\}\|\\mathcal\{V\}^\{r\}\|\(d\_\{r\}h\+hd\_\{rel\}\)\+d\_\{rel\}\\sum\_\{r\\in\\mathcal\{R\}\}\|\\mathcal\{E\}\_\{r\}\|\+N\|\\mathcal\{R\}\|\\log K\\big\), where\|𝒱r\|\|\\mathcal\{V\}^\{r\}\|and\|ℰr\|\|\\mathcal\{E\}\_\{r\}\|denote the number of neighbor entities and positive edges under relationrr, respectively, anddrd\_\{r\}is the feature dimension of neighbor entities under relationrr\. The first term encodes relation\-specific neighbor entities in Eq\. \([14](https://arxiv.org/html/2606.18621#S3.E14)\), the second term computes positive\-edge likelihoods for the two branches up to a constant factor, and the last term selects the top\-KKrelation\-wise scores in Eq\. \([16](https://arxiv.org/html/2606.18621#S3.E16)\)\. The complexity of hierarchical anomaly score fusion in Eq\. \([18](https://arxiv.org/html/2606.18621#S3.E18)\) is𝒪\(N\)\\mathcal\{O\}\(N\)\.
In summary, by treating the feature dimensions, hidden dimensions, the number of child\-table aggregated blocksBB, the number of relation types\|ℛ\|\|\\mathcal\{R\}\|, and the latent dimensiondreld\_\{rel\}as constants, the overall testing complexity simplifies to𝒪\(N\+∑r∈ℛ\|𝒱r\|\+∑r∈ℛ\|ℰr\|\)\\mathcal\{O\}\(N\+\\sum\_\{r\\in\\mathcal\{R\}\}\|\\mathcal\{V\}^\{r\}\|\+\\sum\_\{r\\in\\mathcal\{R\}\}\|\\mathcal\{E\}\_\{r\}\|\)\. This demonstrates that the inference time of RelAD scales linearly with the total number of target entities, neighbor entities, and relation\-specific edges, ensuring its practical efficiency and scalability for large\-scale relational databases\.
## Appendix CDatasets
Table 3:Statistics of datasets\.To comprehensively evaluate the generalization capability and practical effectiveness of our proposed model, we introduce six large\-scale, real\-world relational datasets into our experiments\. These datasets span multiple core domains, including e\-commerce, question\-and\-answering \(Q&A\) communities, fashion retail, online advertising, enterprise resource planning \(ERP\) systems, and academic citation networks\. They encompass massive heterogeneous node entities and record rich multimodal attributes along with complex, dynamic interaction relations, providing a highly challenging and high\-fidelity testing environment to examine the model’s performance in complex network topologies and real\-world relational reasoning tasks\. Dataset statistics are summarized in Table[3](https://arxiv.org/html/2606.18621#A3.T3), with detailed descriptions provided below:
- •Amazon:This dataset focuses on the e\-commerce interactions within the books category of the Amazon platform\. It comprehensively records product entities \(e\.g\., price, category\), user profiles, and massive review interactions containing rich unstructured text, numerical ratings, and “verified purchase” labels\. It provides multi\-dimensional feature support for modeling the deep associations among user preferences, implicit intents, and product characteristics\.
- •Stack:This dataset originates from the stats\-exchange site within the Stack Exchange Q&A network, focusing on the domains of statistics and machine learning\. It covers multi\-dimensional user activity trajectories based on a community reputation system, including user biographies, Q&A texts, edit histories, and voting behaviors\. It highly reflects the temporal evolution of information flow and self\-moderating mechanisms within a knowledge community, providing abundant temporal network features driven by collective behavior\.
- •HM:This dataset integrates massive online transaction records and rich entity metadata from the globally renowned fast\-fashion brand H&M\. It encompasses detailed customer demographic features and specific parameters for various product categories, exhibiting typical characteristics of long\-term, large\-scale consumer behavior\. It serves as a high\-quality benchmark for studying consumer purchasing patterns and product lifecycle predictions by delineating historical transaction relations within complex transaction graphs\.
- •Avito:This dataset originates from a contextual click\-through rate \(CTR\) prediction competition hosted by Avito, a leading Russian classified advertisements platform\. It encompasses large\-scale user search query logs, detailed descriptions of ad attributes, and related exposure context features\. It focuses on mining the precise matching relationships between search intents and ad contents, making it a classic scenario for validating a model’s capability to process high\-dimensional sparse features and complex context predictions\.
- •SALT:This dataset utilizes end\-to\-end real\-world enterprise business transaction data released by SAP AI Research, collected directly from authentic Enterprise Resource Planning \(ERP\) systems\. It covers sales document headers, line items, linked customer master data, and business workflow entities \(e\.g\., sales offices, shipping points\) with precise timestamps\. It preserves an extremely high degree of industrial authenticity, focusing on the prediction of operational variables in practical supply chain and order fulfillment scenarios\.
- •ArXiv:This dataset constructs a large\-scale academic literature network based on the arXiv physics domain between 2018 and 2023\. It encompasses over 222,000 papers, 143,000 uniquely mapped authors via ORCID, 1\.5 million directed citation links, and a hierarchical taxonomy of 53 physics subcategories\. It provides a highly dense and complex many\-to\-many scientific knowledge graph, serving as a high\-fidelity validation space for evaluating the model’s ability to track scientific research evolution and analyze academic network topologies\.
## Appendix DAnomaly Injection Procedure
This section details the anomaly injection methods and implementation specifics applied across six relational datasets\. To accurately evaluate the performance of anomaly detection algorithms in real\-world business scenarios and effectively prevent models from relying on simple statistical features for “shortcut learning,” our anomaly injection process strictly adheres to three core design principles:
1. 1\.Stratified Injection Rate:We initially sample 5% of the total population across all datasets using a stratified sampling strategy \(e\.g\., binning by node activity or degree\) to eliminate selection bias, ensuring that the statistical distribution of the anomalous group rigorously aligns with that of the normal group\. Importantly, anomaly labels are assigned only after verifying that the sampled entities satisfy the dataset\-specific injection constraints\. Consequently, while our target injection ratio is 5%, the final proportion of labeled anomalies closely approximates this target but may be slightly lower in certain datasets due to these validity constraints\.
2. 2\.Strict “Replace\-Only” Strategy:We strictly enforce a “replace\-only” strategy during the injection phase, firmly prohibiting the addition or deletion of any data rows\. This guarantees that the foundational statistical features of the nodes \(e\.g\., interaction frequency, total degree\) remain absolutely conserved before and after injection\.
3. 3\.Real\-World Scenario Reconstruction:Each injection strategy is explicitly designed to reconstruct real\-world business fraud scenarios, discarding mere random noise\.
Guided by these principles, we design a separate injection rule for each dataset according to its specific schema and business context\. The per\-dataset details are as follows:
- •Amazon\.In e\-commerce platforms, review manipulation \(i\.e\., a user suddenly posting reviews on products in categories unrelated to their purchase history\) is a common fraud pattern\. We treat such category\-shifting users as anomalous\. Specifically, we select 5% of users via stratified sampling from those whose review count exceeds the 20th\-percentile threshold and whose dominant category share is above 25%\. For each selected user, we choose a target category different from their dominant one and replace 5–10 of their reviews with products from that category, resampling the rating and verified status from the target category distribution\.
- •ArXiv\.In academic citation networks, citation cartels \(i\.e\., a group of authors collectively citing a shared set of papers to inflate visibility\) represent a realistic form of misconduct\. We treat such coordinated citing authors as anomalous\. Specifically, we sample 5% of authors with at least one outgoing citation via degree\-weighted sampling and organize them into cartels of 6–14 members\. For each cartel, 15 high\-citation papers spanning multiple categories are selected as shared beacons, and 40% of each member’s outgoing citation edges are replaced with these beacon papers\.
- •Avito\.Click farming \(i\.e\., a user concentrating clicks on a small set of paid ads to generate fake traffic\) is a prevalent fraud scenario in online advertising\. We treat users exhibiting such concentrated click patterns as anomalous\. Specifically, we uniformly sample 5% of users and replace 80%–100% of their clicked rows in theSearchStreamtable with 5 fixed high\-priced context ads, setting the position to 1–2\. TheVisitStreamandPhoneRequestsStreamtables are similarly redirected to the same target ad pool\.
- •HM\.Considering that scalper\-like purchasing is a notable risk in fashion retail, we treat users who buy products far outside their dominant style preference as anomalous\. Specifically, 5% of active customers are selected via stratified sampling, and 18%–35% of their transactions are replaced with products from a different style category, with price resampled accordingly\.
- •SALT\.In enterprise order systems, anomalous transactions often exhibit rare attribute combinations that seldom co\-occur under normal operations\. We treat customers whose orders contain such rare\-combination patterns as anomalous\.Specifically, we select 5% of active customers via two\-dimensional stratified sampling and replace 3%–8% of their order line attributes \(product, category, plant, and shipping point\) with rare values drawn from the bottom 20th percentile of the global distribution\.
- •Stack\.In Q&A platforms, cross\-topic comment bots that post under unrelated topics are a representative form of spam behavior\. We sample 5% of users via stratified sampling and redirect approximately 60% of their comments to posts under non\-dominant topics, keeping the comment text unchanged\.
## Appendix EImplementation Details
Architecture, Training, and Inference Details\.In our experiments, all encoders, including the attribute encoderEncattr\\operatorname\{Enc\}\_\{attr\}, the dual\-view relational encoders \(Encself\\operatorname\{Enc\}\_\{self\}andEncchild\\operatorname\{Enc\}\_\{child\}\), and the neighbor\-entity encodersNodeEncr\\operatorname\{NodeEnc\}^\{r\}, along with the block\-specific decodersDect\\operatorname\{Dec\}\_\{t\}, are implemented using two\-layer Multilayer Perceptrons \(MLPs\) with ReLU activation\. We usehhto denote the hidden dimension of the two\-layer MLPs, and seth=1024h=1024for the attribute encoder, relational encoders, gate generators, and block\-specific decoders\. The attribute latent dimension is denoted bydzd\_\{z\}and set to6464, while the relational latent dimension is denoted bydreld\_\{rel\}and also set to6464\. For the neighbor\-entity encodersNodeEncr\\operatorname\{NodeEnc\}^\{r\}, the hidden dimension is set to256256, with input dimensions dynamically adapted to the specific feature blocks of𝒱r\\mathcal\{V\}^\{r\}\. To stabilize training, Layer Normalization is applied in the intermediate layers ofEncattr\\operatorname\{Enc\}\_\{attr\}and the relational encoders\. For each dataset, we tune the key optimization and fusion hyperparameters by random search, including the learning rate, weight decay, batch size, training epochs, sparsity coefficientλs\\lambda\_\{s\}, and fusion coefficientsα\\alphaandβ\\beta\.
Regarding the objective function and inference, the model is optimized by the overall objective in Eq\. \([17](https://arxiv.org/html/2606.18621#S3.E17)\), whereλs\\lambda\_\{s\}controls the sparsity regularization for compact feature selection\. To address GPU memory bottlenecks caused by large\-scale relational neighbor sets, we restrict the maximum number of neighbor entities residing on the GPU to500,000500,000, while implementing an edge sampling mechanism that limits samples to200,000200,000per relation type per epoch\.
Data Preprocessing\.The data processing pipeline initiates by integrating anomaly labels𝐲\\mathbf\{y\}with the central table𝒯0\\mathcal\{T\}^\{0\}to define the target entities𝒰=\{u1,…,uN\}\\mathcal\{U\}=\\\{u\_\{1\},\\dots,u\_\{N\}\\\}\. Global entity IDs are mapped to a contiguous local index space\. As defined in the initial features setting, we follow the RelBench\-style preprocessing protocol to obtain row representations𝐡vk\\mathbf\{h\}\_\{v\}^\{k\}for each table, with normalization applied to numerical attributes and unified encodings applied to heterogeneous raw columns\.
To capture inter\-table dependencies, we instantiate the heterogeneous graph𝒢\\mathcal\{G\}by resolving predefined primary–foreign key schemas\. This process extracts clean, relation\-specific edge listsℰr\\mathcal\{E\}\_\{r\}connecting central entitiesu∈𝒰u\\in\\mathcal\{U\}to their neighborsi∈𝒱ri\\in\\mathcal\{V\}^\{r\}\. Following Eq\. \([2](https://arxiv.org/html/2606.18621#S2.E2)\), we compute the aggregated feature blocks𝐱uagg,b\\mathbf\{x\}\_\{u\}^\{agg,b\}via two distinct modes: \(1\) direct aggregation of schema\-connected child tables \(calculating descriptive statistics such as mean, standard deviation, and counts\); and \(2\) edge\-mediated aggregation of neighbor feature matrices𝐇r\\mathbf\{H\}^\{r\}alongℰr\\mathcal\{E\}\_\{r\}\.
Finally, for each target entityuu, the self\-feature𝐱uself\\mathbf\{x\}\_\{u\}^\{self\}is concatenated with all aggregated blocks𝐱uagg,b\\mathbf\{x\}\_\{u\}^\{agg,b\}to form the unified attribute matrix𝐗∈ℝN×d\\mathbf\{X\}\\in\\mathbb\{R\}^\{N\\times d\}\. The complete pre\-processing module outputs𝐗\\mathbf\{X\}, neighbor features𝐇r\\mathbf\{H\}^\{r\}, relation\-specific edgesℰr\\mathcal\{E\}\_\{r\}, and the label vector𝐲∈\{0,1\}N\\mathbf\{y\}\\in\\\{0,1\\\}^\{N\}, providing a well\-aligned and comprehensive input formulation for the downstream anomaly scoring functionff\.
Computing infrastructures\.We implement the proposed method with Python 3\.9 and PyTorch 2\.8\.0\. The key dependencies include scikit\-learn, Pandas, and Numpy\. All experiments are conducted on a Ubuntu server with Intel Xeon Platinum 8352V CPU \(16 vCPU\) and NVIDIA RTX 4090 GPU \(24GB\) with CUDA 11\.8\.
## Appendix FSupplemental Experiments
Table 4:Anomaly detection performance in terms of AUROC \(in percent, mean±std\)\. OOM denotes out\-of\-memory on a 24GB GPU\. Best results are highlighted inbold\.Table 5:Anomaly detection performance in terms of AUPRC \(in percent, mean±std\)\. OOM denotes out\-of\-memory on a 24GB GPU\. Best results are highlighted inbold\.### F\.1Performance Comparison in Terms of AUROC and AUPRC
Tables[4](https://arxiv.org/html/2606.18621#A6.T4)and[5](https://arxiv.org/html/2606.18621#A6.T5)further report the mean and standard deviation results over multiple runs\. The relatively low variances demonstrate the robustness and stability of RelAD across different datasets\. The results show that RelAD not only achieves consistently strong performance across all datasets, but also maintains relatively low variance, demonstrating its robustness and stability for relational anomaly detection\.
### F\.2Detailed Results of Ablation Study
Table[6](https://arxiv.org/html/2606.18621#A6.T6)reports the full ablation results on the remaining three datasets \(SALT, Stack, Avito\)\. Consistent with the findings in the main text, the full RelAD achieves the best AUROC on all datasets\. Notably, on SALT, removing the attribute branch \(w/o Attr\) causes the largest drop \(from 84\.90 to 65\.74\), in contrast to the main\-text datasets where relational reconstruction dominates\. This suggests that on SALT, attribute deviations serve as the primary anomaly indicator\. On Stack and Avito, the relative importance of each component aligns with the main\-text observations: block\-specific decoding and conditional gating consistently contribute, and removing either relational view degrades performance\.
Table[7](https://arxiv.org/html/2606.18621#A6.T7)reports the full AUPRC ablation results across all six datasets, largely corroborating the conclusions drawn from the AUROC analysis\. Overall, the complete RelAD framework achieves the best or highly competitive AUPRC scores across diverse scenarios\. Specifically, relational modeling \(w/o Relation\) remains the most crucial component on Amazon, ArXiv, and HM, where its removal leads to severe AUPRC degradation \(e\.g\., a massive drop from 29\.56 to 7\.65 on HM, and 13\.89 to 4\.33 on ArXiv\)\. Conversely, mirroring the AUROC findings, the attribute reconstruction branch \(w/o Attr\) proves indispensable on SALT, Stack, and Avito, causing the sharpest AUPRC drops when removed \(e\.g\., 17\.22 to 10\.32 on SALT, and 7\.66 to 5\.63 on Stack\)\. Furthermore, conditional gating and block\-specific decoding \(w/o Gatingandw/o Block Dec\.\) consistently improve AUPRC, validating their capability to isolate fine\-grained anomaly signals\. Regarding the dual\-view design, both views provide complementary benefits depending on the dataset’s structural semantics: removing the child view causes a stark drop on HM \(29\.56 to 9\.77\), whereas removing the self view is more detrimental on ArXiv \(13\.89 to 5\.11\)\. Although partial variants occasionally exhibit marginal AUPRC fluctuations on specific datasets \(e\.g\.,w/o Relationon SALT orw/o Self Viewon Stack\), these minor variances are typical in highly imbalanced datasets\. These results further confirm that RelAD benefits from all proposed components across diverse relational databases, maintaining the most robust and comprehensive performance globally\.
Table 6:AUROC of RelAD and its variants on remaining datasets\.Table 7:AUPRC of RelAD and its variants on all datasets\.
### F\.3Detailed Results of Hyperparameter Analysis
Fig\.[5](https://arxiv.org/html/2606.18621#A6.F5)further presents the sensitivity analysis of RelAD toα\\alphaandβ\\betaon four more representative datasets, including ArXiv, Avito, HM and Stack, to verify the generalization of our model’s parameter adaptability\. Consistent with the observations on Amazon and SALT, the optimal settings of the two weights show distinct differences across datasets, which are highly coupled with the core anomaly patterns of the data\. Specifically, ArXiv and HM achieve the optimal detection performance whenα\\alphais set to a small value near 0, indicating that anomalies in these two datasets are mainly embodied in abnormal multi\-relational connection patterns, so the model relies more on the multi\-relational edge reconstruction module to identify anomalies\. On the contrary, Avito and Stack obtain the best performance with a relatively largeα\\alpha, which demonstrates that attribute deviations are the dominant manifestation of anomalies in these datasets, and the attribute deviation modeling module plays a more critical role in anomaly scoring\.
For the balance weightβ\\beta, all four datasets reach the peak performance at moderate values, rather than extreme settings of 0 or 1\. This result is consistent with our previous findings, which further confirms that the two relational views modeled in RelAD can provide complementary anomaly evidence, and the moderate fusion of the two views can fully mine the relational information in the data to boost detection performance\. The extended experimental results on more datasets consistently validate that anomalies in different relational databases may originate from either attribute deviations or relational connection abnormalities\. Benefiting from the flexible weight tuning mechanism, RelAD can adaptively adjust the fusion weights according to the underlying anomaly source of different data, thus maintaining stable and superior detection performance across diverse application scenarios\.
\(a\)ArXiv
\(b\)Avito
\(c\)HM
\(d\)Stack
Figure 5:Sensitivity analysis of RelAD with respect toα\\alphaandβ\\betaon remaining datasets\.
## Appendix GDetailed Related Work
### G\.1Relational Deep Learning
Relational deep learning \(RDL\) aims to learn from multi\-table relational databases by modeling entities and their primary–foreign key relationshipsRobinsonet al\.\([2024b](https://arxiv.org/html/2606.18621#bib.bib8)\);[Dwivediet al\.](https://arxiv.org/html/2606.18621#bib.bib17); Chenet al\.\([2025b](https://arxiv.org/html/2606.18621#bib.bib18)\); Schlichtkrullet al\.\([2018](https://arxiv.org/html/2606.18621#bib.bib38)\); Miaoet al\.\([2026](https://arxiv.org/html/2606.18621#bib.bib49)\); Qianet al\.\([2026](https://arxiv.org/html/2606.18621#bib.bib50)\)\. To advance this field, prior worksRobinsonet al\.\([2024a](https://arxiv.org/html/2606.18621#bib.bib1)\); Guet al\.\([2026](https://arxiv.org/html/2606.18621#bib.bib19)\); Wuet al\.\([2025](https://arxiv.org/html/2606.18621#bib.bib9)\)have established large\-scale relational benchmarks with diverse real\-world datasets and tasks, thereby promoting the development of relational foundation models and representation learning methodsLiet al\.\([2026a](https://arxiv.org/html/2606.18621#bib.bib43),[c](https://arxiv.org/html/2606.18621#bib.bib42)\)\. Among them, GriffinWanget al\.\([2025](https://arxiv.org/html/2606.18621#bib.bib20)\)transforms relational databases into heterogeneous graphs for graph\-centric relational representation learning\. RT[Ranjanet al\.](https://arxiv.org/html/2606.18621#bib.bib21)introduces a Transformer\-based framework to model rows, columns, and inter\-table relations through relational attention\. PLURELKothapalliet al\.\([2026](https://arxiv.org/html/2606.18621#bib.bib22)\)further studies scaling laws for relational foundation models and shows that large\-scale synthetic relational data can significantly improve relational pretraining and generalization\.
Despite these advances, existing RDL studies mainly focus on predictive tasks such as classification, recommendation, and representation learning, while anomaly detection on relational data remains unexplored\. In this work, we formally define the problem of relational anomaly detection, aiming to identify abnormal central entities from relational databases with multiple related tables and foreign\-key dependencies, and construct corresponding benchmark datasets and evaluation protocols\.
### G\.2Tabular Anomaly Detection
Tabular anomaly detection \(TAD\) aims to detect anomalous instances by measuring deviations from normal patterns in structured tabular dataZuoet al\.\([2025](https://arxiv.org/html/2606.18621#bib.bib56)\); Panget al\.\([2021](https://arxiv.org/html/2606.18621#bib.bib10)\); Borisovet al\.\([2022](https://arxiv.org/html/2606.18621#bib.bib36)\); Shenkar and Wolf \([2022](https://arxiv.org/html/2606.18621#bib.bib11)\); Zuoet al\.\([2023](https://arxiv.org/html/2606.18621#bib.bib55)\)\. Classical methods rely on fixed inductive biases in the original feature space, including isolation\-based approaches such as Isolation ForestLiuet al\.\([2019](https://arxiv.org/html/2606.18621#bib.bib23)\), density\-based methods such as LOFBreuniget al\.\([2000](https://arxiv.org/html/2606.18621#bib.bib24)\), and distance\-based methods such as kNN scoringRamaswamyet al\.\([2000](https://arxiv.org/html/2606.18621#bib.bib25)\)\. These methods are computationally efficient but rely heavily on handcrafted assumptions about data geometry, which limits their robustness in complex data distributions\. Recent advances in deep learning\-based TAD have achieved strong performance improvementsThimonieret al\.\([2024b](https://arxiv.org/html/2606.18621#bib.bib39)\); Tsaiet al\.\([2025](https://arxiv.org/html/2606.18621#bib.bib40)\); Yeet al\.\([2025b](https://arxiv.org/html/2606.18621#bib.bib41)\); Changet al\.\([2023](https://arxiv.org/html/2606.18621#bib.bib44)\); Liet al\.\([2026b](https://arxiv.org/html/2606.18621#bib.bib58)\)\. Among them, representation learning\-based methods form one of the most effective directions\. For instance, MCMYinet al\.\([2024](https://arxiv.org/html/2606.18621#bib.bib33)\)introduces masked cell modeling to reconstruct randomly masked feature values, enabling the model to learn feature interactions in a self\-supervised manner\. DRLYeet al\.\([2025a](https://arxiv.org/html/2606.18621#bib.bib34)\)further proposes decomposed representation learning to disentangle latent factors and enhance anomaly separability\. Despite their effectiveness, existing TAD methods are fundamentally designed for single\-table data and rely on learning from flattened feature representations\. When applied to relational data, they require aggregating multiple related tables into a single feature vector, which inevitably destroys interaction semantics encoded by foreign\-key relations and mixes heterogeneous features in a unified space\.
### G\.3Graph Anomaly Detection
Graph anomaly detection \(GAD\) aims to identify anomalous data that are different from the majorityMaet al\.\([2021](https://arxiv.org/html/2606.18621#bib.bib12)\); Qiaoet al\.\([2025](https://arxiv.org/html/2606.18621#bib.bib13)\); Hoet al\.\([2025](https://arxiv.org/html/2606.18621#bib.bib45)\); Yanet al\.\([2025](https://arxiv.org/html/2606.18621#bib.bib54)\), involving node, edge and graph levels\. In this paper, we focus on the node\-level anomaly detection on graphsPenget al\.\([2018](https://arxiv.org/html/2606.18621#bib.bib15)\); Liet al\.\([2017](https://arxiv.org/html/2606.18621#bib.bib14)\); Donget al\.\([2025](https://arxiv.org/html/2606.18621#bib.bib47)\)\. Recent advances utilize graph neural network \(GNNs\) to build powerful GAD modelsQiaoet al\.\([2025](https://arxiv.org/html/2606.18621#bib.bib13)\); Panet al\.\([2023](https://arxiv.org/html/2606.18621#bib.bib27)\); Qiao and Pang \([2023](https://arxiv.org/html/2606.18621#bib.bib16)\)\. For instance, DOMINANTDinget al\.\([2019](https://arxiv.org/html/2606.18621#bib.bib26)\)introduces GCN\-based graph autoencoder to reconstruct both structure and node attribute information\. CoLALiuet al\.\([2021](https://arxiv.org/html/2606.18621#bib.bib28)\)utilizes a contrastive GNN framework, combining contrastive learning and subgraph extraction to perform self\-supervised GAD\. Although these GAD methods have shown efficacy on homogeneous graphs, real\-world relational data typically involves heterogeneous entity associationsBinget al\.\([2023](https://arxiv.org/html/2606.18621#bib.bib46)\), e\.g\., user–product interactions in risk\-control systems\. While collapsing heterogeneous relations into a homogeneous graph can address this issue, it may discard relation\-specific semantics\. Therefore, exploiting heterogeneous entity relationships for anomaly detection remains an open challenge\.
## Appendix HLimitations and Broader Impacts
Our work focuses on identifying abnormal entities from relational databases by jointly modeling multi\-table attributes and typed foreign\-key relations\. A key limitation is that the current framework does not explicitly model temporal dynamics, although many real\-world anomalies evolve over time through changing entity attributes and interaction patterns\. Future work could extend this setting to dynamic relational databases where entities, attributes, and relations are updated continuously\.
This study may have positive societal impacts by improving anomaly detection in relational applications such as fraud detection, risk monitoring, industrial operation analysis, and abnormal behavior discovery\. At the same time, deployment in high\-stakes domains should be treated carefully\. Incorrect anomaly scores may lead to false positives, unnecessary investigation, or unfair treatment of individuals or organizations\. Since relational databases may contain sensitive personal or business information, practical use should follow privacy\-preserving data governance, access control, and domain\-specific auditing\. The proposed method is intended as a research tool for benchmark evaluation rather than an autonomous decision\-making system, and any real\-world deployment should involve human oversight and fairness assessment\.Similar Articles
Rethinking Structural Anomaly Detection: From Decision Boundaries to Projection Operators
This paper rethinks structural anomaly detection by shifting from decision boundaries to projection operators onto the low-dimensional manifold of normal data, showing that projection-aligned methods outperform existing boundary-based and reconstruction-based approaches.
Bridging Classification and Reconstruction: Cooperative Time Series Anomaly Detection
This paper proposes CoAD, a novel framework that unifies Outlier Exposure (classification) and Masked Autoencoder (reconstruction) paradigms for time series anomaly detection, addressing their respective limitations. Extensive experiments show that CoAD significantly outperforms state-of-the-art methods while being lightweight and fast.
Rethinking Continual Anomaly Detection on the Edge: Benchmarking Under Realistic Industrial Conditions
This paper introduces a unified benchmark for continual anomaly detection in industrial inspection, addressing unrealistic evaluation and edge deployment constraints, and proposes DINOSaur, a training-free method that outperforms existing approaches with zero forgetting and sub-100ms inference on edge hardware.
D2H-AD: A Hybrid Model Utilizing Hyperdimensional Computing for Advanced Anomaly Detection
D2H-AD is a novel anomaly detection framework using Hyperdimensional Computing (HDC) that combines distance-based and density-aware encoding. It outperforms five baselines across multiple benchmarks, offering lightweight, interpretable, and efficient performance for edge AI and IoT.
TPA-AD: A Two-Stage Pseudo Anomaly-Guided Method for Bearing Time-Series Anomaly Detection
TPA-AD is a two-stage pseudo anomaly-guided method for bearing time-series anomaly detection that generates pseudo-anomalous windows near normal boundaries using reconstruction models and contrastive learning, then scores anomalies with KNN—without requiring real anomaly samples during training. It is evaluated on bearing fault and degradation datasets, including high-speed train axle-box bearing data.