hia-gat: A Heterogeneous Interaction-Aware Graph Attention Network For Frame-Level Traffic Conflict Risk Prediction On Freeways

arXiv cs.LG 06/29/26, 04:00 AM Papers
Summary
This paper proposes HIA-GAT, a dual-stream heterogeneous graph attention network that integrates longitudinal and lateral vehicle interactions with a conflict-type-aware gating mechanism for frame-level traffic conflict risk prediction on freeways. Experiments on NGSIM datasets show improved risk-ranking performance, particularly for lateral conflicts, and provide interpretable per-vehicle conflict attribution.
arXiv:2606.27577v1 Announce Type: new Abstract: This paper formulates frame-level freeway risk assessment as a multi-agent scene graph-level binary classification problem, where each video or trajectory frame is labeled risky if any TTC- or PET-based conflict violates a specified severity threshold. We construct a relation-aware graph per frame with vehicles as nodes and two interaction types as edges: same-lane (longitudinal) and adjacent-lane (lateral), augmented with physics-informed edge features aligned to rear-end and lane-change conflict mechanisms. Building on a structured benchmarking suite of non-graph models and graph baselines, we propose HIA-GAT, a dual-stream heterogeneous graph attention network that processes longitudinal and lateral interactions through dedicated attention pathways and fuses them via a conflict-type-aware gating mechanism with event-level gate supervision derived from SSM conflict attribution. Experiments on the NGSIM I-80 and US-101 freeway datasets across nine TTC and PET threshold configurations show that HIA-GAT achieves the best average risk-ranking performance (AUC 0.835 on I-80 and 0.867 on US-101), with the largest gains on PET-only (lane-change) settings where relational structure is essential. Beyond accuracy, the learned gate provides interpretable per-vehicle attribution of dominant conflict type, supporting actionable, real-time freeway safety monitoring. We show that graph structure is critical for modeling lateral conflict risk, while longitudinal risk can often be captured by non-relational aggregation.
Original Article
View Cached Full Text
Cached at: 06/29/26, 05:23 AM
# HIA-GAT: A Heterogeneous Interaction-Aware Graph Attention Network for Frame-Level Traffic Conflict Risk Prediction on Freeways
Source: [https://arxiv.org/html/2606.27577](https://arxiv.org/html/2606.27577)
Mahshid Malazizi Center for Urban Informatics and Progress University of Tennessee at Chattanooga Chattanooga, TN, USA, 37403 ntv774@mocs\.utc\.edu &Seyedmehdi Khaleghian Center for Urban Informatics and Progress University of Tennessee at Chattanooga Chattanooga, TN, USA, 37403 mehdi\-khaleghian@utc\.edu &Mina Sartipi Center for Urban Informatics and Progress University of Tennessee at Chattanooga Chattanooga, TN, USA, 37403 Mina\-Sartipi@utc\.edu &Toru Hirano DENSO International America, Inc\. Southfield, MI, USA &Yunfei Xu DENSO International America, Inc\. Southfield, MI, USA &Hoang H\. Nguyen Center for Urban Informatics and Progress University of Tennessee at Chattanooga Chattanooga, TN, USA, 37403 huuhoang\-nguyen@utc\.edu

###### Abstract

This paper formulates frame\-level freeway risk assessment as a multi\-agent scene \(graph\-level\) binary classification problem, where each video/trajectory frame is labeled risky if any TTC\- or PET\-based conflict violates a specified severity threshold\. We construct a relation\-aware graph per frame with vehicles as nodes and two interaction types as edges—same\-lane \(longitudinal\) and adjacent\-lane \(lateral\)—augmented with physics\-informed edge features aligned to rear\-end and lane\-change conflict mechanisms\. Building on a structured benchmarking suite of non\-graph models and graph baselines, we propose HIA\-GAT, a dual\-stream heterogeneous graph attention network that processes longitudinal and lateral interactions through dedicated attention pathways and fuses them via a conflict\-type\-aware gating mechanism with event\-level gate supervision derived from SSM conflict attribution\. Experiments on the NGSIM I\-80 and US\-101 freeway datasets across nine TTC/PET threshold configurations show that HIA\-GAT achieves the best average risk\-ranking performance \(AUC 0\.835 on I\-80 and 0\.867 on US\-101\), with the largest gains on PET\-only \(lane\-change\) settings where relational structure is essential\. Beyond accuracy, the learned gate provides interpretable per\-vehicle attribution of dominant conflict type, supporting actionable, real\-time freeway safety monitoring\. We show that graph structure is critical for modeling lateral conflict risk, while longitudinal risk can often be captured by non\-relational aggregation\.

## 1I Introduction

Traffic safety analysis at intersections and highway facilities has traditionally relied on historical crash records\. Although essential for long\-term planning, crash\-based methods are reactive and constrained by sparse, delayed observations\[[84](https://arxiv.org/html/2606.27577#bib.bib84)\]\. To support proactive evaluation, surrogate safety measures \(SSMs\), particularly Time\-to\-Collision \(TTC\) and Post\-Encroachment Time \(PET\), are widely used to quantify conflict severity and near\-miss dynamics before crashes occur\[[38](https://arxiv.org/html/2606.27577#bib.bib38)\]\. However, real\-time safety assessment at complex facilities remains challenging\. Roadside video sensing and computer vision now provide a scalable, cost\-effective way to extract contextual and kinematic information for multiple road users\[[12](https://arxiv.org/html/2606.27577#bib.bib12)\]\. Yet converting noisy multi\-agent trajectories into reliable frame\-level safety indicators remains difficult because SSMs are defined on pairwise interactions, while real scenes contain many simultaneous interactions that must be summarized at the scene level\. Existing conflict\-based studies mainly focus on intersections and specific interaction types\[[75](https://arxiv.org/html/2606.27577#bib.bib75),[76](https://arxiv.org/html/2606.27577#bib.bib76),[77](https://arxiv.org/html/2606.27577#bib.bib77)\], while broader SSM research emphasizes thresholds, crash\-outcome links, contributing factors, and validation against crash data\[[75](https://arxiv.org/html/2606.27577#bib.bib75),[78](https://arxiv.org/html/2606.27577#bib.bib78),[79](https://arxiv.org/html/2606.27577#bib.bib79),[80](https://arxiv.org/html/2606.27577#bib.bib80)\]\. This leaves a gap in video\-compatible, real\-time, frame\-level risk assessment for scenes with multiple interacting agents\.

This gap is especially consequential for highway environments, which have been comparatively less emphasized in conflict\-based risk evaluation despite exhibiting distinct traffic dynamics relative to intersections\. On freeways and expressways, the dominant conflict mechanisms are typically rear\-end interactions and lane\-change/sideswipe conflicts\[[21](https://arxiv.org/html/2606.27577#bib.bib21),[82](https://arxiv.org/html/2606.27577#bib.bib82)\]\. Empirical findings further suggest that lane\-changing behavior and its associated risk are strongly modulated by longitudinal spacing to the relevant preceding vehicle following the maneuver\[[81](https://arxiv.org/html/2606.27577#bib.bib81)\]\. These characteristics motivate a modeling approach that \(i\) can represent many concurrent interactions, \(ii\) can distinguish conflict mechanisms \(longitudinal vs\. lateral\), and \(iii\) can deliver frame\-resolved risk predictions suitable for continuous monitoring\.

To address these limitations, we develop a comprehensive*frame\-level*traffic risk assessment framework for highway scenes grounded in TTC and PET\. We organize our evaluation into three tiers of increasing architectural complexity:*\(1\) non\-graph baselines*that operate on frame\-level aggregates,*\(2\) GNN baselines without edge features*to isolate the contribution of relational structure, and*\(3\) a single\-stream GNN baseline \(HomoGAT\)*that incorporates SSM\-relevant information through unified message passing under multiple SSM threshold settings and their combinations\. Building on these baselines, we proposeHIA\-GAT, a heterogeneous interaction\-aware graph attention network that decomposes vehicle interactions into conflict\-type\-specific message\-passing streams—*longitudinal*and*lateral*—and fuses them via a gated mechanism informed by physics\-grounded edge features\. This design enables principled, frame\-level traffic risk classification that is aligned with the underlying mechanisms captured by TTC and PET, while remaining compatible with video\-derived multi\-agent trajectories\.

In summary, this paper makes the following contributions:

- •We formalize*frame\-level*highway risk assessment as a multi\-agent scene classification problem using TTC and PET to derive operational risk labels from trajectory data\.
- •We provide a structured benchmarking suite spanning non\-graph models, topology\-only GNNs, and a single\-stream attention\-based GNN baseline \(HomoGAT\) under multiple SSM threshold configurations and their combinations\.
- •We introduce HIA\-GAT, a heterogeneous interaction\-aware architecture with conflict\-type\-specific message passing and gated fusion using physics\-grounded edge features, enabling mechanism\-aligned and interpretable frame\-level risk prediction\.

## 2II Related Work

Traditional safety analysis relies on historical crash data and regression models\[[24](https://arxiv.org/html/2606.27577#bib.bib24)\], which are inherently reactive and cannot support real\-time assessment\. Surrogate safety measures \(SSMs\) such as TTC\[[38](https://arxiv.org/html/2606.27577#bib.bib38)\]and PET\[[34](https://arxiv.org/html/2606.27577#bib.bib34)\]enable proactive conflict detection\[[18](https://arxiv.org/html/2606.27577#bib.bib18)\], but typically depend on fixed thresholds\[[20](https://arxiv.org/html/2606.27577#bib.bib20)\]and do not scale to multi\-agent, frame\-level risk estimation\. Machine learning methods — including SVMs, Random Forests, and deep learning — improve predictive performance\[[44](https://arxiv.org/html/2606.27577#bib.bib44),[19](https://arxiv.org/html/2606.27577#bib.bib19)\], yet most target specific user groups or rely on handcrafted features, limiting their capacity to represent heterogeneous interactions\.

Graph neural networks address this limitation by explicitly modeling relational structure among traffic agents\. Recent work has applied heterogeneous GNNs to intersection safety, including multi\-relational GCNs for collision probability estimation\[[22](https://arxiv.org/html/2606.27577#bib.bib22)\], spatiotemporal GCNs with dynamic heterogeneous graphs\[[87](https://arxiv.org/html/2606.27577#bib.bib87)\], and graph\-based frameworks with distinct node types for pedestrians and vehicles\[[88](https://arxiv.org/html/2606.27577#bib.bib88)\]\. However, these efforts focus exclusively on intersections and pedestrian scenarios\. On freeways, lane\-changing and merging are the primary conflict contributors, and combining TTC with PET improves crash estimation over either measure alone\[[21](https://arxiv.org/html/2606.27577#bib.bib21)\], yet existing approaches analyze individual conflict pairs in isolation rather than jointly modeling the full multi\-agent scene\[[81](https://arxiv.org/html/2606.27577#bib.bib81)\]\. Our work addresses these gaps by unifying SSM\-based conflict labeling with heterogeneous graph modeling for frame\-level freeway risk assessment\.

## 3III Methodology

### 3\.1Problem Formulation

This study considers*real\-time*traffic conflict risk prediction on freeway segments using high\-resolution vehicle trajectory data\. The raw data consist of kinematic observations recorded at discrete time steps \(frames\)\. The learning objective is to assign each frame a binary label—*risky*or*safe*—based on whether any surrogate safety measure \(SSM\) violation occurs among the vehicles observed in that frame\. Letℱ=\{f1,f2,…,fT\}\\mathcal\{F\}=\\\{f\_\{1\},f\_\{2\},\\ldots,f\_\{T\}\\\}denote the ordered set of frames\. At time steptt, frameftf\_\{t\}contains a set of active vehicles𝒱t=\{v1,v2,…,vNt\}\\mathcal\{V\}\_\{t\}=\\\{v\_\{1\},v\_\{2\},\\ldots,v\_\{N\_\{t\}\}\\\}with state attributes such as position, velocity, acceleration, lane assignment, and headway\. We aim to learn a parameterized predictory^t=gθ\(𝒱t,ℰt\),\\hat\{y\}\_\{t\}=g\_\{\\theta\}\(\\mathcal\{V\}\_\{t\},\\mathcal\{E\}\_\{t\}\),which maps the vehicles and their interactions to a binary labelyt∈\{0,1\}y\_\{t\}\\in\\\{0,1\\\}, whereyt=1y\_\{t\}=1indicates a risky frame andyt=0y\_\{t\}=0indicates a safe frame\. Here,ℰt\\mathcal\{E\}\_\{t\}represents the set of pairwise interaction relations among vehicles at timett\. We cast the problem as*graph\-level*binary classification\. Each frame is represented as a directed, relation\-aware graphGt=\(𝒱t,ℰt,𝐗t,𝐄t\),G\_\{t\}=\(\\mathcal\{V\}\_\{t\},\\mathcal\{E\}\_\{t\},\\mathbf\{X\}\_\{t\},\\mathbf\{E\}\_\{t\}\),where𝐗t∈ℝNt×d0\\mathbf\{X\}\_\{t\}\\in\\mathbb\{R\}^\{N\_\{t\}\\times d\_\{0\}\}is the node feature matrix and𝐄t\\mathbf\{E\}\_\{t\}denotes edge attributes encoding interaction dynamics\. The ground\-truth labelyty\_\{t\}is derived from SSMs computed directly from the trajectories\.

### 3\.2Surrogate Safety Measures for Risk Labeling

SSMs provide a quantitative framework for assessing traffic conflict severity without relying on observed crash records, which are inherently rare and spatially sparse on freeways\. In this study, we employ two complementary SSMs that capture the two predominant conflict types on freeway facilities: rear\-end conflicts arising from longitudinal closing interactions, and sideswipe conflicts arising from lateral lane\-change maneuvers\.

#### 3\.2\.1Time\-to\-Collision \(TTC\)

Time\-to\-Collision \(TTC\) measures the time remaining to a potential rear\-end conflict between a same\-lane followeriiand leaderjj\. For actively closing pairs with a positive bumper\-to\-bumper gap, TTC is computed asTTCij\(t\)=dij\(t\)/Δvij\(t\)\\text\{TTC\}\_\{ij\}\(t\)=d\_\{ij\}\(t\)/\\Delta v\_\{ij\}\(t\), whereΔvij\(t\)=vi\(t\)−vj\(t\)\>vmin\\Delta v\_\{ij\}\(t\)=v\_\{i\}\(t\)\-v\_\{j\}\(t\)\>v\_\{\\min\},dij\(t\)d\_\{ij\}\(t\)is the space headway minus the leader length, andvmin=0\.5v\_\{\\min\}=0\.5ft/s excludes quasi\-stationary pairs\. TTC is evaluated only whendij\(t\)\>0d\_\{ij\}\(t\)\>0\.

#### 3\.2\.2Post\-Encroachment Time \(PET\)

Post\-Encroachment Time captures the temporal margin between successive occupancies of a shared road space during lane\-change maneuvers\. Unlike TTC, which measures an ongoing longitudinal closing interaction, PET measures the temporal gap at a spatial conflict point that has already been traversed by one vehicle and subsequently entered by another \(See Figure[1](https://arxiv.org/html/2606.27577#S3.F1)\)\.

![Refer to caption](https://arxiv.org/html/2606.27577v1/pet.png)Figure 1:Illustration of PET computation on a freeway segment\.Top:Temporal sequence showing the invasion line crossed by the preceding vehicle at timeti−1t\_\{i\-1\}and the lane\-changing vehicle at timetit\_\{i\}; PET is the elapsed time between these two occupancy events\.Bottom:Spatial configuration of the conflict: vehicleFtF\_\{t\}in Lane 1 and vehicleMMperforming a lane change into Lane 2, with the invasion line marking the shared road space where longitudinal overlap creates a potential sideswipe conflict point\.We compute PET using a cell\-based spatial occupancy method\. The longitudinal extent of the freeway is discretized into cells of widthΔs\\Delta s\(set to 5 ft\)\. For each vehicle at each time step, the set of occupied cells is determined from the vehicle’s front and rear positions\. Upon detection of a sustained lane change — defined as a vehicle transitioning to an adjacent lane and maintaining the new lane assignment for at leastτsustain\\tau\_\{\\text\{sustain\}\}consecutive frames \(set to 1\.0 s\) — the algorithm identifies the most recent prior occupant of each cell in the target lane\. The PET for the conflict pair is the minimum temporal gap across all shared cellsPETij\(t\)=tentry,i−texit,jFR\\text\{PET\}\_\{ij\}\(t\)=\\frac\{t\_\{\\text\{entry\},i\}\-t\_\{\\text\{exit\},j\}\}\{\\text\{FR\}\}, wheretentry,it\_\{\\text\{entry\},i\}is the frame at which the lane\-changing vehicleiienters the target\-lane cell,texit,jt\_\{\\text\{exit\},j\}is the last frame at which the previously occupying vehiclejjwas present in that cell, and FR is the frame rate \(10 Hz\)\. A minimum PET floor of 0\.2 s is applied to exclude physically implausible detections arising from sensor noise\.

#### 3\.2\.3Threshold\-Based Risk Labeling

To evaluate model performance across varying levels of conflict severity, we define nine threshold configurations spanning three categories\. For rear\-end interactions, we consider TTC thresholds of0\.5s0\.5\\,\\mathrm\{s\},1\.0s1\.0\\,\\mathrm\{s\}, and1\.5s1\.5\\,\\mathrm\{s\}, corresponding to imminent, critical, and precautionary conflict regimes, respectively\. For lane\-change interactions, we consider PET thresholds of1\.0s1\.0\\,\\mathrm\{s\},1\.5s1\.5\\,\\mathrm\{s\}, and2\.0s2\.0\\,\\mathrm\{s\}to represent close, moderate, and marginal sideswipe conflicts\. Finally, we evaluate combined configurations using joint TTC∣\\midPET criteria—namely,\(TTC<0\.5s∣PET<1\.0s\)\(\\mathrm\{TTC\}<0\.5\\,\\mathrm\{s\}\\ \\mid\\ \\mathrm\{PET\}<1\.0\\,\\mathrm\{s\}\),\(TTC<1\.0s∣PET<1\.5s\)\(\\mathrm\{TTC\}<1\.0\\,\\mathrm\{s\}\\ \\mid\\ \\mathrm\{PET\}<1\.5\\,\\mathrm\{s\}\), and\(TTC<1\.5s∣PET<2\.0s\)\(\\mathrm\{TTC\}<1\.5\\,\\mathrm\{s\}\\ \\mid\\ \\mathrm\{PET\}<2\.0\\,\\mathrm\{s\}\)—where a frame is labeled risky if*either*condition is satisfied\. This multi\-threshold framework enables a systematic robustness analysis across heterogeneous conflict mechanisms and risk severities, ranging from highly imbalanced settings \(e\.g\.,TTC<0\.5s\\mathrm\{TTC\}<0\.5\\,\\mathrm\{s\}with approximately55–14%14\\%risk prevalence\) to substantially more balanced label distributions \(e\.g\.,TTC<1\.5s∣PET<2\.0s\\mathrm\{TTC\}<1\.5\\,\\mathrm\{s\}\\mid\\mathrm\{PET\}<2\.0\\,\\mathrm\{s\}with approximately3030–58%58\\%prevalence\)\. For each frameftf\_\{t\}, the binary label is:

yt=\{1if∃\(i,j\):TTCij\(t\)<τTTCorPETij\(t\)<τPET0otherwisey\_\{t\}=\\begin\{cases\}1&\\text\{if \}\\exists\\,\(i,j\):\\text\{TTC\}\_\{ij\}\(t\)<\\tau\_\{\\text\{TTC\}\}\\;\\text\{or\}\\;\\text\{PET\}\_\{ij\}\(t\)<\\tau\_\{\\text\{PET\}\}\\\\ 0&\\text\{otherwise\}\\end\{cases\}\(1\)whereτTTC\\tau\_\{\\text\{TTC\}\}andτPET\\tau\_\{\\text\{PET\}\}are the respective thresholds for the configuration under evaluation\.

### 3\.3Graph Construction from Trajectory Data

Each traffic frame is transformed into a heterogeneous graph that explicitly encodes the two distinct modes of vehicle interaction observed on freeways: longitudinal \(same\-lane\) interactions and lateral \(adjacent\-lane\) interactions\.

#### 3\.3\.1Node Representation

Each vehicleviv\_\{i\}present in frameftf\_\{t\}is represented as a node with add\-dimensional feature vector𝐱i∈ℝ10\\mathbf\{x\}\_\{i\}\\in\\mathbb\{R\}^\{10\}comprising:

𝐱i=\[xi,yi,vi,ai,ℓi,si,hi,Li,x˙i,λi\]\\mathbf\{x\}\_\{i\}=\[\\,x\_\{i\},\\;y\_\{i\},\\;v\_\{i\},\\;a\_\{i\},\\;\\ell\_\{i\},\\;s\_\{i\},\\;h\_\{i\},\\;L\_\{i\},\\;\\dot\{x\}\_\{i\},\\;\\lambda\_\{i\}\\,\]\(2\)wherexix\_\{i\}andyiy\_\{i\}are the lateral and longitudinal positions,viv\_\{i\}is the longitudinal velocity,aia\_\{i\}is the longitudinal acceleration,ℓi\\ell\_\{i\}is the lane assignment,sis\_\{i\}is the space headway to the preceding vehicle,hih\_\{i\}is the time headway,LiL\_\{i\}is the vehicle length,x˙i\\dot\{x\}\_\{i\}is the lateral velocity \(computed as the finite difference of lateral position\), andλi\\lambda\_\{i\}is a lane\-change indicator flag that equals 1 if the vehicle has changed lanes within the preceding 1\.0 s window \(10 frames\)\. All features are standardized using robust z\-score normalization with percentile\-clipped statistics \(1st and 99th percentiles\) to mitigate the influence of outliers\. For frames with more than 200 vehicles, a random subset of 200 nodes is sampled to maintain computational tractability\.

#### 3\.3\.2Edge Construction and Classification

Edges are established between all vehicle pairs within a spatial proximity radiusr=100r=100ft\. Each edge is classified into one of two types based on the lane relationship between the connected vehicles\.Longitudinal edgesℰlong\\mathcal\{E\}^\{\\text\{long\}\}: connect vehicle pairs occupying the same lane \(\|ℓi−ℓj\|<0\.5\|\\ell\_\{i\}\-\\ell\_\{j\}\|<0\.5\)\. These edges capture car\-following dynamics relevant to rear\-end conflict assessment\.Lateral edgesℰlat\\mathcal\{E\}^\{\\text\{lat\}\}: connect vehicle pairs in adjacent lanes \(0\.5<\|ℓi−ℓj\|<1\.50\.5<\|\\ell\_\{i\}\-\\ell\_\{j\}\|<1\.5\)\. These edges encode cross\-lane spatial relationships relevant to lane\-change conflict assessment\.

#### 3\.3\.3Physics\-Informed Edge Features

Each edge type carries a distinct set of three physics\-informed features designed to encode the interaction dynamics most relevant to its associated conflict type\. For longitudinal edges\(i,j\)∈ℰlong\(i,j\)\\in\\mathcal\{E\}^\{\\text\{long\}\}, the edge feature vector is𝐞ijlong=\[Δvij30,dijr,Δaij20\]\\mathbf\{e\}^\{\\text\{long\}\}\_\{ij\}=\\left\[\\,\\frac\{\\Delta v\_\{ij\}\}\{30\},\\;\\;\\frac\{d\_\{ij\}\}\{r\},\\;\\;\\frac\{\\Delta a\_\{ij\}\}\{20\}\\,\\right\], whereΔvij=vi−vj\\Delta v\_\{ij\}=v\_\{i\}\-v\_\{j\}is the closing rate \(directly related to TTC\),dijd\_\{ij\}is the Euclidean distance normalized by the edge radius, andΔaij=ai−aj\\Delta a\_\{ij\}=a\_\{i\}\-a\_\{j\}is the acceleration differential indicating whether the closing rate is increasing or decreasing\. For lateral edges\(i,j\)∈ℰlat\(i,j\)\\in\\mathcal\{E\}^\{\\text\{lat\}\}, the edge feature vector is𝐞ijlat=\[x˙i5,λi,oij15\]\\mathbf\{e\}^\{\\text\{lat\}\}\_\{ij\}=\\left\[\\,\\frac\{\\dot\{x\}\_\{i\}\}\{5\},\\;\\;\\lambda\_\{i\},\\;\\;\\frac\{o\_\{ij\}\}\{15\}\\,\\right\], wherex˙i\\dot\{x\}\_\{i\}is the lateral velocity of the source vehicle,λi\\lambda\_\{i\}is its lane\-change flag, andoijo\_\{ij\}is the longitudinal overlap between the two vehicles computed asoij=max⁡\(0,min⁡\(yifront,yjfront\)−max⁡\(yirear,yjrear\)\)o\_\{ij\}=\\max\(0,\\min\(y\_\{i\}^\{\\text\{front\}\},y\_\{j\}^\{\\text\{front\}\}\)\-\\max\(y\_\{i\}^\{\\text\{rear\}\},y\_\{j\}^\{\\text\{rear\}\}\)\)\. The overlap feature captures the spatial proximity condition that makes sideswipe conflicts physically possible — a lane change is only dangerous if the vehicles share longitudinal road space\.

This edge\-type separation is a deliberate design choice\. The closing rateΔvij\\Delta v\_\{ij\}is the critical feature for predicting TTC\-based conflicts, while the lateral velocity and overlap are critical for predicting PET\-based conflicts\. By providing each interaction type through its own dedicated edge channel, the model receives physics\-aligned information that reflects the distinct mechanisms underlying rear\-end and sideswipe conflicts\.

### 3\.4Proposed Architecture: HIA\-GAT

We propose the Heterogeneous Interaction\-Aware Graph Attention Network \(HIA\-GAT\), a dual\-stream graph neural network architecture that processes longitudinal and lateral vehicle interactions through dedicated pathways and fuses them via a learned, conflict\-type\-aware gating mechanism\. Figure[2](https://arxiv.org/html/2606.27577#S3.F2)presents the overall architecture\.

![Refer to caption](https://arxiv.org/html/2606.27577v1/arch-long.png)Figure 2:Overall architecture of the proposed HIA\-GAT model#### 3\.4\.1Stream\-Specific Input Projections

To enable each stream to develop specialized representations, the shared node features𝐱i\\mathbf\{x\}\_\{i\}are first transformed through stream\-specific input projections𝐱ilong=ELU\(𝐖long𝐱i\+𝐛long\),𝐱ilat=ELU\(𝐖lat𝐱i\+𝐛lat\)\\mathbf\{x\}\_\{i\}^\{\\text\{long\}\}=\\text\{ELU\}\(\\mathbf\{W\}^\{\\text\{long\}\}\\mathbf\{x\}\_\{i\}\+\\mathbf\{b\}^\{\\text\{long\}\}\),\\quad\\mathbf\{x\}\_\{i\}^\{\\text\{lat\}\}=\\text\{ELU\}\(\\mathbf\{W\}^\{\\text\{lat\}\}\\mathbf\{x\}\_\{i\}\+\\mathbf\{b\}^\{\\text\{lat\}\}\), where𝐖long,𝐖lat∈ℝd×d\\mathbf\{W\}^\{\\text\{long\}\},\\mathbf\{W\}^\{\\text\{lat\}\}\\in\\mathbb\{R\}^\{d\\times d\}are learnable weight matrices\. These projections allow the longitudinal stream to emphasize features relevant to car\-following dynamics \(e\.g\., velocity, headway\) while the lateral stream can attend more to lane\-change indicators \(e\.g\., lateral velocity, lane\-change flag\)\.

#### 3\.4\.2Dual\-Stream Graph Attention Layers

Each stream employs a two\-layer Graph Attention Network \(GAT\) that operates exclusively on its corresponding edge type\. The longitudinal stream processes messages along same\-lane edgesℰlong\\mathcal\{E\}^\{\\text\{long\}\}with longitudinal edge features𝐞ijlong\\mathbf\{e\}^\{\\text\{long\}\}\_\{ij\}, while the lateral stream processes messages along adjacent\-lane edgesℰlat\\mathcal\{E\}^\{\\text\{lat\}\}with lateral edge features𝐞ijlat\\mathbf\{e\}^\{\\text\{lat\}\}\_\{ij\}\. For the longitudinal stream, the first GAT layer with multi\-head attention computesαijk=exp⁡\(LeakyReLU\(𝐚k⊤\[𝐖k𝐱ilong‖𝐖k𝐱jlong‖𝐖ek𝐞ijlong\]\)\)∑m∈𝒩ilongexp⁡\(LeakyReLU\(𝐚k⊤\[𝐖k𝐱ilong‖𝐖k𝐱mlong‖𝐖ek𝐞imlong\]\)\)\\alpha\_\{ij\}^\{k\}=\\frac\{\\exp\\left\(\\text\{LeakyReLU\}\\left\(\\mathbf\{a\}^\{k\\top\}\[\\mathbf\{W\}^\{k\}\\mathbf\{x\}\_\{i\}^\{\\text\{long\}\}\\\|\\mathbf\{W\}^\{k\}\\mathbf\{x\}\_\{j\}^\{\\text\{long\}\}\\\|\\mathbf\{W\}^\{k\}\_\{e\}\\mathbf\{e\}\_\{ij\}^\{\\text\{long\}\}\]\\right\)\\right\)\}\{\\sum\_\{m\\in\\mathcal\{N\}^\{\\text\{long\}\}\_\{i\}\}\\exp\\left\(\\text\{LeakyReLU\}\\left\(\\mathbf\{a\}^\{k\\top\}\[\\mathbf\{W\}^\{k\}\\mathbf\{x\}\_\{i\}^\{\\text\{long\}\}\\\|\\mathbf\{W\}^\{k\}\\mathbf\{x\}\_\{m\}^\{\\text\{long\}\}\\\|\\mathbf\{W\}^\{k\}\_\{e\}\\mathbf\{e\}\_\{im\}^\{\\text\{long\}\}\]\\right\)\\right\)\}, and𝐡ilong,\(1\)=ELU\(BN\(∥k=1K∑j∈𝒩ilongαijk𝐖k𝐱jlong\)\)\\mathbf\{h\}\_\{i\}^\{\\text\{long\},\(1\)\}=\\text\{ELU\}\\left\(\\text\{BN\}\\left\(\\Big\\\|\_\{k=1\}^\{K\}\\sum\_\{j\\in\\mathcal\{N\}^\{\\text\{long\}\}\_\{i\}\}\\alpha\_\{ij\}^\{k\}\\mathbf\{W\}^\{k\}\\mathbf\{x\}\_\{j\}^\{\\text\{long\}\}\\right\)\\right\)\. whereKKis the number of attention heads \(set to 4\),∥\\\|denotes concatenation,𝒩ilong\\mathcal\{N\}^\{\\text\{long\}\}\_\{i\}is the set of same\-lane neighbors of vehicleii, and BN denotes batch normalization\. The edge features𝐞ijlong\\mathbf\{e\}\_\{ij\}^\{\\text\{long\}\}are incorporated into the attention coefficient computation, enabling the model to modulate message strength based on the physical interaction dynamics \(e\.g\., assigning higher attention to vehicle pairs with large closing rates\)\.

The second GAT layer uses single\-head attention to produce the final per\-node longitudinal embedding𝐡ilong∈ℝH\\mathbf\{h\}\_\{i\}^\{\\text\{long\}\}\\in\\mathbb\{R\}^\{H\}, whereHHis the hidden dimension \(set to 64\)\. The lateral stream follows an identical architecture but operates on lateral edges and edge features, producing𝐡ilat∈ℝH\\mathbf\{h\}\_\{i\}^\{\\text\{lat\}\}\\in\\mathbb\{R\}^\{H\}\.

#### 3\.4\.3Conflict\-Type\-Aware Gated Fusion

The central architectural innovation of HIA\-GAT is a per\-node gating mechanism that dynamically determines the relative contribution of the longitudinal and lateral streams based on each vehicle’s involvement in different conflict types\. For each nodeii, the gate value is computed from the concatenated stream embeddings𝐠i=σ\(𝐖g\[𝐡ilong∥𝐡ilat\]\+𝐛g\)∈\(0,1\)H\\mathbf\{g\}\_\{i\}=\\sigma\\left\(\\mathbf\{W\}\_\{g\}\[\\mathbf\{h\}\_\{i\}^\{\\text\{long\}\}\\\|\\mathbf\{h\}\_\{i\}^\{\\text\{lat\}\}\]\+\\mathbf\{b\}\_\{g\}\\right\)\\in\(0,1\)^\{H\}, whereσ\\sigmais the sigmoid activation and𝐖g∈ℝH×2H\\mathbf\{W\}\_\{g\}\\in\\mathbb\{R\}^\{H\\times 2H\}is a learnable weight matrix\. The fused node embedding is then𝐡i=𝐠i⊙𝐡ilong\+\(1−𝐠i\)⊙𝐡ilat\\mathbf\{h\}\_\{i\}=\\mathbf\{g\}\_\{i\}\\odot\\mathbf\{h\}\_\{i\}^\{\\text\{long\}\}\+\(1\-\\mathbf\{g\}\_\{i\}\)\\odot\\mathbf\{h\}\_\{i\}^\{\\text\{lat\}\}, where⊙\\odotdenotes element\-wise multiplication\. A gate value close to 1\.0 routes the representation through the longitudinal stream, while a value close to 0\.0 routes it through the lateral stream\. Vehicles involved in both conflict types receive intermediate gate values\.

#### 3\.4\.4Event\-Level Gate Supervision

A key challenge in training gated fusion architectures is ensuring that the gate learns to differentiate between conflict types rather than collapsing to a trivial solution\. We address this through an event\-level supervision mechanism that leverages the conflict\-type information inherently available from the SSM computation\.

During the risk labeling process \(Section[3\.2](https://arxiv.org/html/2606.27577#S3.SS2)\), we identify not only the frame\-level risk label but also the specific vehicles participating in each detected conflict event\. We leverage this event attribution to define per\-node gate supervision targetsgi∗g\_\{i\}^\{\*\}, assigninggi∗=1\.0g\_\{i\}^\{\*\}=1\.0when vehicleiiparticipates in a TTC\-based conflict pair,gi∗=0\.0g\_\{i\}^\{\*\}=0\.0when it participates in a PET\-based conflict pair, andgi∗=0\.5g\_\{i\}^\{\*\}=0\.5when it is simultaneously implicated in both TTC and PET conflicts; conversely, if vehicleiiis not involved in any conflict event, its gate target is masked and excluded from supervision\.

The gate supervision loss is computed as the mean squared error between the predicted gate values and the targets, applied only to supervised nodesℒgate=1\|ℳ\|∑i∈ℳ‖𝐠¯i−gi∗‖2\\mathcal\{L\}\_\{\\text\{gate\}\}=\\frac\{1\}\{\|\\mathcal\{M\}\|\}\\sum\_\{i\\in\\mathcal\{M\}\}\\\|\\bar\{\\mathbf\{g\}\}\_\{i\}\-g\_\{i\}^\{\*\}\\\|^\{2\}, whereℳ\\mathcal\{M\}is the set of supervised nodes \(those involved in at least one conflict event\) and𝐠¯i\\bar\{\\mathbf\{g\}\}\_\{i\}is the mean gate value across hidden dimensions for nodeii\. Critically, only a small fraction of nodes receive supervision — typically 0\.1–1\.5% of all nodes in a frame — yet the gate generalizes to unsupervised nodes through the shared gate projection layer, which learns to map node representations to conflict\-type likelihood\.

#### 3\.4\.5Graph\-Level Readout and Classification

The fused per\-node embeddings are aggregated into a single graph\-level representation using attention\-weighted global pooling:𝐳t=∑i∈𝒱tβi𝐡i,whereβi=exp⁡\(MLPatt\(𝐡i\)\)∑j∈𝒱texp⁡\(MLPatt\(𝐡j\)\)\\mathbf\{z\}\_\{t\}=\\sum\_\{i\\in\\mathcal\{V\}\_\{t\}\}\\beta\_\{i\}\\,\\mathbf\{h\}\_\{i\},\\quad\\text\{where\}\\quad\\beta\_\{i\}=\\frac\{\\exp\\left\(\\text\{MLP\}\_\{\\text\{att\}\}\(\\mathbf\{h\}\_\{i\}\)\\right\)\}\{\\sum\_\{j\\in\\mathcal\{V\}\_\{t\}\}\\exp\\left\(\\text\{MLP\}\_\{\\text\{att\}\}\(\\mathbf\{h\}\_\{j\}\)\\right\)\}, whereMLPatt\\text\{MLP\}\_\{\\text\{att\}\}is a two\-layer neural network that computes the attention score for each node\. This attention pooling mechanism allows the model to focus on the most informative vehicles \(typically those involved in conflict events\) when producing the graph\-level representation, rather than treating all vehicles equally\.

The graph embedding𝐳t∈ℝH\\mathbf\{z\}\_\{t\}\\in\\mathbb\{R\}^\{H\}is passed through a three\-layer MLP classifiery^t=MLPcls\(𝐳t\)=𝐖3⋅ReLU\(𝐖2⋅ReLU\(𝐖1𝐳t\)\)\\hat\{y\}\_\{t\}=\\text\{MLP\}\_\{\\text\{cls\}\}\(\\mathbf\{z\}\_\{t\}\)=\\mathbf\{W\}\_\{3\}\\cdot\\text\{ReLU\}\(\\mathbf\{W\}\_\{2\}\\cdot\\text\{ReLU\}\(\\mathbf\{W\}\_\{1\}\\mathbf\{z\}\_\{t\}\)\), with hidden dimensionsH→H→H/2→1H\\to H\\to H/2\\to 1and dropout regularization between layers\. The output logit is transformed to a probability via the sigmoid function\.

#### 3\.4\.6Training Objective

The total training loss combines the frame\-level binary cross\-entropy classification loss with the gate supervision lossℒ=ℒBCE\+αℒgate\\mathcal\{L\}=\\mathcal\{L\}\_\{\\text\{BCE\}\}\+\\alpha\\,\\mathcal\{L\}\_\{\\text\{gate\}\}, whereℒBCE=−1N∑t\[ytlog⁡\(p^t\)\+\(1−yt\)log⁡\(1−p^t\)\]\\mathcal\{L\}\_\{\\text\{BCE\}\}=\-\\frac\{1\}\{N\}\\sum\_\{t\}\\left\[y\_\{t\}\\log\(\\hat\{p\}\_\{t\}\)\+\(1\-y\_\{t\}\)\\log\(1\-\\hat\{p\}\_\{t\}\)\\right\]is the weighted binary cross\-entropy loss with inverse class\-frequency weighting to address class imbalance, andα\\alphais the gate supervision weight \(set to 0\.5\)\. The model is trained using the Adam optimizer with a learning rate of3×10−43\\times 10^\{\-4\}, weight decay of10−410^\{\-4\}, gradient clipping at norm 1\.0, learning rate warmup over the first 5 epochs, and ReduceLROnPlateau scheduling\. Early stopping with a patience of 15 epochs is applied based on validation loss\.

### 3\.5Baseline Methods

To comprehensively evaluate the proposed HIA\-GAT, we compare against seven baseline methods organized in three tiers of increasing architectural complexity:

Non\-graph baselinesoperate on frame\-level aggregate feature vectors \(70 dimensions\) constructed by computing statistical summaries \(mean, standard deviation, minimum, maximum\) of the 10 node features across all vehicles in the frame, supplemented with graph structural statistics \(node count, edge type ratios\) and edge feature statistics \(mean, standard deviation, minimum, maximum of the 3\-dimensional longitudinal and lateral edge features\)\. These baselines include Logistic Regression, Random Forest \(200 trees\), XGBoost \(200 estimators\), and a multi\-layer perceptron \(MLP\) with two hidden layers\.

GNN baselines without edge featuresinclude GCN and GraphSAGE, which operate on the homogeneous graph \(all edges combined\) using only node features\. These baselines quantify the contribution of graph topology alone, independent of physics\-informed edge attributes\.

Single\-stream GNN baseline \(HomoGAT\)uses the same GAT architecture as each individual stream in HIA\-GAT, operating on the combined homogeneous edge set with 3\-dimensional edge features \(closing rate, distance, lane difference\)\. This baseline isolates the contribution of the dual\-stream decomposition and gated fusion mechanism by providing edge features to a single unified attention network\.

All GNN models share identical hyperparameters \(hidden dimension 64, 4 attention heads, dropout 0\.3, learning rate3×10−43\\times 10^\{\-4\}, patience 15\) and use the same data splits \(70/15/15 train/validation/test with stratified sampling and fixed random seed\) to ensure fair comparison\. Performance is evaluated using both threshold\-optimized F1 score \(F1opt\{\}\_\{\\text\{opt\}\}, where the classification threshold is selected to maximize F1 on the validation set\) and Area Under the ROC Curve \(AUC\), which provides a threshold\-independent measure of discriminative ability\.

## 4IV Experiments

### 4\.1Dataset and Data Labeling

We use NGSIM vehicle trajectories\[[85](https://arxiv.org/html/2606.27577#bib.bib85),[86](https://arxiv.org/html/2606.27577#bib.bib86)\]recorded at 10 Hz on two California freeway segments: I\-80 in Emeryville, with 45 minutes of PM peak congestion, and US\-101 in Los Angeles, with 45 minutes of AM peak traffic in a weaving section\. Each record includes vehicle position, velocity, acceleration, lane, dimensions, and headway\. Preprocessing disambiguated overlapping Frame\_ID sequences, removed duplicate boundary records, and added two derived features: lateral velocity and a lane\-change flag indicating a sustained lane change within the previous 1\.0 s\. The final I\-80 and US\-101 datasets contain 4,564,923 and 4,098,933 vehicle\-frame records over 29,679 and 28,156 unique frames, respectively\. Frame\-level risk labels are generated for the nine threshold configurations in Section[3\.2\.3](https://arxiv.org/html/2606.27577#S3.SS2.SSS3)\. Experiments were run on an NVIDIA A100\-SXM4\-80GB GPU using PyTorch Geometric\.

Table 1:Performance comparison on the I\-80 and US\-101 datasets\. Best AUC per configuration is bolded\. %R denotes risk prevalence\. Each cell reports F1 score \(left\) and AUC \(right\)\. Shaded rows summarize category averages\.
### 4\.2Evaluation Metrics

We evaluate all models using two complementary metrics\. The primary metric is the Area Under the Receiver Operating Characteristic Curve \(AUC\), which measures the model’s ability to rank frames by risk severity independently of any classification threshold\. AUC is particularly suited to traffic conflict prediction because the operational goal is risk prioritization — identifying which frames warrant attention — rather than binary classification at a fixed threshold\. Moreover, AUC is robust to class imbalance, which varies substantially across our nine configurations \(from 3\.5% to 57\.5% risk prevalence\)\. As a secondary metric, we report the threshold\-optimized F1 score, where the classification threshold is selected to maximize F1 on the validation set\. F1 provides a measure of the best achievable classification performance but is sensitive to the probability distribution shape and class balance, making it less stable for cross\-configuration comparison\.

### 4\.3Overall Comparison

Table[3](https://arxiv.org/html/2606.27577#S4.T3)present the full comparison of all eight methods across the nine threshold configurations on the I\-80 and US\-101 datasets, respectively\. Two clear patterns emerge from the aggregate results\. First, the proposed HIA\-GAT achieves the highest average AUC on both datasets \(0\.835 on I\-80, 0\.867 on US\-101\), outperforming all non\-graph and graph baselines\. Second, Random Forest achieves the highest average F1 \(0\.537 on I\-80, 0\.486 on US\-101\)\. This divergence between AUC and F1 rankings reflects a fundamental distinction: AUC evaluates the full ranking of risk scores across all operating points, whereas F1 evaluates classification accuracy at a single best threshold\. HIA\-GAT produces well\-calibrated probability scores that capture nuanced risk gradations, while tree\-based ensembles produce sharper probability distributions that yield cleaner decision boundaries at optimal thresholds despite slightly inferior overall ranking\. Among the GNN baselines, a consistent architectural progression is observed\. GCN, which operates on graph topology alone without edge features, achieves the lowest GNN performance \(average AUC of 0\.778 on I\-80, 0\.796 on US\-101\)\. GraphSAGE improves upon GCN through its neighborhood sampling mechanism but similarly lacks edge feature integration\. HomoGAT, which incorporates 3\-dimensional physics\-informed edge features into the attention computation, yields a substantial improvement over GCN \(\+0\.032 AUC on I\-80, \+0\.060 on US\-101\), demonstrating that edge features encoding pairwise vehicle dynamics are critical for risk prediction\. The proposed HIA\-GAT extends this further with its dual\-stream architecture and event\-level gate supervision, achieving \+0\.025 AUC over HomoGAT on I\-80 and \+0\.011 on US\-101, and winning 7 of 9 configurations on both datasets by AUC\.

### 4\.4Conflict\-Type Stratified Analysis

The nine threshold configurations can be partitioned into three categories — TTC\-only \(rear\-end conflicts\), PET\-only \(lane\-change conflicts\), and combined — revealing distinct patterns in model suitability\. On TTC\-only configurations, non\-graph models perform strongly due to the nature of rear\-end conflict detection\. TTC is fundamentally determined by the closing rate and gap distance between a following vehicle and its leader — quantities that are well captured by aggregate frame\-level statistics such as the mean and maximum closing rate across all vehicle pairs\. Random Forest achieves average TTC AUC of 0\.901 on I\-80 and 0\.940 on US\-101\. GNN models remain competitive \(HIA\-GAT: 0\.868 and 0\.858\) but the additional representational capacity of graph\-level reasoning provides diminishing returns when the risk signal is already well\-encoded in simple summary statistics\. On PET\-only configurations, the pattern reverses decisively\. Non\-graph methods degrade substantially, with RF and XGBoost achieving average PET AUC of only 0\.715 and 0\.689 on I\-80, respectively \(Table[3](https://arxiv.org/html/2606.27577#S4.T3)\)\. In contrast, all GNN models outperform all non\-graph baselines, with HIA\-GAT and GraphSAGE achieving 0\.805 and 0\.810 on I\-80, and 0\.873 and 0\.865 on US\-101\. This gap — approximately 0\.10 AUC separating graph from non\-graph methods — provides strong evidence that graph structure is essential for lane\-change conflict detection\. PET conflicts are inherently relational events: they arise from the spatial interaction between a lane\-changing vehicle and an adjacent\-lane occupant sharing longitudinal road space\. Aggregate statistics collapse this pairwise structure, losing the specific vehicle pair information that makes PET detection possible\. On combined configurations \(TTC\|PET\), the results reflect an intermediate regime\. RF leads overall due to its strong TTC performance, but HIA\-GAT achieves the largest improvement over HomoGAT in this category \(\+0\.034 AUC on I\-80, \+0\.028 on US\-101\), suggesting that the dual\-stream gating mechanism provides its greatest value when both conflict types coexist within the same frame and must be differentiated\. The exception isTTC<0\.5\\mathrm\{TTC\}<0\.5on US\-101 \(5\.6% prevalence\), where HIA\-GAT underperforms on both F1 and AUC due to training instability on this extremely sparse configuration — the dual\-stream architecture splits an already limited set of positive examples across two streams, reducing the effective supervision signal per stream below the threshold needed for stable attention learning\.

### 4\.5Gate Supervision Analysis

A central design feature of HIA\-GAT is the conflict\-type aware gating mechanism\. Table[4](https://arxiv.org/html/2606.27577#S4.T4)summarizes the gate behavior across all nine configurations on both datasets\. We evaluate whether the gate successfully learns to differentiate between longitudinal and lateral conflicts through its supervision signal\. On single\-type configurations, the gate achieves near\-perfect conflict type routing\. For TTC\-only configurations, the average gate value across all nodes converges above 0\.5 \(0\.69\-0\.77 on I\-80, 0\.58–0\.67 on US\-101\), indicating that the majority of the representation is routed through the longitudinal stream\. For PET\-only configurations, the gate converges below 0\.5 \(0\.43–0\.46 on I\-80, 0\.40–0\.43 on US\-101\), routing the representation predominantly through the lateral stream\. The gate direction accuracy – defined as the fraction of supervised nodes where the gate value correctly exceeds 0\.5 for TTC\-involved vehicles or falls below 0\.5 for PET\-involved vehicles — exceeds 99% on all single\-type configurations across both datasets\.

![Refer to caption](https://arxiv.org/html/2606.27577v1/gate33.png)Figure 3:Per\-vehicle gate values on a representative I\-80 test frame \(TTC<<1\.0 \| PET<<1\.5\)\. Blue vehicles \(g¯i→1\.0\\bar\{g\}\_\{i\}\\to 1\.0\) are routed through the longitudinal stream; red vehicles \(g¯i→0\.0\\bar\{g\}\_\{i\}\\to 0\.0\) through the lateral stream\. The gate correctly identifies the rear\-end conflict pair \(v3v\_\{3\}/v4v\_\{4\}\) and the lane\-change conflict pair \(v8v\_\{8\}/v11v\_\{11\}\), while safe vehicles remain near 0\.5\.On combined configurations, the gate exhibits substantial per\-node variation \(standard deviation 0\.19–0\.29\), indicating that different vehicles within the same frame are routed through different streams depending on their conflict involvement\. The aggregate gate mean shifts toward the TTC\-dominant direction \(0\.54–0\.70\) because TTC events are approximately 6–11 times more numerous than PET events, which reduces PET gate direction accuracy on the most imbalanced combined configurations\. Notably, only 0\.1–1\.5% of nodes in each frame receive gate supervision, yet the gate generalizes to all nodes through the shared projection layer — demonstrating that the model learns a mapping from node representations to conflict\-type likelihood rather than memorizing supervised examples\.

Table 2:Gate supervision analysis for HIA\-GAT\.g¯\\bar\{g\}: mean gate value across all nodes \(1\.0 = fully longitudinal, 0\.0 = fully lateral\)\.σg\\sigma\_\{g\}: standard deviation\. Dir\. Acc\.: fraction of supervised nodes with correct gate direction \(\>\>0\.5 for TTC,<<0\.5 for PET\)\.
### 4\.6Discussion

The most consequential finding of this study is the stark performance gap between graph\-based and non\-graph methods on PET\-only configurations\. While non\-graph baselines achieve competitive or superior performance on TTC detection, they consistently fail on PET detection, with AUC values in the 0\.69–0\.77 range compared to 0\.80–0\.87 for GNN models\. This finding has a clear physical interpretation\. Rear\-end conflicts are characterized by aggregate kinematic signatures — high mean closing rates, small minimum headways — that are preserved by statistical summaries of frame\-level features\. Lane\-change conflicts, by contrast, are inherently pairwise events defined by the spatial and temporal relationship between two specific vehicles: one changing lanes and one occupying the target space\. When these pairwise interactions are collapsed into frame\-level means and maxima, the critical relational information is lost\. This result suggests that graph\-based representations are not merely a modeling convenience for traffic risk prediction, but a necessary structural choice for comprehensive conflict detection that spans both longitudinal and lateral interaction types\.

The comparison between GCN/GraphSAGE and HomoGAT/HIA\-GAT isolates the value of physics\-informed edge features\. On I\-80 TTC configurations, HomoGAT improves over GCN by\+0\.030\+0\.030average AUC by explicitly encoding closing rate, gap distance, and acceleration differential—the key variables underlying TTC\. HIA\-GAT further extends this idea through conflict\-specific streams: the longitudinal stream uses closing rate and acceleration differential, while the lateral stream uses lateral velocity, lane\-change flag, and longitudinal overlap\. This prevents one attention function from learning two distinct conflict mechanisms simultaneously\. The gate also improves interpretability by indicating the dominant conflict type for each risky vehicle: values near 1\.0 correspond to TTC\-related rear\-end interactions, values near 0\.0 to PET\-related lane\-change conflicts, and safe vehicles remain near 0\.5, as shown in Figure[4](https://arxiv.org/html/2606.27577#S4.F4)\. This attribution is obtained from SSM\-derived labels with negligible added annotation or computational cost\.

### 4\.7Dataset and Data Labeling

We use NGSIM vehicle trajectories\[[85](https://arxiv.org/html/2606.27577#bib.bib85),[86](https://arxiv.org/html/2606.27577#bib.bib86)\]recorded at 10 Hz on two California freeway segments: I\-80 in Emeryville, with 45 minutes of PM peak congestion, and US\-101 in Los Angeles, with 45 minutes of AM peak traffic in a weaving section\. Each record includes vehicle position, velocity, acceleration, lane, dimensions, and headway\. Preprocessing disambiguated overlapping Frame\_ID sequences, removed duplicate boundary records, and added two derived features: lateral velocity and a lane\-change flag indicating a sustained lane change within the previous 1\.0 s\. The final I\-80 and US\-101 datasets contain 4,564,923 and 4,098,933 vehicle\-frame records over 29,679 and 28,156 unique frames, respectively\. Frame\-level risk labels are generated for the nine threshold configurations in Section[3\.2\.3](https://arxiv.org/html/2606.27577#S3.SS2.SSS3)\. Experiments were run on an NVIDIA A100\-SXM4\-80GB GPU using PyTorch Geometric\.

Table 3:Performance comparison on the I\-80 and US\-101 datasets\. Best AUC per configuration is bolded\. %R denotes risk prevalence\. Each cell reports F1 score \(left\) and AUC \(right\)\. Shaded rows summarize category averages\.
### 4\.8Evaluation Metrics

We evaluate all models using two complementary metrics\. The primary metric is the Area Under the Receiver Operating Characteristic Curve \(AUC\), which measures the model’s ability to rank frames by risk severity independently of any classification threshold\. AUC is particularly suited to traffic conflict prediction because the operational goal is risk prioritization — identifying which frames warrant attention — rather than binary classification at a fixed threshold\. Moreover, AUC is robust to class imbalance, which varies substantially across our nine configurations \(from 3\.5% to 57\.5% risk prevalence\)\. As a secondary metric, we report the threshold\-optimized F1 score, where the classification threshold is selected to maximize F1 on the validation set\. F1 provides a measure of the best achievable classification performance but is sensitive to the probability distribution shape and class balance, making it less stable for cross\-configuration comparison\.

### 4\.9Overall Comparison

Table[3](https://arxiv.org/html/2606.27577#S4.T3)present the full comparison of all eight methods across the nine threshold configurations on the I\-80 and US\-101 datasets, respectively\. Two clear patterns emerge from the aggregate results\. First, the proposed HIA\-GAT achieves the highest average AUC on both datasets \(0\.835 on I\-80, 0\.867 on US\-101\), outperforming all non\-graph and graph baselines\. Second, Random Forest achieves the highest average F1 \(0\.537 on I\-80, 0\.486 on US\-101\)\. This divergence between AUC and F1 rankings reflects a fundamental distinction: AUC evaluates the full ranking of risk scores across all operating points, whereas F1 evaluates classification accuracy at a single best threshold\. HIA\-GAT produces well\-calibrated probability scores that capture nuanced risk gradations, while tree\-based ensembles produce sharper probability distributions that yield cleaner decision boundaries at optimal thresholds despite slightly inferior overall ranking\. Among the GNN baselines, a consistent architectural progression is observed\. GCN, which operates on graph topology alone without edge features, achieves the lowest GNN performance \(average AUC of 0\.778 on I\-80, 0\.796 on US\-101\)\. GraphSAGE improves upon GCN through its neighborhood sampling mechanism but similarly lacks edge feature integration\. HomoGAT, which incorporates 3\-dimensional physics\-informed edge features into the attention computation, yields a substantial improvement over GCN \(\+0\.032 AUC on I\-80, \+0\.060 on US\-101\), demonstrating that edge features encoding pairwise vehicle dynamics are critical for risk prediction\. The proposed HIA\-GAT extends this further with its dual\-stream architecture and event\-level gate supervision, achieving \+0\.025 AUC over HomoGAT on I\-80 and \+0\.011 on US\-101, and winning 7 of 9 configurations on both datasets by AUC\.

### 4\.10Conflict\-Type Stratified Analysis

The nine threshold configurations can be partitioned into three categories — TTC\-only \(rear\-end conflicts\), PET\-only \(lane\-change conflicts\), and combined — revealing distinct patterns in model suitability\. On TTC\-only configurations, non\-graph models perform strongly due to the nature of rear\-end conflict detection\. TTC is fundamentally determined by the closing rate and gap distance between a following vehicle and its leader — quantities that are well captured by aggregate frame\-level statistics such as the mean and maximum closing rate across all vehicle pairs\. Random Forest achieves average TTC AUC of 0\.901 on I\-80 and 0\.940 on US\-101\. GNN models remain competitive \(HIA\-GAT: 0\.868 and 0\.858\) but the additional representational capacity of graph\-level reasoning provides diminishing returns when the risk signal is already well\-encoded in simple summary statistics\. On PET\-only configurations, the pattern reverses decisively\. Non\-graph methods degrade substantially, with RF and XGBoost achieving average PET AUC of only 0\.715 and 0\.689 on I\-80, respectively \(Table[3](https://arxiv.org/html/2606.27577#S4.T3)\)\. In contrast, all GNN models outperform all non\-graph baselines, with HIA\-GAT and GraphSAGE achieving 0\.805 and 0\.810 on I\-80, and 0\.873 and 0\.865 on US\-101\. This gap — approximately 0\.10 AUC separating graph from non\-graph methods — provides strong evidence that graph structure is essential for lane\-change conflict detection\. PET conflicts are inherently relational events: they arise from the spatial interaction between a lane\-changing vehicle and an adjacent\-lane occupant sharing longitudinal road space\. Aggregate statistics collapse this pairwise structure, losing the specific vehicle pair information that makes PET detection possible\. On combined configurations \(TTC\|PET\), the results reflect an intermediate regime\. RF leads overall due to its strong TTC performance, but HIA\-GAT achieves the largest improvement over HomoGAT in this category \(\+0\.034 AUC on I\-80, \+0\.028 on US\-101\), suggesting that the dual\-stream gating mechanism provides its greatest value when both conflict types coexist within the same frame and must be differentiated\. The exception isTTC<0\.5\\mathrm\{TTC\}<0\.5on US\-101 \(5\.6% prevalence\), where HIA\-GAT underperforms on both F1 and AUC due to training instability on this extremely sparse configuration — the dual\-stream architecture splits an already limited set of positive examples across two streams, reducing the effective supervision signal per stream below the threshold needed for stable attention learning\.

### 4\.11Gate Supervision Analysis

A central design feature of HIA\-GAT is the conflict\-type aware gating mechanism\. Table[4](https://arxiv.org/html/2606.27577#S4.T4)summarizes the gate behavior across all nine configurations on both datasets\. We evaluate whether the gate successfully learns to differentiate between longitudinal and lateral conflicts through its supervision signal\. On single\-type configurations, the gate achieves near\-perfect conflict type routing\. For TTC\-only configurations, the average gate value across all nodes converges above 0\.5 \(0\.69\-0\.77 on I\-80, 0\.58–0\.67 on US\-101\), indicating that the majority of the representation is routed through the longitudinal stream\. For PET\-only configurations, the gate converges below 0\.5 \(0\.43–0\.46 on I\-80, 0\.40–0\.43 on US\-101\), routing the representation predominantly through the lateral stream\. The gate direction accuracy – defined as the fraction of supervised nodes where the gate value correctly exceeds 0\.5 for TTC\-involved vehicles or falls below 0\.5 for PET\-involved vehicles — exceeds 99% on all single\-type configurations across both datasets\.

![Refer to caption](https://arxiv.org/html/2606.27577v1/gate33.png)Figure 4:Per\-vehicle gate values on a representative I\-80 test frame \(TTC<<1\.0 \| PET<<1\.5\)\. Blue vehicles \(g¯i→1\.0\\bar\{g\}\_\{i\}\\to 1\.0\) are routed through the longitudinal stream; red vehicles \(g¯i→0\.0\\bar\{g\}\_\{i\}\\to 0\.0\) through the lateral stream\. The gate correctly identifies the rear\-end conflict pair \(v3v\_\{3\}/v4v\_\{4\}\) and the lane\-change conflict pair \(v8v\_\{8\}/v11v\_\{11\}\), while safe vehicles remain near 0\.5\.On combined configurations, the gate exhibits substantial per\-node variation \(standard deviation 0\.19–0\.29\), indicating that different vehicles within the same frame are routed through different streams depending on their conflict involvement\. The aggregate gate mean shifts toward the TTC\-dominant direction \(0\.54–0\.70\) because TTC events are approximately 6–11 times more numerous than PET events, which reduces PET gate direction accuracy on the most imbalanced combined configurations\. Notably, only 0\.1–1\.5% of nodes in each frame receive gate supervision, yet the gate generalizes to all nodes through the shared projection layer — demonstrating that the model learns a mapping from node representations to conflict\-type likelihood rather than memorizing supervised examples\.

Table 4:Gate supervision analysis for HIA\-GAT\.g¯\\bar\{g\}: mean gate value across all nodes \(1\.0 = fully longitudinal, 0\.0 = fully lateral\)\.σg\\sigma\_\{g\}: standard deviation\. Dir\. Acc\.: fraction of supervised nodes with correct gate direction \(\>\>0\.5 for TTC,<<0\.5 for PET\)\.
### 4\.12Discussion

The most consequential finding of this study is the stark performance gap between graph\-based and non\-graph methods on PET\-only configurations\. While non\-graph baselines achieve competitive or superior performance on TTC detection, they consistently fail on PET detection, with AUC values in the 0\.69–0\.77 range compared to 0\.80–0\.87 for GNN models\. This finding has a clear physical interpretation\. Rear\-end conflicts are characterized by aggregate kinematic signatures — high mean closing rates, small minimum headways — that are preserved by statistical summaries of frame\-level features\. Lane\-change conflicts, by contrast, are inherently pairwise events defined by the spatial and temporal relationship between two specific vehicles: one changing lanes and one occupying the target space\. When these pairwise interactions are collapsed into frame\-level means and maxima, the critical relational information is lost\. This result suggests that graph\-based representations are not merely a modeling convenience for traffic risk prediction, but a necessary structural choice for comprehensive conflict detection that spans both longitudinal and lateral interaction types\.

The comparison between GCN/GraphSAGE and HomoGAT/HIA\-GAT isolates the value of physics\-informed edge features\. On I\-80 TTC configurations, HomoGAT improves over GCN by\+0\.030\+0\.030average AUC by explicitly encoding closing rate, gap distance, and acceleration differential—the key variables underlying TTC\. HIA\-GAT further extends this idea through conflict\-specific streams: the longitudinal stream uses closing rate and acceleration differential, while the lateral stream uses lateral velocity, lane\-change flag, and longitudinal overlap\. This prevents one attention function from learning two distinct conflict mechanisms simultaneously\. The gate also improves interpretability by indicating the dominant conflict type for each risky vehicle: values near 1\.0 correspond to TTC\-related rear\-end interactions, values near 0\.0 to PET\-related lane\-change conflicts, and safe vehicles remain near 0\.5, as shown in Figure[4](https://arxiv.org/html/2606.27577#S4.F4)\. This attribution is obtained from SSM\-derived labels with negligible added annotation or computational cost\.

## 5VI Conclusion

This study proposed HIA\-GAT, a heterogeneous interaction\-aware graph attention network for freeway traffic conflict risk prediction that processes longitudinal and lateral vehicle interactions through dedicated dual\-stream pathways with physics\-informed edge features and a conflict\-type\-aware gating mechanism\. Evaluated against seven baselines across nine threshold configurations on the NGSIM I\-80 and US\-101 datasets, HIA\-GAT achieved the highest average AUC on both datasets \(0\.835 and 0\.867\), with event\-level gate supervision attaining over 99% conflict\-type routing accuracy on single\-type configurations\. The comprehensive comparison revealed that graph structure is essential for lane\-change conflict detection, where GNN models outperformed non\-graph baselines by approximately 0\.10 AUC, while physics\-informed edge features proved critical for rear\-end conflict prediction\. Beyond predictive performance, the gate mechanism provides per\-vehicle interpretability — identifying which vehicles are involved in which conflict type — offering actionable information for traffic safety monitoring systems\.

## 6VII Acknowledgement

This work was supported in part by the Transportation Network Growth Opportunity \(TNGO\) initiative funded by the Tennessee Department of Economic and Community Development, in collaboration with the University of Tennessee at Chattanooga and industry partners\.

## References

- \[1\]National Highway Traffic Safety Administration\.Fatality Analysis Reporting System \(FARS\)\.[https://www\.nhtsa\.gov/research\-data/fatality\-analysis\-reporting\-system\-fars](https://www.nhtsa.gov/research-data/fatality-analysis-reporting-system-fars), n\.d\., Accessed: 2025\-07\-07\.
- \[2\]Federal Highway Administration\.About Intersection Safety\.[https://highways\.dot\.gov/safety/intersection\-safety/about](https://highways.dot.gov/safety/intersection-safety/about), n\.d\., Accessed: 2025\-07\-07\.
- \[3\]Richard Tay and Shakil Mohammad Rifaat\.Factors contributing to the severity of intersection crashes\.Journal of Advanced Transportation, 41\(3\):245–265, 2007\.
- \[4\]Xuesong Wang and Mohamed Abdel\-Aty\.Temporal and spatial analyses of rear\-end crashes at signalized intersections\.Accident Analysis & Prevention, 38\(6\):1137–1150, 2006\.
- \[5\]Xuesong Wang and Mohamed Abdel\-Aty\.Right\-angle crash occurrence at signalized intersections\.Transportation Research Record, 2019\(1\):156–168, 2007\.
- \[6\]Andrew Northmore and Eric Hildebrand\.Intersection characteristics that influence collision severity and cost\.Journal of safety research, 70:49–57, 2019\.
- \[7\]Shane Turner and Alan Nicholson\.Intersection accident estimation: the role of intersection location and non\-collision flows\.Accident Analysis & Prevention, 30\(4\):505–517, 1998\.
- \[8\]Flora Dilys Salim, Seng Wai Loke, Andry Rakotonirainy, Bala Srinivasan, and Shonali Krishnaswamy\.Collision pattern modeling and real\-time collision detection at road intersections\.In2007 IEEE Intelligent transportation systems conference, pages 161–166\. IEEE, 2007\.
- \[9\]Abhijit Sarkar, Hananeh Alambeigi, Anthony McDonald, Gustav Markkula, and Jeff Hickman\.Role of peripheral vision in brake reaction time during safety critical events\.InProceedings of the Human Factors and Ergonomics Society Annual Meeting, volume 65, number 1, pages 695–699\. SAGE Publications Sage CA: Los Angeles, CA, 2021\.
- \[10\]Bor\-Shong Liu\.Association of intersection approach speed with driver characteristics, vehicle type and traffic conditions comparing urban and suburban areas\.Accident Analysis & Prevention, 39\(2\):216–223, 2007\.
- \[11\]Lotte Larsen and Peter Kines\.Multidisciplinary in\-depth investigations of head\-on and left\-turn road collisions\.Accident Analysis & Prevention, 34\(3\):367–380, 2002\.
- \[12\]Charith Chitraranjan, Vipooshan Vipulananthan, and Thuvarakan Sritharan\.Vision\-Based Collision Warning Systems with Deep Learning: A Systematic Review\.Journal of Imaging, 11\(2\):64, 2025\.
- \[13\]Abolfazl Razi, Xiwen Chen, Huayu Li, Hao Wang, Brendan Russo, Yan Chen, and Hongbin Yu\.Deep learning serves traffic safety analysis: A forward\-looking review\.IET Intelligent Transport Systems, 17\(1\):22–71, 2023\.
- \[14\]SM Sohel Mahmud, Luis Ferreira, Md Shamsul Hoque, and Ahmad Tavassoli\.Micro\-simulation modelling for traffic safety: A review and potential application to heterogeneous traffic environment\.IATSS research, 43\(1\):27–36, 2019\.
- \[15\]Jaeyoung Lee, Mohamed Abdel\-Aty, and Ximiao Jiang\.Development of zone system for macro\-level traffic safety analysis\.Journal of transport geography, 38:13–21, 2014\.
- \[16\]Hassan T Abdelwahab and Mohamed A Abdel\-Aty\.Artificial neural networks and logit models for traffic safety analysis of toll plazas\.Transportation Research Record, 1784\(1\):115–125, 2002\.
- \[17\]Helai Huang and Mohamed Abdel\-Aty\.Multilevel data and Bayesian analysis in traffic safety\.Accident Analysis & Prevention, 42\(6\):1556–1565, 2010\.
- \[18\]Yetis Sazi Murat and Ziya Cakici\.An integration of different computing approaches in traffic safety analysis\.Transportation research procedia, 22:265–274, 2017\.
- \[19\]Tuan Thanh Nguyen and Phuong Thao Cao\.Assessment of Traffic Safety Between Pedestrians and Vehicles Using Traffic Conflict Technique\.InInternational Conference on Sustainability in Civil Engineering, pages 763–769\. Springer, 2022\.
- \[20\]Madhumita Paul and Indrajit Ghosh\.Post encroachment time threshold identification for right\-turn related crashes at unsignalized intersections on intercity highways under mixed traffic\.International journal of injury control and safety promotion, 27\(2\):121–135, 2020\.
- \[21\]Weiwei Qi, Wei Wang, Bin Shen, and Jiabin Wu\.A modified post encroachment time model of urban road merging area based on lane\-change characteristics\.IEEE Access, 8:72835–72846, 2020\.
- \[22\]Akash Sonth, Abhijit Sarkar, Sparsh Jain, Hirva Bhagat, and Zachary R Doerzaph\.Real\-time risk prediction at signalized intersections using a graph neural network\.Safe\-D University Transportation Center, 2023\.
- \[23\]C Traffic\.Manual on uniform traffic control devices\.US Department of Transportation, Federal Highway Administation, 2009\.
- \[24\]Pnina Gershon, Kellienne R Sita, Chunming Zhu, Johnathon P Ehsani, Sheila G Klauer, Tom A Dingus, and Bruce G Simons\-Morton\.Distracted driving, visual inattention, and crash risk among teenage drivers\.American journal of preventive medicine, 56\(4\):494–500, 2019\.
- \[25\]Dominique Lord and Fred Mannering\.The statistical analysis of crash\-frequency data: A review and assessment of methodological alternatives\.Transportation research part A: policy and practice, 44\(5\):291–305, 2010\.
- \[26\]Peter T Savolainen, Fred L Mannering, Dominique Lord, and Mohammed A Quddus\.The statistical analysis of highway crash\-injury severities: A review and assessment of methodological alternatives\.Accident Analysis & Prevention, 43\(5\):1666–1676, 2011\.
- \[27\]Helai Huang, Hoong Chor Chin, and Md Mazharul Haque\.Severity of driver injury and vehicle damage in traffic crashes at intersections: a Bayesian hierarchical analysis\.Accident Analysis & Prevention, 40\(1\):45–54, 2008\.
- \[28\]Khondoker Billah, Qasim Adegbite, Hatim O Sharif, Samer Dessouky, and Lauren Simcic\.Analysis of intersection traffic safety in the city of San Antonio, 2013–2017\.Sustainability, 13\(9\):5296, 2021\.
- \[29\]N Mohamed Hasain and Mokaddes Ali Ahmed\.Safety evaluation of unsignalized intersection with heterogeneous traffic using Post Encroachment Time and conflicting vehicle speed\.European Transport/Trasporti Europei, 88\(88\):1–14, 2022\.
- \[30\]Carl Johnsson, Aliaksei Laureshyn, and Carmelo Dágostino\.Validation of surrogate measures of safety with a focus on bicyclist–motor vehicle interactions\.Accident Analysis & Prevention, 153:106037, 2021\.
- \[31\]Katja Vogel\.A comparison of headway and time to collision as safety indicators\.Accident analysis & prevention, 35\(3\):427–433, 2003\.
- \[32\]Raymond J Kiefer, Carol A Flannagan, and Christian J Jerome\.Time\-to\-collision judgments under realistic driving conditions\.Human factors, 48\(2\):334–345, 2006\.
- \[33\]Timothy L Brown\.Adjusted minimum time\-to\-collision \(TTC\): A robust approach to evaluating crash scenarios\.InProceedings of the Driving Simulation Conference North America, volume 40, pages 40–48, 2005\.
- \[34\]Lakshmi N Peesapati, Michael P Hunter, and Michael O Rodgers\.Can post encroachment time substitute intersection characteristics in crash prediction models?Journal of safety research, 66:205–211, 2018\.
- \[35\]Chuanyun Fu and Tarek Sayed\.Comparison of threshold determination methods for the deceleration rate to avoid a crash \(DRAC\)\-based crash estimation\.Accident Analysis & Prevention, 153:106051, 2021\.
- \[36\]Suzi Iryanti Fadilah and Azizul Rahman Mohd Shariff\.A time gap interval for safe following distance \(TGFD\) in avoiding car collision in wireless vehicular networks \(VANET\) environment\.In2014 5th International Conference on Intelligent Systems, Modelling and Simulation, pages 683–689\. IEEE, 2014\.
- \[37\]Michiel M Minderhoud and Piet HL Bovy\.Extended time\-to\-collision measures for road traffic safety assessment\.Accident Analysis & Prevention, 33\(1\):89–97, 2001\.
- \[38\]John C Hayward\.Near miss determination through use of a scale of danger\.Pennsylvania State University University Park, 1972\.
- \[39\]Anthony Bisulco, Vijay Kumar, and Kostas Daniilidis\.EV\-TTC: Event\-Based Time to Collision under Low Light Conditions\.IEEE Robotics and Automation Letters, 2025\.
- \[40\]Jino Thomas Therattil, More Prathamesh Avinash, and Nipjyoti Bharadwaj\.Safety Analysis at Unsignalized T Intersection Using PET and Extreme Value Theorem\.Transportation in Developing Economies, 11\(1\):1–15, 2025\.
- \[41\]Long Short\-Term Memory\.Sepp hochreiter and jürgen schmidhuber\.Neural Computation, 9\(8\):1735, 1997\.
- \[42\]Peng Chen, Weiliang Zeng, Guizhen Yu, and Yunpeng Wang\.Surrogate safety analysis of pedestrian\-vehicle conflict at intersections using unmanned aerial vehicle videos\.Journal of advanced transportation, 2017\(1\):5202150, 2017\.
- \[43\]Quan Yu and Yuting Zhou\.Traffic safety analysis on mixed traffic flows at signalized intersection based on Haar\-Adaboost algorithm and machine learning\.Safety Science, 120:248–253, 2019\.
- \[44\]Emre Kuşkapan, Mohammad Ali Sahraei, Merve Kayaci Çodur, and Muhammed Yasin Çodur\.Pedestrian safety at signalized intersections: Spatial and machine learning approaches\.Journal of Transport & Health, 24:101322, 2022\.
- \[45\]Philippe Barbosa Silva, Michelle Andrade, and Sara Ferreira\.Machine learning applied to road safety modeling: A systematic literature review\.Journal of traffic and transportation engineering \(English edition\), 7\(6\):775–790, 2020\.
- \[46\]Alex Zyner, Stewart Worrall, and Eduardo Nebot\.A recurrent neural network solution for predicting driver intention at unsignalized intersections\.IEEE Robotics and Automation Letters, 3\(3\):1759–1764, 2018\.
- \[47\]Lingxin Lu, Hui Wang, Yan Wan, and Feifei Xu\.A detection transformer\-based intelligent identification method for multiple types of road traffic safety facilities\.Sensors, 24\(10\):3252, 2024\.
- \[48\]Samuel L Gomes, Elizângela de S Rebouças, Edson Cavalcanti Neto, João P Papa, Victor HC de Albuquerque, Pedro P Rebouças Filho, and Joao Manuel RS Tavares\.Embedded real\-time speed limit sign recognition using image processing and machine learning techniques\.Neural Computing and Applications, 28\(Suppl 1\):573–584, 2017\.
- \[49\]Edson Cavalcanti Neto, Samuel Luz Gomes, Pedro Pedrosa Rebouças Filho, and Victor Hugo C de Albuquerque\.Brazilian vehicle identification using a new embedded plate recognition system\.Measurement, 70:36–46, 2015\.
- \[50\]Lynn Scholl, Mohamed Elagaty, Bismarck Ledezma\-Navarro, Edgar Zamora, and Luis Miranda\-Moreno\.A surrogate video\-based safety methodology for diagnosis and evaluation of low\-cost pedestrian\-safety countermeasures: The case of Cochabamba, Bolivia\.Sustainability, 11\(17\):4737, 2019\.
- \[51\]Abhinav Nippani, Dongyue Li, Haotian Ju, Haris Koutsopoulos, and Hongyang Zhang\.Graph neural networks for road safety modeling: Datasets and evaluations for accident analysis\.Advances in neural information processing systems, 36:52009–52032, 2023\.
- \[52\]Thomas N Kipf and Max Welling\.Semi\-supervised classification with graph convolutional networks\.arXiv preprint arXiv:1609\.02907, 2016\.
- \[53\]Zhenqin Wu, Bharath Ramsundar, Evan N Feinberg, Joseph Gomes, Caleb Geniesse, Aneesh S Pappu, Karl Leswing, and Vijay Pande\.MoleculeNet: a benchmark for molecular machine learning\.Chemical science, 9\(2\):513–530, 2018\.
- \[54\]Franco Scarselli, Marco Gori, Ah Chung Tsoi, Markus Hagenbuchner, and Gabriele Monfardini\.The graph neural network model\.IEEE transactions on neural networks, 20\(1\):61–80, 2008\.
- \[55\]Xiaoyu Mo, Yang Xing, and Chen Lv\.Heterogeneous edge\-enhanced graph attention network for multi\-agent trajectory prediction\.arXiv preprint arXiv:2106\.07161, 2021\.
- \[56\]Arnav Vaibhav Malawade, Shih\-Yuan Yu, Brandon Hsu, Deepan Muthirayan, Pramod P Khargonekar, and Mohammad Abdullah Al Faruque\.Spatiotemporal scene\-graph embedding for autonomous vehicle collision prediction\.IEEE Internet of Things Journal, 9\(12\):9379–9388, 2022\.
- \[57\]Frederik Diehl, Thomas Brunner, Michael Truong Le, and Alois Knoll\.Graph neural networks for modelling traffic participant interaction\.In2019 IEEE Intelligent Vehicles Symposium \(IV\), pages 695–701\. IEEE, 2019\.
- \[58\]Kamaldeep Singh Oberoi, Géraldine Del Mondo, Yohan Dupuis, and Pascal Vasseur\.Spatial modeling of urban road traffic using graph theory\.InProceedings of Spatial Analysis and GEOmatics \(SAGEO\) 2017, pages 264–277, 2017\.
- \[59\]Yuyan Annie Pan et al\.Urban intersection traffic flow prediction: A physics\-guided stepwise framework utilizing spatio\-temporal graph neural network algorithms\.Multimodal Transportation, 4\(2\):100207, 2025\.
- \[60\]Tobias Fleck, Karam Daaboul, Michael Weber, Philip Schörner, Marek Wehmer, Jens Doll, Stefan Orf, Nico Sußmann, Christian Hubschneider, Marc René Zofka et al\.Towards large scale urban traffic reference data: Smart infrastructure in the test area autonomous driving baden\-württemberg\.InIntelligent Autonomous Systems 15: Proceedings of the 15th International Conference IAS\-15, pages 964–982\. Springer, 2019\.
- \[61\]Maximilian Zipfl, Tobias Fleck, Marc René Zofka, and J Marius Zöllner\.From traffic sensor data to semantic traffic descriptions: The test area autonomous driving baden\-württemberg dataset \(taf\-bw dataset\)\.In2020 IEEE 23rd International Conference on Intelligent Transportation Systems \(ITSC\), pages 1–7\. IEEE, 2020\.
- \[62\]Philipp Bender, Julius Ziegler, and Christoph Stiller\.Lanelets: Efficient map representation for autonomous driving\.In2014 IEEE Intelligent Vehicles Symposium Proceedings, pages 420–425\. IEEE, 2014\.
- \[63\]Fabian Poggenhans, Jan\-Hendrik Pauls, Johannes Janosovits, Stefan Orf, Maximilian Naumann, Florian Kuhnt, and Matthias Mayr\.Lanelet2: A High\-Definition Map Framework for the Future of Automated Driving\.InProc\. IEEE Intell\. Trans\. Syst\. Conf\., Hawaii, USA, November 2018\.[http://www\.mrt\.kit\.edu/z/publ/download/2018/Poggenhans2018Lanelet2\.pdf](http://www.mrt.kit.edu/z/publ/download/2018/Poggenhans2018Lanelet2.pdf)\.
- \[64\]Mordechai Haklay and Patrick Weber\.Openstreetmap: User\-generated street maps\.IEEE Pervasive computing, 7\(4\):12–18, 2008\.
- \[65\]Mohammad Shokrolah Shirazi and Brendan Tran Morris\.Looking at intersections: a survey of intersection monitoring, behavior and safety analysis of recent studies\.IEEE Transactions on Intelligent Transportation Systems, 18\(1\):4–24, 2016\.
- \[66\]Brian L Allen, B Tom Shin, and Peter J Cooper\.Analysis of traffic conflicts and collisions\.Technical report, 1978\.
- \[67\]A Richard A Van der Horst, Maartje de Goede, Stefanie de Hair\-Buijssen, and Rob Methorst\.Traffic conflicts on bicycle paths: A systematic observation of behaviour from video\.Accident Analysis & Prevention, 62:358–368, 2014\.
- \[68\]Huanyun Zhou and Fei Huang\.Development of traffic safety evaluation method based on simulated conflicts at signalized intersections\.Procedia\-Social and Behavioral Sciences, 96:881–885, 2013\.
- \[69\]Maximilian Zipfl and J Marius Zöllner\.Towards traffic scene description: The semantic scene graph\.In2022 IEEE 25th International Conference on Intelligent Transportation Systems \(ITSC\), pages 3748–3755\. IEEE, 2022\.
- \[70\]Brendan L Douglas\.The weisfeiler\-lehman method and graph isomorphism testing\.arXiv preprint arXiv:1101\.5211, 2011\.
- \[71\]Keyulu Xu, Weihua Hu, Jure Leskovec, and Stefanie Jegelka\.How powerful are graph neural networks?arXiv preprint arXiv:1810\.00826, 2018\.
- \[72\]Petar Veličković, Guillem Cucurull, Arantxa Casanova, Adriana Romero, Pietro Lio, and Yoshua Bengio\.Graph attention networks\.arXiv preprint arXiv:1710\.10903, 2017\.
- \[73\]Justin Gilmer, Samuel S Schoenholz, Patrick F Riley, Oriol Vinyals, and George E Dahl\.Message passing neural networks\.InMachine learning meets quantum physics, pages 199–214\. Springer, 2020\.
- \[74\]U\.S\. Department of Transportation Federal Highway Administration\.Next Generation Simulation \(NGSIM\) Program I\-80 Videos\.[https://data\.transportation\.gov](https://data.transportation.gov/), 2016, \[Dataset\]\. Provided by ITS DataHub through Data\.transportation\.gov\.
- \[75\]Dungar Singh, Pritikana Das, and Indrajit Ghosh\.Conflict\-Based safety evaluations at unsignalized intersections using surrogate safety measures\.Heliyon, 10\(5\), 2024\.
- \[76\]Someswara Rao Bonela and B Raghuram Kadali\.Review of traffic safety evaluation at T\-intersections using surrogate safety measures in developing countries context\.IATSS research, 46\(3\):307–321, 2022\.
- \[77\]Avinash Chaudhari, Ninad Gore, Shriniwas Arkatkar, Gaurang Joshi, and Srinivas Pulugurtha\.Exploring pedestrian surrogate safety measures by road geometry at midblock crosswalks: A perspective under mixed traffic conditions\.IATSS research, 45\(1\):87–101, 2021\.
- \[78\]Tamer Bataineh, Nischal Bhattarai, Keshav Jimee, Yibin Zhang, Martin Lucero, Junxuan Zhao, and Hongchao Liu\.Evaluating driving behavior and intersection safety using roadside LiDAR: A study of safety indicators across collision metrics\.Journal of Transportation Safety & Security, 17\(10\):1105–1137, 2025\.
- \[79\]N Mohamed Hasain and Mokaddes Ali Ahmed\.Proposing an effective approach for traffic safety assessment on heterogeneous traffic conditions using surrogate safety measures and speed of the involved vehicles\.Traffic injury prevention, 25\(2\):219–227, 2024\.
- \[80\]Branislav Dimitrijevic, Sina Darban Khales, Roksana Asadi, and Joyoung Lee\.Short\-term segment\-level crash risk prediction using advanced data modeling with proactive and reactive crash data\.Applied Sciences, 12\(2\):856, 2022\.
- \[81\]Wenjun Xiang, Chen Peng, Jian Feng, Tianyue Zhang, and Jinrui Zhu\.Research on the Lane\-Changing Behavior of Drivers at Highway Interchanges\.InCICTP 2024, pages 2508–2517\. 2024\.
- \[82\]Yi Zhao, Chuwei Zhao, Ou Zheng, and Jianxiao Ma\.Traffic safety risk assessment and characterization of lane\-changing behavior in urban expressway interchange weaving area\.Travel Behaviour and Society, 41:101106, 2025\.
- \[83\]Yifeng Lin, Fengjuan Xu, and Huiying Wen\.Analyzing lane\-changing characteristics when interacting with different current\-lane preceding vehicles\.Journal of Transportation Safety & Security, pages 1–36, 2025\.
- \[84\]Federal Highway Administration \(FHWA\)\.5\.0 Using Safety Analyses for Planning\.[https://highways\.dot\.gov/safety/zero\-deaths/applying\-safety\-data\-and\-analysis\-performance\-based\-transportation\-planning/50](https://highways.dot.gov/safety/zero-deaths/applying-safety-data-and-analysis-performance-based-transportation-planning/50), 2025, U\.S\. Department of Transportation\. Part of the guide ’Applying Safety Data and Analysis to Performance\-Based Transportation Planning’, Accessed: 2026\-02\-22\.
- \[85\]U\.S\. Department of Transportation Federal Highway Administration\.Next Generation Simulation \(NGSIM\) Program I\-80 Videos\.[https://doi\.org/10\.21949/1504477](https://doi.org/10.21949/1504477), ITS DataHub through Data\.transportation\.gov, 2016\.
- \[86\]U\.S\. Department of Transportation Federal Highway Administration\.Next Generation Simulation \(NGSIM\) Program US\-101 Videos\.[https://doi\.org/10\.21949/1504477](https://doi.org/10.21949/1504477), ITS DataHub through Data\.transportation\.gov, 2016\.
- \[87\]Fei Peng and HaiYang Xuan\.PASTGCN: A Real\-Time Intersection Conflict Risk Prediction Model via Spatiotemporal Graph Convolutional Networks with Multimodal Sensor Fusion\.Informatica, 49\(26\), 2025\.
- \[88\]Kaliprasana Muduli, Indrajit Ghosh, and Satish V Ukkusuri\.A graph\-based spatio\-temporal framework for predicting safety\-critical pedestrian–vehicle interactions at unsignalized crosswalks\.Accident Analysis & Prevention, 228:108409, 2026\.
hia-gat: A Heterogeneous Interaction-Aware Graph Attention Network For Frame-Level Traffic Conflict Risk Prediction On Freeways

Similar Articles

A Global-Local Graph Attention Network for Traffic Forecasting

A Temporally Augmented Graph Attention Network for Affordance Classification

Hierarchical Global Attention (HGA)

Graph-Conditioned Mixture of Graph Neural Network Experts for Traffic Forecasting

Graph-Driven Cross-Industry Real-Time Monitoring Framework for Anti-Money Laundering Detection in Converged Mobility-Energy Supply Chain Networks

Submit Feedback

Similar Articles

A Global-Local Graph Attention Network for Traffic Forecasting
A Temporally Augmented Graph Attention Network for Affordance Classification
Hierarchical Global Attention (HGA)
Graph-Conditioned Mixture of Graph Neural Network Experts for Traffic Forecasting
Graph-Driven Cross-Industry Real-Time Monitoring Framework for Anti-Money Laundering Detection in Converged Mobility-Energy Supply Chain Networks