TRACER: Training-Free Closed-Loop Structured Inference for Traffic Accident Reconstruction

arXiv cs.LG Papers

Summary

TRACER is a training-free framework for traffic accident reconstruction that formulates the problem as closed-loop structured inference, iteratively refining event-anchored motion hypotheses under geometric and kinematic constraints, achieving improved fidelity and consistency over data-driven and physics-based baselines.

arXiv:2606.25002v1 Announce Type: new Abstract: Traffic accident reconstruction is a forensic inverse problem that requires recovering physically consistent motion from sparse and heterogeneous evidence. Existing learning-based approaches predominantly optimize for semantic plausibility or visual realism, rather than quantitative agreement with measurable geometry and dynamics. Here, we present TRACER, a training-free framework that formulates reconstruction as a closed-loop structured inference process. Instead of directly generating dense trajectories, our framework constructs and iteratively refines event-anchored motion hypotheses under geometric, kinematic, and interaction constraints, guided by structured case memory and consistency-driven diagnosis. This design enables incremental, interpretable corrections when evidence is insufficient, making the accident reconstruction process more aligned with the workflow of human experts. Experiments on real-world accident data show that TRACER achieves improved geometric fidelity, velocity consistency, and collision accuracy over both data-driven and physics-based baselines.
Original Article
View Cached Full Text

Cached at: 06/25/26, 05:11 AM

# TRACER: Training-Free Closed-Loop Structured Inference for Traffic Accident Reconstruction
Source: [https://arxiv.org/html/2606.25002](https://arxiv.org/html/2606.25002)
Yanchen Guan, Chengyue Wang, Bin Rao, Haicheng Liao, Jiaxun Zhang, Shang Gao, Chengzhong Xu, and Zhenning Li\*State Key Laboratory of Internet of Things for Smart City University of Macau Macau SAR, 999078, China

###### Abstract

Traffic accident reconstruction is a forensic inverse problem that requires recovering physically consistent motion from sparse and heterogeneous evidence\. Existing learning\-based approaches predominantly optimize for semantic plausibility or visual realism, rather than quantitative agreement with measurable geometry and dynamics\. Here, we present TRACER, a training\-free framework that formulates reconstruction as a closed\-loop structured inference process\. Instead of directly generating dense trajectories, our framework constructs and iteratively refines event\-anchored motion hypotheses under geometric, kinematic, and interaction constraints, guided by structured case memory and consistency\-driven diagnosis\. This design enables incremental, interpretable corrections when evidence is insufficient, making the accident reconstruction process more aligned with the workflow of human experts\. Experiments on real\-world accident data show that TRACER achieves improved geometric fidelity, velocity consistency, and collision accuracy over both data\-driven and physics\-based baselines\.

## 1Introduction

Accident reconstruction aims to recover the pre\-impact motion process that led to an observed crash from partial post\-hoc evidence[davis2003bayesian](https://arxiv.org/html/2606.25002#bib.bib1)\. This evidence is often sparse, noisy, and heterogeneous, including textual narratives, vehicle attributes, road geometry, scene sketches, and partial collision cues[wach2016calculation](https://arxiv.org/html/2606.25002#bib.bib2)\. The task therefore requires more than generating plausible trajectories: the recovered motion must remain consistent with the accident description, road topology, vehicle kinematics, multi\-agent interaction, and final impact configuration[gao2020vectornet](https://arxiv.org/html/2606.25002#bib.bib3)\.

This setting poses a structured inverse inference problem: several motion histories may explain the same report, especially when the evidence only specifies high\-level actions such as turning, braking, yielding, stopping, or running a red light[arora2021survey](https://arxiv.org/html/2606.25002#bib.bib4)\. Under such ambiguity, once the generated dense trajectory becomes inconsistent, the source of the error is difficult to localize, making it unclear which stage of the motion evolution process is responsible for the failure\. Consequently, direct dense trajectory generation is not well suited for interpretable diagnosis and localized refinement[ivanovic2021mats](https://arxiv.org/html/2606.25002#bib.bib5)\. In contrast, accident reasoning is naturally organized around event\-level stages, including upstream approach, maneuver onset, conflict entry, evasive response, and final collision approach[menzel2018scenarios](https://arxiv.org/html/2606.25002#bib.bib6)\. These stages provide a compact interface between sparse evidence and metric trajectory recovery\.

Existing accident reconstruction approaches have used accident reports, scene layouts, or simulators to synthesize accident\-related trajectories and visualizations[guo2024sovar](https://arxiv.org/html/2606.25002#bib.bib7);[li2025avd2](https://arxiv.org/html/2606.25002#bib.bib8)\. However, many reconstructed results remain vulnerable to structured inconsistencies\. A trajectory may match the narrative but violate lane topology, reach the accident point while implying an unrealistic speed profile, or approach the collision from a direction incompatible with the reported impact side\. These failure modes indicate that the central challenge of accident reconstruction is not merely generating a seemingly plausible trajectory, but rather constructing a structured motion hypothesis that remains diagnosable, localizable, and repairable under incomplete evidence\.

We propose TRACER, a closed\-loop framework for physically constrained inverse motion reconstruction\. TRACER represents each vehicle’s pre\-impact motion as an event\-anchored hypothesis composed of sparse control points, segment relations, speed stages, and collision\-approach constraints\. A structured case memory provides weak priors over route extent, maneuver style, speed tendency, and approach configuration\. Large language models are integrated as structured reasoning components across the inference loop: they propose event\-level hypotheses, assess semantic and action\-level consistency, and generate localized refinements conditioned on checker diagnostics\. Deterministic modules handle dense trajectory realization, numerical consistency checking, and edit acceptance, keeping the final reconstruction auditable and physically constrained\.

Our contributions are threefold\. First, we introduce an event\-anchored motion representation that aligns with the granularity of accident evidence and supports interpretable diagnosis and localized repair\. Second, we develop a closed\-loop structured consistency minimization framework that couples LLM\-assisted planning, hybrid checking, LLM\-guided refinement, and deterministic dense realization\. Third, we incorporate structured case memory as a weak prior mechanism that regularizes underdetermined reconstruction and provides pattern references\. Experiments on real\-world accident reconstruction data show that TRACER improves geometric accuracy, speed consistency, and collision\-level fidelity over representative trajectory\-generation baselines\.

## 2Related Work

Traffic accident reconstruction is a forensic inverse problem that recovers the causal and kinematic process of a crash from heterogeneous evidence, including measurements, sketches, vehicle damage, and witness or driver statements[zheng2020determinants](https://arxiv.org/html/2606.25002#bib.bib9);[struble2020automotive](https://arxiv.org/html/2606.25002#bib.bib10);[fernandes2018application](https://arxiv.org/html/2606.25002#bib.bib11);[ryan2024accident](https://arxiv.org/html/2606.25002#bib.bib12)\. Conventional reconstruction relies on trained investigators to integrate physical evidence into structured reports for legal, engineering, and safety analyses[rivers2010technical](https://arxiv.org/html/2606.25002#bib.bib13);[rivers2006evidence](https://arxiv.org/html/2606.25002#bib.bib14)\. Although evidentially rigorous, this process requires substantial expertise, specialized equipment, and prolonged on\-site operations[carper2000forensic](https://arxiv.org/html/2606.25002#bib.bib15);[raviv2017analyzing](https://arxiv.org/html/2606.25002#bib.bib16);[faizan2021forensic](https://arxiv.org/html/2606.25002#bib.bib17);[smith1957physical](https://arxiv.org/html/2606.25002#bib.bib18), limiting its scalability\.

Recent learning\-based methods seek to automate reconstruction by using accident reports as semantic inputs[chen2025transforming](https://arxiv.org/html/2606.25002#bib.bib19);[jiao2018virtual](https://arxiv.org/html/2606.25002#bib.bib20);[li2024steering](https://arxiv.org/html/2606.25002#bib.bib21)\. Existing approaches include language\-conditioned simulation, such as SoVAR[guo2024sovar](https://arxiv.org/html/2606.25002#bib.bib7); text\-conditioned visual generation, such as AVD2[li2025avd2](https://arxiv.org/html/2606.25002#bib.bib8); and LLM\-assisted simulation pipelines, such as AccidentSim[zhang2025accidentsim](https://arxiv.org/html/2606.25002#bib.bib22)\. These methods improve semantic controllability or visual plausibility, but often leave key physical states, geometric consistency, and quantitative trajectory fidelity insufficiently constrained\. Guan et al\.[guan2026learningphysicallygroundedtraffic](https://arxiv.org/html/2606.25002#bib.bib23)move toward evidence\-grounded reconstruction by coupling textual cues with scene geometry and motion constraints, yet existing methods still largely rely on one\-shot generation without a structured intermediate representation that can be validated and revised\.

This limitation is critical under sparse, noisy, and heterogeneous evidence, where multiple motion histories may appear plausible\. Human experts typically resolve such ambiguity through iterative hypothesis construction and refinement, reconciling semantic, geometric, and physical constraints\. Motivated by this process, we formulate accident reconstruction as structured closed\-loop inference with explicit intermediate hypotheses, retrieval\-based priors, and consistency\-guided refinement\.

## 3Methodology

Given an accident case, our goal is to recover a dense multi\-vehicle pre\-impact motion history that is consistent with the available semantic, geometric, and collision evidence\. We denote the input evidence as

X=\{xtext,xveh,xroad,p⋆,q⋆,c⋆,w\},X=\\\{x^\{\\mathrm\{text\}\},x^\{\\mathrm\{veh\}\},x^\{\\mathrm\{road\}\},p^\{\\star\},q^\{\\star\},c^\{\\star\},w\\\},\(1\)wherextextx^\{\\mathrm\{text\}\}is the accident summary,xvehx^\{\\mathrm\{veh\}\}contains vehicle\-level attributes such as travel direction, pre\-event movement, attempted avoidance maneuver, and speed limit,xroadx^\{\\mathrm\{road\}\}denotes road and lane geometry,p⋆p^\{\\star\}is the accident point,q⋆q^\{\\star\}is an impact\-side cue,c⋆c^\{\\star\}is the collision pair when available, andwwdenotes weak scene priors\. The output is a dense pre\-impact trajectory set

T^=\{𝐬^v​\(tn\)\}v=1V,𝐬^v​\(tn\)=\[x^v​\(tn\),y^v​\(tn\),u^v​\(tn\)\],\\hat\{T\}=\\\{\\hat\{\\mathbf\{s\}\}\_\{v\}\(t\_\{n\}\)\\\}\_\{v=1\}^\{V\},\\qquad\\hat\{\\mathbf\{s\}\}\_\{v\}\(t\_\{n\}\)=\[\\hat\{x\}\_\{v\}\(t\_\{n\}\),\\hat\{y\}\_\{v\}\(t\_\{n\}\),\\hat\{u\}\_\{v\}\(t\_\{n\}\)\],\(2\)whereVVis the number of involved vehicles andtnt\_\{n\}indexes the pre\-impact time steps\.

Figure[1](https://arxiv.org/html/2606.25002#S3.F1)illustrates the overall framework\. TRACER reconstructsT^\\hat\{T\}through a structured hypothesis\.

H=\{Hv\}v=1VH=\\\{H\_\{v\}\\\}\_\{v=1\}^\{V\}\(3\)denote an event\-level trajectory hypothesis, where eachHvH\_\{v\}contains sparse control points, segment relations, and speed\-temporal variables\. The method seeks a hypothesis that minimizes a structured consistency energy:

ℰ​\(H;X,M\)=\\displaystyle\\mathcal\{E\}\(H;X,M\)=λsem​ℰsem\+λact​ℰact\+λgeo​ℰgeo\+λspd​ℰspd\+λkin​ℰkin\+λcol​ℰcol,\\displaystyle\\lambda\_\{\\mathrm\{sem\}\}\\mathcal\{E\}\_\{\\mathrm\{sem\}\}\+\\lambda\_\{\\mathrm\{act\}\}\\mathcal\{E\}\_\{\\mathrm\{act\}\}\+\\lambda\_\{\\mathrm\{geo\}\}\\mathcal\{E\}\_\{\\mathrm\{geo\}\}\+\\lambda\_\{\\mathrm\{spd\}\}\\mathcal\{E\}\_\{\\mathrm\{spd\}\}\+\\lambda\_\{\\mathrm\{kin\}\}\\mathcal\{E\}\_\{\\mathrm\{kin\}\}\+\\lambda\_\{\\mathrm\{col\}\}\\mathcal\{E\}\_\{\\mathrm\{col\}\},\(4\)whereM=M​\(X\)M=M\(X\)denotes retrieved case memory\. The terms respectively measure consistency with the accident description, action sequence, road geometry, path–speed relation, vehicle kinematics, and collision configuration\. Since this objective involves discrete semantic judgments, geometric constraints, and collision\-specific reasoning, TRACER approximates its minimization through a closed\-loop constrained inference process:

X→M​\(X\)→H\(0\)→C\(0\)→H\(1\)→⋯→T^,X\\rightarrow M\(X\)\\rightarrow H^\{\(0\)\}\\rightarrow C^\{\(0\)\}\\rightarrow H^\{\(1\)\}\\rightarrow\\cdots\\rightarrow\\hat\{T\},\(5\)where the planner proposes an initial event\-level hypothesis, the checker diagnoses violations of the consistency energy, the refiner reduces the diagnosed inconsistencies, and the realizer converts the refined hypothesis into dense trajectories\.

![Refer to caption](https://arxiv.org/html/2606.25002v1/LLM-total.png)Figure 1:Overview of TRACER\. Our framework reconstructs pre\-impact trajectories through closed\-loop constrained inference\. Structured memory provides weak priors, the planner generates an event\-anchored hypothesis, the checker diagnoses semantic, action\-level, geometric, kinematic, and collision inconsistencies, and the refiner repairs the hypothesis with the criticism from checker\.### 3\.1Structured Case Memory

Sparse accident evidence often leaves multiple motion histories plausible\. To reduce this ambiguity, we use a structured case memory from the training split\. Each memory item is represented as

mi=\{zisem,zigeo,ziint,πi\},m\_\{i\}=\\\{z\_\{i\}^\{\\mathrm\{sem\}\},z\_\{i\}^\{\\mathrm\{geo\}\},z\_\{i\}^\{\\mathrm\{int\}\},\\pi\_\{i\}\\\},\(6\)wherezisemz\_\{i\}^\{\\mathrm\{sem\}\}summarizes semantic cues such as vehicle count, pre\-event movement, avoidance maneuver, impact\-side evidence, and speed\-related information;zigeoz\_\{i\}^\{\\mathrm\{geo\}\}summarizes road layout and lane\-structure cues;ziintz\_\{i\}^\{\\mathrm\{int\}\}describes interaction and approach relations; andπi\\pi\_\{i\}stores compact motion priors, including approximate path extent, approach angle, dominant motion pattern, speed tendency, and braking style\.

The retrieved memory set is

M​\(X\)=\{m\(1\),…,m\(K\)\},M\(X\)=\\\{m\_\{\(1\)\},\\dots,m\_\{\(K\)\}\\\},\(7\)
Each training case is converted into a memory descriptor containing semantic, interaction, road\-geometric, speed, and trajectory summaries\. The retrieval first ranks cases by semantic and interaction compatibility, and then reranks the top candidates by road\-geometry similarity\. The resulting top\-KKcases are used as structured references for the planner and speed planner\. The memory is used as a weak prior over the reconstruction hypothesis, because textual evidence is often insufficient to determine metric motion uniquely, while similar cases can still provide useful regularities about how vehicles approach conflict regions under comparable road and interaction conditions\. It narrows the plausible range of approach paths, maneuver styles, speed profiles, and collision\-approach configurations, therefore regularizes the hypothesis space\. The final reconstruction remains determined by the current case evidence and by the subsequent consistency checking and refinement process\.

### 3\.2Event\-Anchored Planner

The planner constructs an initial structured hypothesisH\(0\)H^\{\(0\)\}from the accident evidence and retrieved memory priors\. For each vehiclevv, the hypothesis is represented as

Hv=\(Pv,Rv,Uv\),H\_\{v\}=\(P\_\{v\},R\_\{v\},U\_\{v\}\),\(8\)wherePvP\_\{v\}contains event\-anchored control points,RvR\_\{v\}contains segment\-level motion relations, andUvU\_\{v\}contains the speed profile and temporal allocation associated with the event anchors\.

Pv=\{𝐩v,k\}k=0Ka−1\.P\_\{v\}=\\\{\\mathbf\{p\}\_\{v,k\}\\\}\_\{k=0\}^\{K\_\{a\}\-1\}\.\(9\)In our implementation,Ka=10K\_\{a\}=10, corresponding to a compact accident\-oriented motion skeleton, as shown in Figure[2](https://arxiv.org/html/2606.25002#S3.F2)\.

The event\-anchored representation is more suitable for accident reconstruction than one\-shot dense trajectory generation\. Accident reports usually describe motion through a small number of semantically meaningful stages, such as pre\-event stages and avoidance stages, rather than through frame\-level description\. A dense sequence generated directly from such evidence is difficult to verify because an error may be distributed across many time steps and to identify the specific action corresponding to each time step\. By contrast, an event\-level hypothesis exposes the key reconstruction decisions: where the vehicle approaches from, when it enters the conflict region, how it maneuvers or responds, and how it reaches the collision point\. These decisions can be checked against road geometry, action semantics, speed feasibility, and impact configuration before dense trajectories are realized\. Moreover, when reconstruction errors occur, the event\-anchor representation enables the inconsistency to be localized to specific anchors\. This establishes an explicit correspondence between the reconstructed trajectory and the input behavioral evidence, making the underlying failure easier to diagnose and supporting localized trajectory editing and refinement\.

![Refer to caption](https://arxiv.org/html/2606.25002v1/LLM-anchors.png)Figure 2:Event\-anchored trajectory hypothesis\. The planner represents each vehicle trajectory using sparse control points aligned with accident\-relevant stages, such as upstream approach, maneuver onset, conflict entry, evasive response, and final collision approach\. This categorization aligns with the inputs from the original accident reports, which delineate four primary phases: Initial, Motion, Reaction, and Collision\.The planner is LLM\-assisted but constrained by the structured hypothesis format\. It receives the accident description, vehicle attributes, road geometry, collision cues, and retrieved memory summaries, and proposes event anchors, segment relations, and speed\-temporal variables:

H\(0\)=fplan​\(X,M\)\.H^\{\(0\)\}=f\_\{\\mathrm\{plan\}\}\(X,M\)\.\(10\)The geometric part of the hypothesis specifies where the vehicle approaches from, how it traverses the road structure, and how it reaches the collision region\. The speed\-temporal part specifies a staged pre\-impact motion profile, including approximate cruising, maneuver, avoidance, and collision\-approach phases\. Formally, we write

Uv=\(\{uv​\(tn\)\}n=0T−1,ℓv,τv\),U\_\{v\}=\\left\(\\\{u\_\{v\}\(t\_\{n\}\)\\\}\_\{n=0\}^\{T\-1\},\\ell\_\{v\},\\tau\_\{v\}\\right\),\(11\)whereuv​\(tn\)u\_\{v\}\(t\_\{n\}\)is the speed at timetnt\_\{n\},ℓv\\ell\_\{v\}denotes speed\-stage labels, andτv\\tau\_\{v\}denotes event\-time allocation for key anchors or motion phases\.

The memory priors influence both spatial and temporal components of the initial hypothesis\. They suggest plausible route extent, approach angle, maneuver style, speed tendency, and braking behavior, but they do not determine coordinates or dense trajectories directly\. This is important because a plausible geometric path may still imply an invalid reconstruction if its length is inconsistent with the inferred speed profile\. The planner therefore initializesPvP\_\{v\},RvR\_\{v\}, andUvU\_\{v\}jointly, allowing the checker to evaluate semantic, geometric, and path–speed consistency within the same structured hypothesis\.

### 3\.3Consistency Checker

The checker evaluates the current hypothesisH\(r\)H^\{\(r\)\}by identifying violations of the structured consistency energy\. It produces a diagnostic object

C\(r\)=fcheck​\(H\(r\),X,M\),C^\{\(r\)\}=f\_\{\\mathrm\{check\}\}\(H^\{\(r\)\},X,M\),\(12\)whereC\(r\)C^\{\(r\)\}contains violation scores and localized issue descriptions\. The diagnosis specifies which vehicle, segment, event anchor, or collision relation contributes to the inconsistency\.

The checker combines LLM\-based semantic judgment with deterministic physical and geometric evaluation\. The semantic consistency term evaluates whether the hypothesis agrees with the accident report and vehicle attributes:

ℰsem=∑v=1Vdsem​\(Hv,xtext,xveh\)\.\\mathcal\{E\}\_\{\\mathrm\{sem\}\}=\\sum\_\{v=1\}^\{V\}d\_\{\\mathrm\{sem\}\}\(H\_\{v\},x^\{\\mathrm\{text\}\},x^\{\\mathrm\{veh\}\}\)\.\(13\)The action\-consistency term evaluates whether the inferred sequence of actions is compatible with the reported maneuver, avoidance behavior, and interaction process:

ℰact=∑v=1Vdact​\(Hv,xtext,xveh\)\.\\mathcal\{E\}\_\{\\mathrm\{act\}\}=\\sum\_\{v=1\}^\{V\}d\_\{\\mathrm\{act\}\}\(H\_\{v\},x^\{\\mathrm\{text\}\},x^\{\\mathrm\{veh\}\}\)\.\(14\)These two terms are assessed by the LLM checker because they require interpretation of high\-level descriptions such as turning, stopping, swerving, yielding, or running a red light\.

The remaining terms are evaluated through explicit computation whenever possible\. The geometric term checks lane adherence, travel direction, and road compatibility:

ℰgeo=∑v=1Vdgeo​\(Hv,xroad\)\.\\mathcal\{E\}\_\{\\mathrm\{geo\}\}=\\sum\_\{v=1\}^\{V\}d\_\{\\mathrm\{geo\}\}\(H\_\{v\},x^\{\\mathrm\{road\}\}\)\.\(15\)The path–speed term couples the geometric path length with the speed\-implied travel distance\. LetLv​\(H\)L\_\{v\}\(H\)be the path length induced by the event anchors and segment relations, and let

Dv​\(Uv\)=Δ​t​∑nuv​\(tn\)D\_\{v\}\(U\_\{v\}\)=\\Delta t\\sum\_\{n\}u\_\{v\}\(t\_\{n\}\)\(16\)be the distance implied by the speed profile\. We use

ℰspd=∑v=1V\|Lv​\(H\)Dv​\(Uv\)\+ϵ−1\|\.\\mathcal\{E\}\_\{\\mathrm\{spd\}\}=\\sum\_\{v=1\}^\{V\}\\left\|\\frac\{L\_\{v\}\(H\)\}\{D\_\{v\}\(U\_\{v\}\)\+\\epsilon\}\-1\\right\|\.\(17\)The kinematic term penalizes unstable speed variation:

ℰkin=∑v=1Vmax⁡\(0,maxn⁡\|uv​\(tn\+1\)−uv​\(tn\)\|−3\.0\)8\.0\.\\mathcal\{E\}\_\{\\mathrm\{kin\}\}=\\sum\_\{v=1\}^\{V\}\\frac\{\\max\\left\(0,\\max\_\{n\}\|u\_\{v\}\(t\_\{n\+1\}\)\-u\_\{v\}\(t\_\{n\}\)\|\-3\.0\\right\)\}\{8\.0\}\.\(18\)Finally, the collision term evaluates whether the final approach is compatible with the accident point and available collision cues:

ℰcol=∑\(i,j\)∈c⋆dcol​\(Hi,Hj,p⋆,q⋆\)\.\\mathcal\{E\}\_\{\\mathrm\{col\}\}=\\sum\_\{\(i,j\)\\in c^\{\\star\}\}d\_\{\\mathrm\{col\}\}\(H\_\{i\},H\_\{j\},p^\{\\star\},q^\{\\star\}\)\.\(19\)
The checker does not only score the hypothesis\. Its main role is to identify where and why the reconstruction is inconsistent\. This localized diagnosis is essential because most reconstruction failures are not global\. A trajectory may have a plausible upstream approach but an invalid collision tail, or a correct endpoint but an implausible speed–path relation\. By exposing these failure modes at the event and segment level, the checker provides the information needed for targeted refinement\.

### 3\.4Consistency\-Guided Refiner

The refiner updates the current hypothesis according to the checker diagnosis:

H\(r\+1\)=frefine​\(H\(r\),C\(r\),X,M\)\.H^\{\(r\+1\)\}=f\_\{\\mathrm\{refine\}\}\(H^\{\(r\)\},C^\{\(r\)\},X,M\)\.\(20\)Its objective is to reduce the structured consistency energy:

ℰ​\(H\(r\+1\);X,M\)<ℰ​\(H\(r\);X,M\)\.\\mathcal\{E\}\(H^\{\(r\+1\)\};X,M\)<\\mathcal\{E\}\(H^\{\(r\)\};X,M\)\.\(21\)The refinement step is local\. Rather than regenerating a complete dense trajectory, the refiner edits the event\-level hypothesis by adjusting control points, segment relations, speed\-stage variables, or event timing\. The possible corrections include modifying an upstream approach, changing a maneuver segment, improving path–speed compatibility, or repairing the final collision\-approach region\. The detailed local refinement procedure is provided in AppendixLABEL:app:local\_refinement\.

The refiner is also LLM\-assisted\. The LLM receives the current structured hypothesis, the checker diagnosis, the accident evidence, and memory priors, and proposes corrections in the same hypothesis space used by the planner\. This keeps planning and refinement aligned: both operate on event anchors, segment relations, and speed\-temporal variables\. Deterministic checks are then used to evaluate whether the proposed update reduces the relevant inconsistency\.

This closed\-loop design gives TRACER an explicit constrained inference mechanism\. The planner produces an initial explanation of the accident process, the checker turns the consistency energy into localized violations, and the refiner modifies the hypothesis to reduce these violations\. The loop continues until the hypothesis becomes stable or the predefined refinement budget is reached\.

### 3\.5Trajectory Realization

The final stage converts the refined event\-level hypothesis into dense trajectories\. For each vehicle, the event anchors and segment relations define a continuous geometric path

Γv​\(s\),s∈\[0,Lv\],\\Gamma\_\{v\}\(s\),\\qquad s\\in\[0,L\_\{v\}\],\(22\)wheressis arc length andLvL\_\{v\}is the total path length\. This construction separates spatial path generation from temporal parameterization\. The anchors and segment relations determine the road\-consistent path, while the speed and temporal variables determine how the vehicle progresses along that path during the pre\-impact window\.

Given the speed profileuv​\(tn\)u\_\{v\}\(t\_\{n\}\), we setsv​\(t0\)=0s\_\{v\}\(t\_\{0\}\)=0and update the traveled distance as

sv​\(tn\+1\)=clip⁡\(sv​\(tn\)\+Δ​t​uv​\(tn\),0,Lv\)\.s\_\{v\}\(t\_\{n\+1\}\)=\\operatorname\{clip\}\\left\(s\_\{v\}\(t\_\{n\}\)\+\\Delta t\\,u\_\{v\}\(t\_\{n\}\),0,L\_\{v\}\\right\)\.\(23\)The dense trajectory is then obtained by evaluating the path at the corresponding arc\-length positions:

\[x^v​\(tn\),y^v​\(tn\)\]=Γv​\(sv​\(tn\)\),u^v​\(tn\)=uv​\(tn\)\.\[\\hat\{x\}\_\{v\}\(t\_\{n\}\),\\hat\{y\}\_\{v\}\(t\_\{n\}\)\]=\\Gamma\_\{v\}\(s\_\{v\}\(t\_\{n\}\)\),\\qquad\\hat\{u\}\_\{v\}\(t\_\{n\}\)=u\_\{v\}\(t\_\{n\}\)\.\(24\)
For vehicles involved in the collision, the terminal segment is constrained by the accident point and available impact\-side evidence\. This ensures that the reconstructed trajectories are not only endpoint\-aligned, but also consistent with the local contact geometry near impact\. The final output is therefore a dense trajectory sequence derived from a memory\-regularized, checked, and refined event\-level reconstruction hypothesis\.

## 4Experiments

### 4\.1Dataset

We evaluate our method on CISS\-REC, a real\-world traffic accident reconstruction dataset\. CISS\-REC contains 6,217 in\-depth accident cases collected in the United States, with 5,217 cases used for training and 1,000 held out for testing\. Each case provides heterogeneous reconstruction evidence, including textual accident reports, total\-station\-based scene measurements, high\-precision maps, vehicle trajectories, and EDR signals\.

As our method is training\-free, the original training split is not used for parameter optimization\. Instead, it is converted into a structured case memory that provides retrieval\-based accident priors\. All methods are evaluated on the same 1,000\-case test split to ensure a fair comparison with learning\-based baselines\.

### 4\.2Evaluation metrics

We assess reconstruction quality from geometric, kinematic, collision, semantic, and physical perspectives\. Geometric fidelity is measured by average keypoint distance \(AKD\), which computes the mean Euclidean error between reconstructed and reference vehicle positions, and by average accident\-point distance \(AAPD\), which further decomposes collision localization error into tangential and normal components, denoted asA​A​P​DtanAAPD\_\{\\mathrm\{tan\}\}andA​A​P​DnormAAPD\_\{\\mathrm\{norm\}\}\. Kinematic accuracy is evaluated by average velocity deviation \(AVD\) against EDR\-derived reference speeds\.

To evaluate event\-level collision consistency, collision rate \(CR\) measures whether the reconstructed trajectories produce a valid contact event, while collision surface accuracy \(CSA\) measures whether the reconstructed impact region matches the reported contact surfaces\. We further report behavior accuracy \(BA\) and role accuracy \(RA\) evaluated by LLM to quantify consistency with annotated participant actions and causal roles\. Finally, physical motion consistency is evaluated using average acceleration error \(AccE\) and curvature error \(KE\), which compare reconstructed vehicle dynamics with the corresponding reference trajectories\.

### 4\.3Evaluation results

Table 1:Quantitative comparison of different models for accident reconstruction\. Distance\-based metrics are reported inmm, velocity\-based metrics inm/sm/s, and accuracy\-based metrics in %\.Table[1](https://arxiv.org/html/2606.25002#S4.T1)presents the performance comparison between the proposed framework and other baseline models on the CISS\-REC dataset\. All learning\-based baselines are adapted to the same reconstruction input contract, including encoded textual evidence, vehicle attributes, road geometry, initial lane, accident\-point information, and vehicle\- or collision\-related conditions\. In the experiments, our framework adopts Qwen\-3\.5\-Flash as the default backbone LLM, with the number of retrieved cases set to Top\-K=5K=5and the maximum number of refinement iterations set to33\.

According to the results, the proposed framework achieves the best overall performance in trajectory shape, velocity consistency, collision localization, collision pattern reconstruction, and semantic behavior\-role consistency\. Notably, although PC\-Crash and Momentum\-Energy explicitly constrain vehicles to collide at a common point and therefore obtain high collision rates, their simplified collision assumptions lead to poorer collision localization and impact\-pattern accuracy\.

Compared with the second\-best model, which is the first trajectory reconstruction approach designed for weakly supervised settings with physical constraints, our method not only outperforms it across all metrics, but also demonstrates a significant advantage in AVD, achieving performance comparable to STGCN, which primarily captures velocity trends\. This improvement can be attributed to two key factors: first, the use of case memory enables the model to learn velocity evolution patterns from similar accident cases; second, the explicit coupling between trajectory geometry and speed planning allows motion patterns to guide velocity hypothesis construction, thereby alleviating the optimization conflict between trajectory and velocity learning\. Furthermore, significant improvements in acceleration error also substantiate this view\.

![Refer to caption](https://arxiv.org/html/2606.25002v1/LLM-cases.png)Figure 3:Qualitative visualization of accident trajectory reconstruction results\. Dark solid trajectories denote the reconstructed motions, while light dashed trajectories denote the original trajectories\. \(a\) Original trajectory and reconstructed result of a representative accident case\. \(b\) Case memory used for the reconstruction in \(a\)\. \(c\) Reconstruction result without the refiner\. \(d\) Reconstruction result without case memory\. \(e\) Original trajectory and reconstructed result in a multi\-vehicle accident scenario\. \(f\) Original trajectory and reconstructed result of a failure case\.Figure[3](https://arxiv.org/html/2606.25002#S4.F3)presents visualization results of several representative reconstruction cases\. Figure[3](https://arxiv.org/html/2606.25002#S4.F3)\(a\) shows a turning\-versus\-straight conflict scenario, while Fig\.[3](https://arxiv.org/html/2606.25002#S4.F3)\(b\) illustrates the corresponding retrieved case memory selected based on behavioral and contextual conditions\. The motion patterns in the two cases are similar, yet they are not entirely identical in terms of scale, direction, and specific details\. The final result does not directly replicate the retrieved case; instead, it utilizes it as a weak prior and re\-infers the outcome based on current evidence\. Figures[3](https://arxiv.org/html/2606.25002#S4.F3)\(c\) and[3](https://arxiv.org/html/2606.25002#S4.F3)\(d\) present reconstruction results after ablating the refiner and case memory modules, respectively\. It can be observed that for relatively complex motion patterns, such as turning vehicles, the combination of case memory and the refiner effectively corrects implausible trajectory segments through weak prior guidance and consistency\-based repair\. Moreover, due to the event\-anchored representation, the refiner is able to adjust trajectory segments within the maneuver phase while preserving the consistency of the conflict phase, enabling local trajectory correction without requiring full regeneration\.

Figures[3](https://arxiv.org/html/2606.25002#S4.F3)\(e\) and[3](https://arxiv.org/html/2606.25002#S4.F3)\(f\) further highlight several limitations of the current framework\. Figure[3](https://arxiv.org/html/2606.25002#S4.F3)\(e\) depicts a multi\-vehicle scenario\. Although the reconstructed trajectories successfully reproduce the collision events described in the report, the model fails to capture abrupt motion changes between multiple collisions of the same vehicle, instead producing overly smooth trajectories\. Figure[3](https://arxiv.org/html/2606.25002#S4.F3)\(f\) presents a failure case, where the red vehicle initially travels in an open area between two grass regions\. As a result, the boundaries on both sides of its initial position are not recognized as valid road edges, causing the vehicle location to fall outside the candidate lane set\. Consequently, the model assigns this vehicle to the same lane as the blue vehicle based on semantic cues, leading to a significant deviation in the initial position and ultimately affecting the overall reconstruction outcome\.

### 4\.4Ablation studies

Table[2](https://arxiv.org/html/2606.25002#S4.T2)shows that the proposed components contribute to different aspects of reconstruction\. Removing case memory produces the overall degradation, which suggests that memory is important for resolving the underdetermined global approach geometry and motion scale\. Using only the top\-33retrieved cases recovers most of the performance, but remains slightly worse than the full model, indicating that a richer memory context provides more stable priors\.

Speed–path coupling mainly affects velocity consistency\. Without it, AVD increases to6\.376\.37, whereas AKD, CR, and CSA remain close to the full model\. This indicates that the coupling term primarily aligns path length with speed evolution rather than changing the collision outcome\. In contrast, removing the checker and refiner mainly harms local accident\-point alignment, which confirms the role of closed\-loop diagnosis and refinement in correcting local geometric inconsistencies\. The event\-anchor representation provides substantial improvements in spatial localization, collision consistency, and collision pattern reconstruction for traffic accident reconstruction\. By generating and evaluating trajectories according to accident\-oriented stages, the framework achieves stronger interpretability, improved localized reparability, and more stable collision consistency\.

The refinement\-round ablation shows that one round already achieves performance close to the full model, and two rounds provide only marginal additional gain\. Removing collision\-tail repair has a focused effect on terminal collision quality: CR drops from88\.27%88\.27\\%to82\.31%82\.31\\%, CSA drops from57\.06%57\.06\\%to50\.00%50\.00\\%, and both AAPD errors increase\. This demonstrates that terminal repair improves impact\-point and contact\-side fidelity while leaving the global path largely unchanged\. Overall, the results indicate that memory, speed–path coupling, checker\-refiner inference, and collision\-tail repair address complementary reconstruction errors\.

Table 2:Ablation study of key modules of our proposed model\.

## 5Conclusion

We presented TRACER, a training\-free framework for traffic accident reconstruction from sparse and heterogeneous evidence\. TRACER reconstructs pre\-impact motion through event\-anchored hypotheses that can be checked and repaired at semantically meaningful stages\. Experiments on CISS\-REC show that this structured inference design improves geometric fidelity, velocity consistency, and collision\-level accuracy\. The ablation studies further indicate that structured case memory mainly helps resolve ambiguous approach geometry and motion scale, speed–path coupling improves velocity consistency, and the checker–refiner loop together with collision\-tail repair improves local accident\-point and contact\-side fidelity\. TRACER is intended as an assistive reconstruction tool, not an autonomous liability\-assessment system\. Future work should extend the fixed skeleton to adaptive event graphs, incorporate stronger dynamic\-feasibility constraints, and quantify uncertainty under incomplete or ambiguous evidence\. In forensic, legal, or insurance\-related applications, the reconstructions should therefore be interpreted together with physical evidence and expert judgment\.

## References

- \[1\]Gary A Davis\.Bayesian reconstruction of traffic accidents\.Law, Probability and Risk, 2\(2\):69–89, 2003\.
- \[2\]Wojciech Wach\.Calculation reliability in vehicle accident reconstruction\.Forensic science international, 263:27–38, 2016\.
- \[3\]Jiyang Gao, Chen Sun, Hang Zhao, Yi Shen, Dragomir Anguelov, Congcong Li, and Cordelia Schmid\.Vectornet: Encoding hd maps and agent dynamics from vectorized representation\.InProceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 11525–11533, 2020\.
- \[4\]Saurabh Arora and Prashant Doshi\.A survey of inverse reinforcement learning: Challenges, methods and progress\.Artificial Intelligence, 297:103500, 2021\.
- \[5\]Boris Ivanovic, Amine Elhafsi, Guy Rosman, Adrien Gaidon, and Marco Pavone\.Mats: An interpretable trajectory forecasting representation for planning and control\.InConference on Robot Learning, pages 2243–2256\. PMLR, 2021\.
- \[6\]Till Menzel, Gerrit Bagschik, and Markus Maurer\.Scenarios for development, test and validation of automated vehicles\.In2018 IEEE intelligent vehicles symposium \(IV\), pages 1821–1827\. IEEE, 2018\.
- \[7\]An Guo, Yuan Zhou, Haoxiang Tian, Chunrong Fang, Yunjian Sun, Weisong Sun, Xinyu Gao, Anh Tuan Luu, Yang Liu, and Zhenyu Chen\.Sovar: Build generalizable scenarios from accident reports for autonomous driving testing\.InProceedings of the 39th IEEE/ACM International Conference on Automated Software Engineering, pages 268–280, 2024\.
- \[8\]Cheng Li, Keyuan Zhou, Tong Liu, Yu Wang, Mingqiao Zhuang, Huan\-ang Gao, Bu Jin, and Hao Zhao\.Avd2: Accident video diffusion for accident video description\.arXiv preprint arXiv:2502\.14801, 2025\.
- \[9\]Zhenjie Zheng, Zhengli Wang, Liyun Zhu, and Hai Jiang\.Determinants of the congestion caused by a traffic accident in urban road networks\.Accident Analysis & Prevention, 136:105327, 2020\.
- \[10\]Donald E Struble and John D Struble\.Automotive accident reconstruction: practices and principles\.CRC Press, 2020\.
- \[11\]Fábio AO Fernandes, Ricardo J Alves de Sousa, and Mariusz Ptak\.Application of numerical methods for accident reconstruction and forensic analysis\.InHead Injury Simulation in Road Traffic Accidents, pages 59–98\. Springer, 2018\.
- \[12\]David S Ryan and Esmaeil Zarei\.Accident investigation processes and techniques in sociotechnical systems\.InSafety Causation Analysis in Sociotechnical Systems: Advanced Models and Techniques, pages 21–45\. Springer, 2024\.
- \[13\]Robert W Rivers\.Technical Traffic Crash Investigators’ Handbook:\(level 3\): a Technical Reference, Training, Investigation and Reconstruction Manual\.Charles C Thomas Publisher, 2010\.
- \[14\]Robert W Rivers\.Evidence in traffic crash investigation and reconstruction: identification, interpretation and analysis of evidence, and the traffic crash investigation and reconstruction process\.Charles C Thomas Publisher, 2006\.
- \[15\]Kenneth L Carper\.Forensic engineering\.CRC press, 2000\.
- \[16\]Gabriel Raviv, Barak Fishbain, and Aviad Shapira\.Analyzing risk factors in crane\-related near\-miss and accident reports\.Safety science, 91:192–205, 2017\.
- \[17\]Khurram Faizan and Adeel Abid\.Forensic investigation of road traffic accident cases in pakistan and types of physical evidence\.Pakistan Social Sciences Review, 5\(4\):405–422, 2021\.
- \[18\]H Ward Smith\.Physical evidence in the investigation of traffic accidents\.J\. Crim\. L\. Criminology & Police Sci\., 48:93, 1957\.
- \[19\]Yanzhan Chen, Qian Zhang, and Fan Yu\.Transforming traffic accident investigations: a virtual\-real\-fusion framework for intelligent 3d traffic accident reconstruction\.Complex & Intelligent Systems, 11\(1\):76, 2025\.
- \[20\]Peifeng Jiao, Qifeng Miao, Meichao Zhang, and Weidong Zhao\.A virtual reality method for digitally reconstructing traffic accidents from videos or still images\.Forensic science international, 292:176–180, 2018\.
- \[21\]Zhenning Li, Zhiyong Cui, Haicheng Liao, John Ash, Guohui Zhang, Chengzhong Xu, and Yinhai Wang\.Steering the future: Redefining intelligent transportation systems with foundation models\.Chain, 1\(1\):46–53, 2024\.
- \[22\]Xiangwen Zhang, Qian Zhang, Longfei Han, Qiang Qu, and Xiaoming Chen\.Accidentsim: Generating physically realistic vehicle collision videos from real\-world accident reports\.arXiv preprint arXiv:2503\.20654, 2025\.
- \[23\]Yanchen Guan, Haicheng Liao, Chengyue Wang, and Zhenning Li\.Learning physically grounded traffic accident reconstruction from public accident reports, 2026\.
- \[24\]Bing Yu, Haoteng Yin, and Zhanxing Zhu\.Spatio\-temporal graph convolutional networks: A deep learning framework for traffic forecasting\.arXiv preprint arXiv:1709\.04875, 2017\.
- \[25\]Larry Schumaker\.Spline functions: basic theory\.Cambridge university press, 2007\.
- \[26\]Greg Van Houdt, Carlos Mosquera, and Gonzalo Nápoles\.A review on the long short\-term memory model\.Artificial intelligence review, 53\(8\):5929–5955, 2020\.
- \[27\]Nigamaa Nayakanti, Rami Al\-Rfou, Aurick Zhou, Kratarth Goel, Khaled S Refaat, and Benjamin Sapp\.Wayformer: Motion forecasting via simple & efficient attention networks\.arXiv preprint arXiv:2207\.05844, 2022\.
- \[28\]Zikang Zhou, Luyao Ye, Jianping Wang, Kui Wu, and Kejie Lu\.Hivt: Hierarchical vector transformer for multi\-agent motion prediction\.InProceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 8823–8833, 2022\.
- \[29\]Hermann Steffan and Andreas Moser\.The collision and trajectory models of pc\-crash\.Technical report, SAE Technical Paper, 1996\.
- \[30\]Xiaoyun Zhang, Xianlong Jin, and Jie Shen\.Virtual reconstruction of two types of traffic accident by the tire marks\.InInternational Conference on Artificial Reality and Telexistence, pages 1128–1135\. Springer, 2006\.

Similar Articles

TRACER: Trace-Based Adaptive Cost-Efficient Routing for LLM Classification

Hugging Face Daily Papers

TRACER is an open-source system that trains lightweight ML surrogates on production traces from LLM classification endpoints, routing requests through a parity gate that activates surrogates only when agreement with the original model exceeds a specified threshold. This approach achieves 83-100% surrogate coverage on intent classification benchmarks while maintaining interpretability into handling boundaries and failure modes.