Mobility Anomaly Generation using LLM-Driven Behavior with Kinematic Constraints

arXiv cs.AI Papers

Summary

Introduces a generative framework that uses LLM agents to inject behavioral anomalies into simulated trajectories and applies kinematic and map constraints to produce realistic anomalous mobility data with ground truth.

arXiv:2606.10314v1 Announce Type: new Abstract: Although the study of human trajectory anomalies is critical for advancing spatial data mining, empirical research remains severely hindered by a pervasive lack of ground-truth datasets. Despite the availability of several real-world and simulated human trajectory collections, these datasets exclusively capture normal mobility patterns and lack annotated anomalies. This specific scarcity is fundamentally driven by the inherent statistical rarity of anomalous events, precluding the feasibility of conventional observational methods. Compounding this challenge, the systematic acquisition of large-scale mobility data is strictly bottlenecked by prohibitive costs and stringent privacy regulations. To overcome these fundamental limitations and establish a reliable human trajectory anomalies dataset with annotated ground truth, we introduce a novel, end-to-end generative framework designed to synthesize realistic trajectory anomalies at scale. Our architecture bridges the gap between purely synthetic mobility data and complex real-world physical constraints by operating directly on baseline simulated trajectories. We employ Large Language Model (LLM) agents to systematically inject semantically meaningful behavioral anomalies such as irregular out-of-distribution check-ins and skipped routine visits. To ensure rigorous spatial validity, the system leverages map-constrained routing reconstruction to recalculate the physical transitions between these LLM agent-modified staypoints. Moreover, to narrow the simulation-to-reality gap, we augment the resulting trajectories with a context-aware spatial noise model, parameterized by environmental and location-specific variables, to accurately emulate heterogeneous GPS sensor degradation.
Original Article
View Cached Full Text

Cached at: 06/10/26, 06:14 AM

# Mobility Anomaly Generation using LLM-Driven Behavior with Kinematic Constraints
Source: [https://arxiv.org/abs/2606.10314](https://arxiv.org/abs/2606.10314)
[View PDF](https://arxiv.org/pdf/2606.10314)

> Abstract:Although the study of human trajectory anomalies is critical for advancing spatial data mining, empirical research remains severely hindered by a pervasive lack of ground\-truth datasets\. Despite the availability of several real\-world and simulated human trajectory collections, these datasets exclusively capture normal mobility patterns and lack annotated anomalies\. This specific scarcity is fundamentally driven by the inherent statistical rarity of anomalous events, precluding the feasibility of conventional observational methods\. Compounding this challenge, the systematic acquisition of large\-scale mobility data is strictly bottlenecked by prohibitive costs and stringent privacy regulations\. To overcome these fundamental limitations and establish a reliable human trajectory anomalies dataset with annotated ground truth, we introduce a novel, end\-to\-end generative framework designed to synthesize realistic trajectory anomalies at scale\. Our architecture bridges the gap between purely synthetic mobility data and complex real\-world physical constraints by operating directly on baseline simulated trajectories\. We employ Large Language Model \(LLM\) agents to systematically inject semantically meaningful behavioral anomalies such as irregular out\-of\-distribution check\-ins and skipped routine visits\. To ensure rigorous spatial validity, the system leverages map\-constrained routing reconstruction to recalculate the physical transitions between these LLM agent\-modified staypoints\. Moreover, to narrow the simulation\-to\-reality gap, we augment the resulting trajectories with a context\-aware spatial noise model, parameterized by environmental and location\-specific variables, to accurately emulate heterogeneous GPS sensor degradation\.

## Submission history

From: Yueyang Liu \[[view email](https://arxiv.org/show-email/a19a296b/2606.10314)\] **\[v1\]**Tue, 9 Jun 2026 02:09:02 UTC \(21,546 KB\)

Similar Articles

TrajGenAgent: A Hierarchical LLM Agent for Human Mobility Trajectory Generation

arXiv cs.AI

TrajGenAgent proposes a hierarchical LLM agent framework that decouples macro-level activity planning from micro-level spatiotemporal instantiation for realistic human mobility trajectory generation without fine-tuning. It also introduces an anomaly-detection-based evaluation for behavioral fidelity.

Mechanisms of Misgeneralization in Physical Sequence Modeling

arXiv cs.LG

This paper identifies and analyzes 'physical misgeneralization' in generative sequence models, where individual trajectories appear plausible but the aggregate distribution over physical quantities is incorrect, and proposes a kernel-informed mitigation.

Hidden Latent-State Shifts in LLMs: Why Current Alignment Is Blind to Real Internal Dangers — Especially With Agents

Reddit r/artificial

This paper demonstrates that LLMs can enter measurably different internal latent states under coherent context while maintaining aligned outputs, revealing a blind spot in current alignment methods that only monitor surface tokens. The Gemma-3-12B-IT experiment shows strong residual stream geometry shifts that existing safety frameworks cannot detect, with implications for agentic AI deployment.