Mobility Anomaly Generation using LLM-Driven Behavior with Kinematic Constraints
Summary
Introduces a generative framework that uses LLM agents to inject behavioral anomalies into simulated trajectories and applies kinematic and map constraints to produce realistic anomalous mobility data with ground truth.
View Cached Full Text
Cached at: 06/10/26, 06:14 AM
# Mobility Anomaly Generation using LLM-Driven Behavior with Kinematic Constraints Source: [https://arxiv.org/abs/2606.10314](https://arxiv.org/abs/2606.10314) [View PDF](https://arxiv.org/pdf/2606.10314) > Abstract:Although the study of human trajectory anomalies is critical for advancing spatial data mining, empirical research remains severely hindered by a pervasive lack of ground\-truth datasets\. Despite the availability of several real\-world and simulated human trajectory collections, these datasets exclusively capture normal mobility patterns and lack annotated anomalies\. This specific scarcity is fundamentally driven by the inherent statistical rarity of anomalous events, precluding the feasibility of conventional observational methods\. Compounding this challenge, the systematic acquisition of large\-scale mobility data is strictly bottlenecked by prohibitive costs and stringent privacy regulations\. To overcome these fundamental limitations and establish a reliable human trajectory anomalies dataset with annotated ground truth, we introduce a novel, end\-to\-end generative framework designed to synthesize realistic trajectory anomalies at scale\. Our architecture bridges the gap between purely synthetic mobility data and complex real\-world physical constraints by operating directly on baseline simulated trajectories\. We employ Large Language Model \(LLM\) agents to systematically inject semantically meaningful behavioral anomalies such as irregular out\-of\-distribution check\-ins and skipped routine visits\. To ensure rigorous spatial validity, the system leverages map\-constrained routing reconstruction to recalculate the physical transitions between these LLM agent\-modified staypoints\. Moreover, to narrow the simulation\-to\-reality gap, we augment the resulting trajectories with a context\-aware spatial noise model, parameterized by environmental and location\-specific variables, to accurately emulate heterogeneous GPS sensor degradation\. ## Submission history From: Yueyang Liu \[[view email](https://arxiv.org/show-email/a19a296b/2606.10314)\] **\[v1\]**Tue, 9 Jun 2026 02:09:02 UTC \(21,546 KB\)
Similar Articles
TrajGenAgent: A Hierarchical LLM Agent for Human Mobility Trajectory Generation
TrajGenAgent proposes a hierarchical LLM agent framework that decouples macro-level activity planning from micro-level spatiotemporal instantiation for realistic human mobility trajectory generation without fine-tuning. It also introduces an anomaly-detection-based evaluation for behavioral fidelity.
When Plausible Is Not Realistic: Evaluating Human Mobility in LLM-Based Urban Simulation
This paper introduces a validation framework to evaluate whether LLM-based urban simulators reproduce empirical human mobility patterns. Using data from Paris and Shanghai, the authors find a substantial gap between plausible narratives and realistic mobility constraints, and provide open infrastructure for reproducible evaluation.
Your Agent Has a Genome: Sequence-Level Behavioral Analysis and Runtime Governance of LLM-Powered Autonomous Agents
This paper introduces Base Sequence Analysis, a framework that encodes LLM agent runtime behavior into compact sequences, revealing high-risk patterns like the 'P-X-P' trigram and a verification deficit. It presents Governor, a runtime intervention system that improves task success by 6.2% and reduces token consumption by 44%.
Mechanisms of Misgeneralization in Physical Sequence Modeling
This paper identifies and analyzes 'physical misgeneralization' in generative sequence models, where individual trajectories appear plausible but the aggregate distribution over physical quantities is incorrect, and proposes a kernel-informed mitigation.
Hidden Latent-State Shifts in LLMs: Why Current Alignment Is Blind to Real Internal Dangers — Especially With Agents
This paper demonstrates that LLMs can enter measurably different internal latent states under coherent context while maintaining aligned outputs, revealing a blind spot in current alignment methods that only monitor surface tokens. The Gemma-3-12B-IT experiment shows strong residual stream geometry shifts that existing safety frameworks cannot detect, with implications for agentic AI deployment.