Disjoint or Overlapping? Inference Windowing for Reconstruction-Based Time Series Anomaly Detection
Summary
This paper investigates the impact of inference stride (disjoint vs. overlapping windows) on reconstruction-based time series anomaly detection, finding that overlapping windows consistently improve performance by up to 28% across multiple model architectures on the TSB-AD and UCR benchmarks.
View Cached Full Text
Cached at: 06/10/26, 06:14 AM
# Disjoint or Overlapping? Inference Windowing for Reconstruction-Based Time Series Anomaly Detection Source: [https://arxiv.org/abs/2606.09874](https://arxiv.org/abs/2606.09874) [View PDF](https://arxiv.org/pdf/2606.09874) > Abstract:Reconstruction\-based methods are widely used for time series anomaly detection, where models are trained to reconstruct subsequences, and anomalies are identified through reconstruction errors\. However, reported results are often hard to compare due to heterogeneous evaluation practices and underspecified inference procedures\. In this paper, we revisit reconstruction\-based anomaly detection in the univariate offline setting and study the role of the inference stride, which controls whether subsequences are processed as disjoint windows or with overlap\. We propose a unified training, tuning, and multi\-seed evaluation protocol on the curated TSB\-AD benchmark, and study how overlapping inference affects anomaly detection performance for a range of reconstruction models, including PCA\-based baselines, DLinear, an AutoEncoder, TimesNet, and Transformer variants\. The results show that across all models, overlapping windows yield consistent improvements, with average relative gain up to \+28%, and can alter method rankings\. We further analyze variability across datasets, random seeds, and hyperparameter configurations\. Finally, we complement the benchmark study with an evaluation on the full UCR archive using localization criteria aligned with sliding\-window reconstruction\. Overall, our results highlight that reconstruction\-based anomaly detection performance depends not only on model architecture and training, but also on inference choices, motivating a clear and reproducible protocol\. Our results show that reconstructionbased baselines achieve strong performance on both TSB\-AD and UCR benchmarks, supporting them as competitive and practical approaches for univariate time series anomaly detection\. ## Submission history From: Guillaume Coulaud \[[view email](https://arxiv.org/show-email/beb68c6a/2606.09874)\] \[via CCSD proxy\] **\[v1\]**Tue, 2 Jun 2026 19:42:30 UTC \(441 KB\)
Similar Articles
Bridging Classification and Reconstruction: Cooperative Time Series Anomaly Detection
This paper proposes CoAD, a novel framework that unifies Outlier Exposure (classification) and Masked Autoencoder (reconstruction) paradigms for time series anomaly detection, addressing their respective limitations. Extensive experiments show that CoAD significantly outperforms state-of-the-art methods while being lightweight and fast.
TPA-AD: A Two-Stage Pseudo Anomaly-Guided Method for Bearing Time-Series Anomaly Detection
TPA-AD is a two-stage pseudo anomaly-guided method for bearing time-series anomaly detection that generates pseudo-anomalous windows near normal boundaries using reconstruction models and contrastive learning, then scores anomalies with KNN—without requiring real anomaly samples during training. It is evaluated on bearing fault and degradation datasets, including high-speed train axle-box bearing data.
Rethinking Structural Anomaly Detection: From Decision Boundaries to Projection Operators
This paper rethinks structural anomaly detection by shifting from decision boundaries to projection operators onto the low-dimensional manifold of normal data, showing that projection-aligned methods outperform existing boundary-based and reconstruction-based approaches.
Back to Repair: A Minimal Denoising Network\ for Time Series Anomaly Detection
This paper introduces JuRe (Just Repair), a minimal denoising network for time series anomaly detection that matches or exceeds complex neural baselines on the TSB-AD and UCR benchmarks, demonstrating that a proper manifold-projection training objective is more important than architectural complexity.
Anomalies in Multivariate Time Series Benchmarks Are Mostly Univariate
This paper introduces a diagnostic framework for multivariate time series anomaly detection benchmarks and finds that labeled anomalies are mostly detectable from individual channels, challenging the need for cross-channel modeling. The authors call for more structurally diverse evaluation sets.