LG · AI · Mar 24, 2025

RoCA: Robust Contrastive One-class Time Series Anomaly Detection with Contaminated Data

arXiv:2503.18385v1 · 4 citations · h-index: 26 · Has Code
Originality: Incremental advance
AI Analysis

The paper addresses a practical limitation of time-series anomaly detection: training data often contains anomalies. The advance is incremental, improving robustness within this specific domain.

The paper tackles time-series anomaly detection with contaminated training data by proposing RoCA, a robust approach that fuses one-class classification and contrastive learning. It reports improvements of 6% on AIOps datasets and 5-10% on multivariate datasets over methods that ignore contamination.

The accumulation of time-series signals and the absence of labels make time-series anomaly detection (AD) a self-supervised deep learning task. Methods based on normality assumptions face three limitations: (1) a single assumption can hardly characterize normality as a whole and may introduce some deviation; (2) some assumptions may run counter to the principle of AD; (3) they assume that the training data is uncontaminated (free of anomalies), which is unrealistic in practice and reduces robustness. This paper proposes RoCA, a novel robust approach that is, as far as we are aware, the first to address all three challenges. It fuses the separate assumptions of one-class classification and contrastive learning in a single training process to characterize normality more completely. In addition, it monitors the training data and computes a carefully designed anomaly score throughout training. This score helps identify latent anomalies, which are then used to define the classification boundary, inspired by the concept of outlier exposure. On AIOps datasets, performance improved by 6% compared with COCA, which does not consider contamination. On two large, high-dimensional multivariate datasets, performance increased by 5% to 10%. RoCA achieves the highest average performance on both univariate and multivariate datasets. The source code is available at https://github.com/ruiking04/RoCA.
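The idea described in the abstract (a score that combines a one-class term with a contrastive term, used during training to flag suspected anomalies that are then pushed away from normality in the style of outlier exposure) can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the function names, the cosine-based score, the `contamination` parameter, and the constant `MAX_SCORE` are all assumptions made for illustration. The actual method is in the linked repository.

```python
import numpy as np

MAX_SCORE = 4.0  # assumption: upper bound of the illustrative score below


def cosine(a, b, eps=1e-8):
    """Row-wise cosine similarity; `b` may be (n, d) or a single (d,) vector."""
    b = np.broadcast_to(b, a.shape)
    num = np.sum(a * b, axis=1)
    den = np.linalg.norm(a, axis=1) * np.linalg.norm(b, axis=1) + eps
    return num / den


def anomaly_scores(z1, z2, center):
    """Hypothetical score over two embedded views z1, z2 of each sample.

    one-class term: views should lie close to a shared center.
    contrastive term: the two views of a sample should agree.
    """
    one_class = 1.0 - 0.5 * (cosine(z1, center) + cosine(z2, center))
    contrastive = 1.0 - cosine(z1, z2)
    return one_class + contrastive


def flag_latent_anomalies(scores, contamination=0.05):
    """Flag the highest-scoring fraction of samples as suspected anomalies."""
    k = int(round(contamination * len(scores)))
    flagged = np.zeros(len(scores), dtype=bool)
    if k > 0:
        flagged[np.argsort(scores)[-k:]] = True
    return flagged


def robust_loss(z1, z2, center, flagged):
    """Lower the score of presumed-normal samples and raise it for flagged ones."""
    s = anomaly_scores(z1, z2, center)
    normal_term = s[~flagged].mean() if (~flagged).any() else 0.0
    exposure_term = (MAX_SCORE - s[flagged]).mean() if flagged.any() else 0.0
    return normal_term + exposure_term
```

In a training loop, one would recompute the scores each epoch, refresh the flagged set, and minimize `robust_loss` with respect to the encoder that produces `z1` and `z2`. The fixed `contamination` rate is a simplification chosen for this sketch.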

Code Implementations: 1 repo
