LGMLJul 30, 2020

Anomaly Detection at Scale: The Case for Deep Distributional Time Series Models

arXiv:2007.15541v112 citationsHas Code
Originality Incremental advance
AI Analysis

This addresses the problem of scalable anomaly detection for cloud resource monitoring, offering incremental improvements over prior methods.

The paper tackles anomaly detection in time series data, particularly for monitoring cloud services, by modeling time series of probability distributions instead of real values, achieving up to 17% average improvement over existing tools on real-world data.

This paper introduces a new methodology for detecting anomalies in time series data, with a primary application to monitoring the health of (micro-) services and cloud resources. The main novelty in our approach is that instead of modeling time series consisting of real values or vectors of real values, we model time series of probability distributions over real values (or vectors). This extension to time series of probability distributions allows the technique to be applied to the common scenario where the data is generated by requests coming in to a service, which is then aggregated at a fixed temporal frequency. Our method is amenable to streaming anomaly detection and scales to monitoring for anomalies on millions of time series. We show the superior accuracy of our method on synthetic and public real-world data. On the Yahoo Webscope data set, we outperform the state of the art in 3 out of 4 data sets and we show that we outperform popular open-source anomaly detection tools by up to 17% average improvement for a real-world data set.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes