LGAIDBOct 31, 2023

Raising the ClaSS of Streaming Time Series Segmentation

arXiv:2310.20431v39 citationsh-index: 19
Originality Highly original
AI Analysis

This addresses the need for efficient and accurate segmentation of high-frequency sensor data streams, which is crucial for monitoring processes in various domains like human, industrial, and natural systems.

The authors tackled the problem of streaming time series segmentation by introducing ClaSS, a novel algorithm that uses self-supervised classification and statistical tests to detect change points, achieving significantly higher precision than eight state-of-the-art competitors in benchmarks and real-world data.

Ubiquitous sensors today emit high frequency streams of numerical measurements that reflect properties of human, animal, industrial, commercial, and natural processes. Shifts in such processes, e.g. caused by external events or internal state changes, manifest as changes in the recorded signals. The task of streaming time series segmentation (STSS) is to partition the stream into consecutive variable-sized segments that correspond to states of the observed processes or entities. The partition operation itself must in performance be able to cope with the input frequency of the signals. We introduce ClaSS, a novel, efficient, and highly accurate algorithm for STSS. ClaSS assesses the homogeneity of potential partitions using self-supervised time series classification and applies statistical tests to detect significant change points (CPs). In our experimental evaluation using two large benchmarks and six real-world data archives, we found ClaSS to be significantly more precise than eight state-of-the-art competitors. Its space and time complexity is independent of segment sizes and linear only in the sliding window size. We also provide ClaSS as a window operator with an average throughput of 1k data points per second for the Apache Flink streaming engine.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes