LG · AI · Nov 20, 2019

Discovering Subdimensional Motifs of Different Lengths in Large-Scale Multivariate Time Series

arXiv:1911.09218v12.715 citations · h-index: 35 · Has Code

Originality: Incremental advance

AI Analysis

This work addresses a scalability bottleneck for researchers and practitioners analyzing large multivariate time series, though it is incremental as it builds on existing motif discovery approaches.

The paper tackles the problem of detecting variable-length subdimensional motifs in large-scale multivariate time series, which is challenging due to scalability issues, and introduces the CHIME algorithm that significantly reduces memory cost and increases speed compared to state-of-the-art methods.

Detecting repeating patterns of different lengths in time series, also called variable-length motifs, has received a great deal of attention from researchers and practitioners. Despite the significant progress made in recent work on single-dimensional variable-length motif discovery, detecting variable-length subdimensional motifs (patterns that occur simultaneously in only a subset of the dimensions of a multivariate time series) remains a difficult task. The main challenge is scalability. On the one hand, the brute-force enumeration solution, which searches for motifs of all possible lengths, is very time consuming even in single-dimensional time series. On the other hand, previous work shows that index-based fixed-length approximate motif discovery algorithms such as random projection are not suitable for detecting variable-length motifs because of their memory requirements. In this paper, we introduce an approximate variable-length subdimensional motif discovery algorithm called Collaborative HIerarchy based Motif Enumeration (CHIME) to efficiently detect variable-length subdimensional motifs, given a minimum motif length, in large-scale multivariate time series. We show that the memory cost of the approach is significantly smaller than that of random projection. Moreover, the proposed algorithm is significantly faster than state-of-the-art algorithms. We demonstrate that CHIME can efficiently detect meaningful variable-length subdimensional motifs in large real-world multivariate time series datasets.
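To make the scalability concern concrete, here is a minimal sketch of the brute-force baseline the abstract refers to, restricted to a single dimension. It is not CHIME and is not taken from the paper. The function names, the use of z-normalized Euclidean distance, and the rule that matched windows must not overlap are all assumptions made for illustration.

```python
import math


def znorm(window):
    """Z-normalize a window; a constant window maps to all zeros."""
    mean = sum(window) / len(window)
    sd = math.sqrt(sum((v - mean) ** 2 for v in window) / len(window))
    if sd == 0:
        return [0.0] * len(window)
    return [(v - mean) / sd for v in window]


def brute_force_motif(series, length):
    """Return (distance, i, j) for the closest pair of non-overlapping windows.

    Hypothetical helper: compares every pair of windows of the given length,
    so the number of distance computations grows quadratically with len(series).
    """
    n = len(series) - length + 1
    windows = [znorm(series[i:i + length]) for i in range(n)]
    best = (math.inf, -1, -1)
    for i in range(n):
        # Start j at i + length so the two windows never overlap.
        for j in range(i + length, n):
            d = math.dist(windows[i], windows[j])
            if d < best[0]:
                best = (d, i, j)
    return best


def brute_force_variable_length(series, min_length, max_length):
    """Repeat the fixed-length search once per candidate length."""
    return {
        m: brute_force_motif(series, m)
        for m in range(min_length, max_length + 1)
    }
```

Each call to `brute_force_motif` is already quadratic in the series length, and `brute_force_variable_length` repeats it for every candidate length. A subdimensional search would additionally have to consider subsets of dimensions, which this sketch does not attempt.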

