AILGFeb 12, 2017

Similarity Preserving Representation Learning for Time Series Clustering

arXiv:1702.03584v324 citations
AI Analysis

This addresses the challenge of applying standard clustering algorithms to time series data, which is incremental as it adapts existing methods rather than introducing a new paradigm.

The paper tackles the problem of clustering time series data by proposing a representation learning framework that converts time series of varying lengths into an instance-feature matrix while preserving pairwise similarities, resulting in more effective, efficient, and flexible clustering compared to state-of-the-art methods.

A considerable amount of clustering algorithms take instance-feature matrices as their inputs. As such, they cannot directly analyze time series data due to its temporal nature, usually unequal lengths, and complex properties. This is a great pity since many of these algorithms are effective, robust, efficient, and easy to use. In this paper, we bridge this gap by proposing an efficient representation learning framework that is able to convert a set of time series with various lengths to an instance-feature matrix. In particular, we guarantee that the pairwise similarities between time series are well preserved after the transformation, thus the learned feature representation is particularly suitable for the time series clustering task. Given a set of $n$ time series, we first construct an $n\times n$ partially-observed similarity matrix by randomly sampling $\mathcal{O}(n \log n)$ pairs of time series and computing their pairwise similarities. We then propose an efficient algorithm that solves a non-convex and NP-hard problem to learn new features based on the partially-observed similarity matrix. By conducting extensive empirical studies, we show that the proposed framework is more effective, efficient, and flexible, compared to other state-of-the-art time series clustering methods.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes