MELGMLSep 22, 2021

Quantile-based fuzzy C-means clustering of multivariate time series: Robust techniques

arXiv:2109.11027v130 citations
Originality Incremental advance
AI Analysis

This work addresses robust clustering for time series data in fields like finance and environment, but it is incremental as it builds on existing fuzzy C-means and quantile techniques.

The authors tackled the problem of clustering multivariate time series robustly in the presence of outliers by proposing three robust fuzzy C-means methods based on quantile cross-spectral density and PCA, achieving substantial effectiveness in simulations with clear outperformance over alternatives.

Three robust methods for clustering multivariate time series from the point of view of generating processes are proposed. The procedures are robust versions of a fuzzy C-means model based on: (i) estimates of the quantile cross-spectral density and (ii) the classical principal component analysis. Robustness to the presence of outliers is achieved by using the so-called metric, noise and trimmed approaches. The metric approach incorporates in the objective function a distance measure aimed at neutralizing the effect of the outliers, the noise approach builds an artificial cluster expected to contain the outlying series and the trimmed approach eliminates the most atypical series in the dataset. All the proposed techniques inherit the nice properties of the quantile cross-spectral density, as being able to uncover general types of dependence. Results from a broad simulation study including multivariate linear, nonlinear and GARCH processes indicate that the algorithms are substantially effective in coping with the presence of outlying series (i.e., series exhibiting a dependence structure different from that of the majority), clearly poutperforming alternative procedures. The usefulness of the suggested methods is highlighted by means of two specific applications regarding financial and environmental series.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes