MEAISep 19, 2022

SMIXS: Novel efficient algorithm for non-parametric mixture regression-based clustering

arXiv:2209.09030v1h-index: 20
AI Analysis

This work addresses computational and stability issues in clustering for longitudinal data analysis, such as in atmospheric science, but is incremental as it builds on existing GMM and spline methods.

The authors tackled the problem of high computational complexity and numerical instability in non-parametric mixture regression-based clustering for longitudinal data by developing SMIXS, which incorporates speed-ups, a stable variance estimator, and a new smoothing parameter procedure, showing better clustering and regression performance than GMM on a synthetic dataset.

We investigate a novel non-parametric regression-based clustering algorithm for longitudinal data analysis. Combining natural cubic splines with Gaussian mixture models (GMM), the algorithm can produce smooth cluster means that describe the underlying data well. However, there are some shortcomings in the algorithm: high computational complexity in the parameter estimation procedure and a numerically unstable variance estimator. Therefore, to further increase the usability of the method, we incorporated approaches to reduce its computational complexity, we developed a new, more stable variance estimator, and we developed a new smoothing parameter estimation procedure. We show that the developed algorithm, SMIXS, performs better than GMM on a synthetic dataset in terms of clustering and regression performance. We demonstrate the impact of the computational speed-ups, which we formally prove in the new framework. Finally, we perform a case study by using SMIXS to cluster vertical atmospheric measurements to determine different weather regimes.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes