MLLGNov 19, 2015

Canonical Autocorrelation Analysis

arXiv:1511.06419v11 citations
Originality Incremental advance
AI Analysis

This method addresses the need for interpretable, unsupervised correlation analysis in domains like medical diagnosis and security, though it appears incremental as an extension of existing CCA techniques.

The authors tackled the problem of finding multiple-to-multiple linear correlations within a single set of variables, introducing Canonical Autocorrelation Analysis (CAA) as an extension of sparse CCA, and demonstrated its utility in anomaly detection with competitive performance on breast cancer data and significant improvements in radiation threat detection.

We present an extension of sparse Canonical Correlation Analysis (CCA) designed for finding multiple-to-multiple linear correlations within a single set of variables. Unlike CCA, which finds correlations between two sets of data where the rows are matched exactly but the columns represent separate sets of variables, the method proposed here, Canonical Autocorrelation Analysis (CAA), finds multivariate correlations within just one set of variables. This can be useful when we look for hidden parsimonious structures in data, each involving only a small subset of all features. In addition, the discovered correlations are highly interpretable as they are formed by pairs of sparse linear combinations of the original features. We show how CAA can be of use as a tool for anomaly detection when the expected structure of correlations is not followed by anomalous data. We illustrate the utility of CAA in two application domains where single-class and unsupervised learning of correlation structures are particularly relevant: breast cancer diagnosis and radiation threat detection. When applied to the Wisconsin Breast Cancer data, single-class CAA is competitive with supervised methods used in literature. On the radiation threat detection task, unsupervised CAA performs significantly better than an unsupervised alternative prevalent in the domain, while providing valuable additional insights for threat analysis.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes