MLLGFeb 26, 2025

Nonlinear Sparse Generalized Canonical Correlation Analysis for Multi-view High-dimensional Data

arXiv:2502.18756v1h-index: 4
Originality Incremental advance
AI Analysis

This work addresses the need for integrative analysis in biomedical studies, such as multi-omics, by extending existing methods to handle nonlinear dependence, sparsity, and multiple views simultaneously, though it is incremental as it builds on prior two-view techniques.

The authors tackled the problem of analyzing multi-view high-dimensional biomedical data by proposing three nonlinear sparse generalized canonical correlation analysis methods, with HSIC-SGCCA outperforming competitors in variable selection as shown in simulations and TCGA-BRCA data analysis.

Motivation: Biomedical studies increasingly produce multi-view high-dimensional datasets (e.g., multi-omics) that demand integrative analysis. Existing canonical correlation analysis (CCA) and generalized CCA methods address at most two of the following three key aspects simultaneously: (i) nonlinear dependence, (ii) sparsity for variable selection, and (iii) generalization to more than two data views. There is a pressing need for CCA methods that integrate all three aspects to effectively analyze multi-view high-dimensional data. Results: We propose three nonlinear, sparse, generalized CCA methods, HSIC-SGCCA, SA-KGCCA, and TS-KGCCA, for variable selection in multi-view high-dimensional data. These methods extend existing SCCA-HSIC, SA-KCCA, and TS-KCCA from two-view to multi-view settings. While SA-KGCCA and TS-KGCCA yield multi-convex optimization problems solved via block coordinate descent, HSIC-SGCCA introduces a necessary unit-variance constraint previously ignored in SCCA-HSIC, resulting in a nonconvex, non-multiconvex problem. We efficiently address this challenge by integrating the block prox-linear method with the linearized alternating direction method of multipliers. Simulations and TCGA-BRCA data analysis demonstrate that HSIC-SGCCA outperforms competing methods in multi-view variable selection.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes