LGMLAug 13, 2023

Weighted Sparse Partial Least Squares for Joint Sample and Feature Selection

arXiv:2308.06740v1h-index: 72
Originality Incremental advance
AI Analysis

This is an incremental improvement for data fusion in fields like biomedical analysis, enabling outlier removal and sample subset identification.

The paper tackled the problem of sparse Partial Least Squares (sPLS) being unable to detect latent subsets of samples by proposing an ℓ∞/ℓ0-norm constrained weighted sparse PLS method for joint sample and feature selection, and extended it to multi-view data fusion with efficient algorithms and experimental validation.

Sparse Partial Least Squares (sPLS) is a common dimensionality reduction technique for data fusion, which projects data samples from two views by seeking linear combinations with a small number of variables with the maximum variance. However, sPLS extracts the combinations between two data sets with all data samples so that it cannot detect latent subsets of samples. To extend the application of sPLS by identifying a specific subset of samples and remove outliers, we propose an $\ell_\infty/\ell_0$-norm constrained weighted sparse PLS ($\ell_\infty/\ell_0$-wsPLS) method for joint sample and feature selection, where the $\ell_\infty/\ell_0$-norm constrains are used to select a subset of samples. We prove that the $\ell_\infty/\ell_0$-norm constrains have the Kurdyka-Ł{ojasiewicz}~property so that a globally convergent algorithm is developed to solve it. Moreover, multi-view data with a same set of samples can be available in various real problems. To this end, we extend the $\ell_\infty/\ell_0$-wsPLS model and propose two multi-view wsPLS models for multi-view data fusion. We develop an efficient iterative algorithm for each multi-view wsPLS model and show its convergence property. As well as numerical and biomedical data experiments demonstrate the efficiency of the proposed methods.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes