MLLGDec 17, 2025

High-Dimensional Partial Least Squares: Spectral Analysis and Fundamental Limitations

arXiv:2512.15684v11 citationsh-index: 1
Originality Incremental advance
AI Analysis

This provides a theoretical foundation for a widely used data integration method, clarifying its advantages and limitations for researchers in statistics and machine learning.

The paper tackled the lack of theoretical understanding of Partial Least Squares (PLS) in high-dimensional regimes by analyzing its variant PLS-SVD, showing it asymptotically outperforms separate principal component analysis in detecting common latent subspaces.

Partial Least Squares (PLS) is a widely used method for data integration, designed to extract latent components shared across paired high-dimensional datasets. Despite decades of practical success, a precise theoretical understanding of its behavior in high-dimensional regimes remains limited. In this paper, we study a data integration model in which two high-dimensional data matrices share a low-rank common latent structure while also containing individual-specific components. We analyze the singular vectors of the associated cross-covariance matrix using tools from random matrix theory and derive asymptotic characterizations of the alignment between estimated and true latent directions. These results provide a quantitative explanation of the reconstruction performance of the PLS variant based on Singular Value Decomposition (PLS-SVD) and identify regimes where the method exhibits counter-intuitive or limiting behavior. Building on this analysis, we compare PLS-SVD with principal component analysis applied separately to each dataset and show its asymptotic superiority in detecting the common latent subspace. Overall, our results offer a comprehensive theoretical understanding of high-dimensional PLS-SVD, clarifying both its advantages and fundamental limitations.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes