MELGMLApr 24, 2025

SOFARI-R: High-Dimensional Manifold-Based Inference for Latent Responses

arXiv:2504.17874v1h-index: 29
Originality Incremental advance
AI Analysis

This work addresses a specific bottleneck in statistical inference for multi-task learning, offering incremental improvements for researchers in high-dimensional data analysis.

The paper tackles the challenge of designing a valid inference procedure for latent right factor vectors in high-dimensional multi-task learning, which is not straightforward due to asymmetry in singular vectors, by introducing SOFARI-R with two variants that produce bias-corrected estimators with asymptotically normal distributions.

Data reduction with uncertainty quantification plays a key role in various multi-task learning applications, where large numbers of responses and features are present. To this end, a general framework of high-dimensional manifold-based SOFAR inference (SOFARI) was introduced recently in Zheng, Zhou, Fan and Lv (2024) for interpretable multi-task learning inference focusing on the left factor vectors and singular values exploiting the latent singular value decomposition (SVD) structure. Yet, designing a valid inference procedure on the latent right factor vectors is not straightforward from that of the left ones and can be even more challenging due to asymmetry of left and right singular vectors in the response matrix. To tackle these issues, in this paper we suggest a new method of high-dimensional manifold-based SOFAR inference for latent responses (SOFARI-R), where two variants of SOFARI-R are introduced. The first variant deals with strongly orthogonal factors by coupling left singular vectors with the design matrix and then appropriately rescaling them to generate new Stiefel manifolds. The second variant handles the more general weakly orthogonal factors by employing the hard-thresholded SOFARI estimates and delicately incorporating approximation errors into the distribution. Both variants produce bias-corrected estimators for the latent right factor vectors that enjoy asymptotically normal distributions with justified asymptotic variance estimates. We demonstrate the effectiveness of the newly suggested method using extensive simulation studies and an economic application.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes