CVMay 12

Learning Subspace-Preserving Sparse Attention Graphs from Heterogeneous Multiview Data

Jie Chen, Yuanbiao Gou, Chuanbin Liu, Zhu Wang, Xi Peng

arXiv:2605.1188112.3

Predicted impact top 76% in CV · last 90 daysOriginality Incremental advance

AI Analysis

For unsupervised transfer learning, SAGL addresses the challenge of preserving intrinsic subspace structures in heterogeneous multiview data, offering a new approach that consistently beats existing methods.

The paper proposes SAGL, a method that learns subspace-preserving sparse attention graphs from heterogeneous multiview data, outperforming state-of-the-art unsupervised transfer learning approaches on multiple benchmarks.

The high-dimensional features extracted from large-scale unlabeled data via various pretrained models with diverse architectures are referred to as heterogeneous multiview data. Most existing unsupervised transfer learning methods fail to faithfully recover intrinsic subspace structures when exploiting complementary information across multiple views. Therefore, a fundamental challenge involves constructing sparse similarity graphs that preserve these underlying subspace structures for achieving semantic alignment across heterogeneous views. In this paper, we propose a sparse attention graph learning (SAGL) method that learns subspace-preserving sparse attention graphs from heterogeneous multiview data. Specifically, we introduce a bilinear attention factorization scheme to capture asymmetric similarities among the high-dimensional features, which breaks the symmetry bottleneck that is inherent in the traditional representation learning techniques. A dynamic sparsity gating mechanism then predicts a feature-specific compression factor for adaptively controlling the topological contributions of neighbors. Furthermore, we employ a structured sparse projection via $α$-entmax to generate subspace-preserving sparse attention graphs for individual views. SAGL leverages these view-specific graphs to conduct sparse information aggregation, yielding discriminative representations for multiview learning tasks. In addition, we provide a rigorous theoretical analysis that bridges differentiable sparse attention and probability simplex constraints. Extensive experiments conducted on multiple benchmark datasets demonstrate that SAGL consistently outperforms the state-of-the-art unsupervised transfer learning approaches.

View on arXiv PDF

Similar