MEMLApr 26, 2017

SOFAR: large-scale association network learning

arXiv:1704.08349v142 citations
Originality Incremental advance
AI Analysis

This addresses the need for efficient statistical methods in big data applications, offering a novel approach for unsupervised and supervised learning tasks, though it appears incremental as it builds on existing sparse and orthogonal factor frameworks.

The paper tackles the challenge of learning large-scale response-predictor association networks by proposing SOFAR, a method that combines sparsity and orthogonality through sparse singular value decomposition with orthogonality constraints, achieving nonasymptotic error bounds and efficient convergence in simulations and real data.

Many modern big data applications feature large scale in both numbers of responses and predictors. Better statistical efficiency and scientific insights can be enabled by understanding the large-scale response-predictor association network structures via layers of sparse latent factors ranked by importance. Yet sparsity and orthogonality have been two largely incompatible goals. To accommodate both features, in this paper we suggest the method of sparse orthogonal factor regression (SOFAR) via the sparse singular value decomposition with orthogonality constrained optimization to learn the underlying association networks, with broad applications to both unsupervised and supervised learning tasks such as biclustering with sparse singular value decomposition, sparse principal component analysis, sparse factor analysis, and spare vector autoregression analysis. Exploiting the framework of convexity-assisted nonconvex optimization, we derive nonasymptotic error bounds for the suggested procedure characterizing the theoretical advantages. The statistical guarantees are powered by an efficient SOFAR algorithm with convergence property. Both computational and theoretical advantages of our procedure are demonstrated with several simulation and real data examples.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes