MLLGDec 20, 2018

Low-rank Interaction with Sparse Additive Effects Model for Large Data Frames

arXiv:1812.08398v15 citations
Originality Incremental advance
AI Analysis

This work addresses the limitation in low-rank models for large-scale data analysis by enabling explicit modeling of additive effects, which is incremental but improves practical applications like visualization and imputation.

The paper tackles the problem of analyzing large, heterogeneous data frames with missing values by introducing the LORIS model, which simultaneously estimates main additive effects and low-rank interactions, and demonstrates a clear advantage over existing preprocessing methods on simulated and survey data.

Many applications of machine learning involve the analysis of large data frames-matrices collecting heterogeneous measurements (binary, numerical, counts, etc.) across samples-with missing values. Low-rank models, as studied by Udell et al. [30], are popular in this framework for tasks such as visualization, clustering and missing value imputation. Yet, available methods with statistical guarantees and efficient optimization do not allow explicit modeling of main additive effects such as row and column, or covariate effects. In this paper, we introduce a low-rank interaction and sparse additive effects (LORIS) model which combines matrix regression on a dictionary and low-rank design, to estimate main effects and interactions simultaneously. We provide statistical guarantees in the form of upper bounds on the estimation error of both components. Then, we introduce a mixed coordinate gradient descent (MCGD) method which provably converges sub-linearly to an optimal solution and is computationally efficient for large scale data sets. We show on simulated and survey data that the method has a clear advantage over current practices, which consist in dealing separately with additive effects in a preprocessing step.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes