MLLGAPCOMEOct 12, 2021

Deviance Matrix Factorization

arXiv:2110.05674v39 citations
Originality Incremental advance
AI Analysis

This work offers a more general matrix factorization method for researchers in fields like image recognition and NLP, though it builds incrementally on existing statistical techniques.

The authors tackled the problem of extending matrix factorization beyond squared error loss by proposing a deviance-based method that leverages generalized linear models, providing theoretical consistency and practical algorithms. Their approach demonstrated improved flexibility and robustness in simulations and case studies across multiple domains.

We investigate a general matrix factorization for deviance-based data losses, extending the ubiquitous singular value decomposition beyond squared error loss. While similar approaches have been explored before, our method leverages classical statistical methodology from generalized linear models (GLMs) and provides an efficient algorithm that is flexible enough to allow for structural zeros via entry weights. Moreover, by adapting results from GLM theory, we provide support for these decompositions by (i) showing strong consistency under the GLM setup, (ii) checking the adequacy of a chosen exponential family via a generalized Hosmer-Lemeshow test, and (iii) determining the rank of the decomposition via a maximum eigenvalue gap method. To further support our findings, we conduct simulation studies to assess robustness to decomposition assumptions and extensive case studies using benchmark datasets from image face recognition, natural language processing, network analysis, and biomedical studies. Our theoretical and empirical results indicate that the proposed decomposition is more flexible, general, and robust, and can thus provide improved performance when compared to similar methods. To facilitate applications, an R package with efficient model fitting and family and rank determination is also provided.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes