MLCVNov 6, 2017

Randomized Nonnegative Matrix Factorization

arXiv:1711.02037v255 citations
Originality Incremental advance
AI Analysis

This work addresses scalability issues in data mining for big data applications, representing an incremental improvement over existing deterministic methods.

The paper tackles the computational challenge of nonnegative matrix factorization (NMF) for big data by proposing a randomized hierarchical alternating least squares (HALS) algorithm, which achieves near-optimal factorization with substantial speedups compared to deterministic HALS.

Nonnegative matrix factorization (NMF) is a powerful tool for data mining. However, the emergence of `big data' has severely challenged our ability to compute this fundamental decomposition using deterministic algorithms. This paper presents a randomized hierarchical alternating least squares (HALS) algorithm to compute the NMF. By deriving a smaller matrix from the nonnegative input data, a more efficient nonnegative decomposition can be computed. Our algorithm scales to big data applications while attaining a near-optimal factorization. The proposed algorithm is evaluated using synthetic and real world data and shows substantial speedups compared to deterministic HALS.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes