LGAIOCMLFeb 25, 2018

DID: Distributed Incremental Block Coordinate Descent for Nonnegative Matrix Factorization

arXiv:1802.08938v112 citations
Originality Incremental advance
AI Analysis

This addresses the need for scalable NMF algorithms in distributed memory architectures, which is an incremental improvement for applications handling large-scale data.

The authors tackled the problem of scaling nonnegative matrix factorization (NMF) to large datasets distributed across multiple nodes by proposing a novel distributed algorithm called DID, which achieved efficient performance with only one communication step per iteration in numerical experiments.

Nonnegative matrix factorization (NMF) has attracted much attention in the last decade as a dimension reduction method in many applications. Due to the explosion in the size of data, naturally the samples are collected and stored distributively in local computational nodes. Thus, there is a growing need to develop algorithms in a distributed memory architecture. We propose a novel distributed algorithm, called \textit{distributed incremental block coordinate descent} (DID), to solve the problem. By adapting the block coordinate descent framework, closed-form update rules are obtained in DID. Moreover, DID performs updates incrementally based on the most recently updated residual matrix. As a result, only one communication step per iteration is required. The correctness, efficiency, and scalability of the proposed algorithm are verified in a series of numerical experiments.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes