LG AI OC MLFeb 25, 2018

DID: Distributed Incremental Block Coordinate Descent for Nonnegative Matrix Factorization

arXiv:1802.08938v14.112 citations

Originality Incremental advance

AI Analysis

This addresses the need for scalable NMF algorithms in distributed memory architectures, which is an incremental improvement for applications handling large-scale data.

The authors tackled the problem of scaling nonnegative matrix factorization (NMF) to large datasets distributed across multiple nodes by proposing a novel distributed algorithm called DID, which achieved efficient performance with only one communication step per iteration in numerical experiments.

Nonnegative matrix factorization (NMF) has attracted much attention in the last decade as a dimension reduction method in many applications. Due to the explosion in the size of data, naturally the samples are collected and stored distributively in local computational nodes. Thus, there is a growing need to develop algorithms in a distributed memory architecture. We propose a novel distributed algorithm, called \textit{distributed incremental block coordinate descent} (DID), to solve the problem. By adapting the block coordinate descent framework, closed-form update rules are obtained in DID. Moreover, DID performs updates incrementally based on the most recently updated residual matrix. As a result, only one communication step per iteration is required. The correctness, efficiency, and scalability of the proposed algorithm are verified in a series of numerical experiments.

View on arXiv PDF

Similar