OC LG MLJun 6, 2012

Factoring nonnegative matrices with linear programs

Victor Bittorf, Benjamin Recht, Christopher Re, Joel A. Tropp

arXiv:1206.1270v2206 citations

Originality Incremental advance

AI Analysis

This provides a scalable and efficient solution for NMF in data analysis, though it appears incremental as it builds on prior theoretical guarantees.

The paper tackles the problem of computing nonnegative matrix factorizations (NMFs) by introducing a linear programming-based method that selects salient features from data to approximate the matrix, with experiments showing it can factor multigigabyte matrices in minutes.

This paper describes a new approach, based on linear programming, for computing nonnegative matrix factorizations (NMFs). The key idea is a data-driven model for the factorization where the most salient features in the data are used to express the remaining features. More precisely, given a data matrix X, the algorithm identifies a matrix C such that X approximately equals CX and some linear constraints. The constraints are chosen to ensure that the matrix C selects features; these features can then be used to find a low-rank NMF of X. A theoretical analysis demonstrates that this approach has guarantees similar to those of the recent NMF algorithm of Arora et al. (2012). In contrast with this earlier work, the proposed method extends to more general noise models and leads to efficient, scalable algorithms. Experiments with synthetic and real datasets provide evidence that the new approach is also superior in practice. An optimized C++ implementation can factor a multigigabyte matrix in a matter of minutes.

View on arXiv PDF

Similar