OCIRJan 4, 2012

Two Algorithms for Orthogonal Nonnegative Matrix Factorization with Application to Clustering

arXiv:1201.0901v2167 citations
AI Analysis

This work addresses clustering tasks such as document classification by improving ONMF algorithms, but it appears incremental as it builds on existing ONMF techniques.

The paper tackles the problem of orthogonal nonnegative matrix factorization (ONMF) for clustering by introducing two new algorithms: an EM-like method derived from equivalence with weighted spherical k-means and an augmented Lagrangian approach that enforces orthogonality strictly at each step. The results show that these methods compare favorably with standard ONMF algorithms on synthetic, text, and image datasets, though no concrete numbers are provided.

Approximate matrix factorization techniques with both nonnegativity and orthogonality constraints, referred to as orthogonal nonnegative matrix factorization (ONMF), have been recently introduced and shown to work remarkably well for clustering tasks such as document classification. In this paper, we introduce two new methods to solve ONMF. First, we show athematical equivalence between ONMF and a weighted variant of spherical k-means, from which we derive our first method, a simple EM-like algorithm. This also allows us to determine when ONMF should be preferred to k-means and spherical k-means. Our second method is based on an augmented Lagrangian approach. Standard ONMF algorithms typically enforce nonnegativity for their iterates while trying to achieve orthogonality at the limit (e.g., using a proper penalization term or a suitably chosen search direction). Our method works the opposite way: orthogonality is strictly imposed at each step while nonnegativity is asymptotically obtained, using a quadratic penalty. Finally, we show that the two proposed approaches compare favorably with standard ONMF algorithms on synthetic, text and image data sets.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes