LGCVMLMar 6, 2014

Sparse Principal Component Analysis via Rotation and Truncation

arXiv:1403.1430v244 citations
Originality Incremental advance
AI Analysis

This addresses the need for interpretable sparse PCA methods in data analysis, though it appears incremental as it builds on existing approaches.

The paper tackles the problem of sparse principal component analysis (sparse PCA) by proposing SPCArt, a method that finds a rotation matrix and sparse basis to approximate PCA basis after rotation, achieving state-of-the-art performance with a good tradeoff among criteria like sparsity and explained variance.

Sparse principal component analysis (sparse PCA) aims at finding a sparse basis to improve the interpretability over the dense basis of PCA, meanwhile the sparse basis should cover the data subspace as much as possible. In contrast to most of existing work which deal with the problem by adding some sparsity penalties on various objectives of PCA, in this paper, we propose a new method SPCArt, whose motivation is to find a rotation matrix and a sparse basis such that the sparse basis approximates the basis of PCA after the rotation. The algorithm of SPCArt consists of three alternating steps: rotate PCA basis, truncate small entries, and update the rotation matrix. Its performance bounds are also given. SPCArt is efficient, with each iteration scaling linearly with the data dimension. It is easy to choose parameters in SPCArt, due to its explicit physical explanations. Besides, we give a unified view to several existing sparse PCA methods and discuss the connection with SPCArt. Some ideas in SPCArt are extended to GPower, a popular sparse PCA algorithm, to overcome its drawback. Experimental results demonstrate that SPCArt achieves the state-of-the-art performance. It also achieves a good tradeoff among various criteria, including sparsity, explained variance, orthogonality, balance of sparsity among loadings, and computational speed.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes