LGOCSep 27, 2012

Learning Robust Low-Rank Representations

arXiv:1209.6393v14 citations
Originality Incremental advance
AI Analysis

This work addresses the computational bottleneck of RPCA for applications in image, audio, and video processing, offering a practical solution for real-time use.

The paper tackles the problem of efficiently approximating robust principal component analysis (RPCA) by developing a trainable encoder framework that combines sparse coding and structured non-convex optimization, achieving several orders of magnitude speedup with minimal performance loss and demonstrating state-of-the-art results in music source separation.

In this paper we present a comprehensive framework for learning robust low-rank representations by combining and extending recent ideas for learning fast sparse coding regressors with structured non-convex optimization techniques. This approach connects robust principal component analysis (RPCA) with dictionary learning techniques and allows its approximation via trainable encoders. We propose an efficient feed-forward architecture derived from an optimization algorithm designed to exactly solve robust low dimensional projections. This architecture, in combination with different training objective functions, allows the regressors to be used as online approximants of the exact offline RPCA problem or as RPCA-based neural networks. Simple modifications of these encoders can handle challenging extensions, such as the inclusion of geometric data transformations. We present several examples with real data from image, audio, and video processing. When used to approximate RPCA, our basic implementation shows several orders of magnitude speedup compared to the exact solvers with almost no performance degradation. We show the strength of the inclusion of learning to the RPCA approach on a music source separation application, where the encoders outperform the exact RPCA algorithms, which are already reported to produce state-of-the-art results on a benchmark database. Our preliminary implementation on an iPad shows faster-than-real-time performance with minimal latency.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes