MLLGFeb 20, 2013

Fast methods for denoising matrix completion formulations, with applications to robust seismic data interpolation

arXiv:1302.4886v376 citations
AI Analysis

This work addresses practical challenges in seismic data interpolation and collaborative filtering by providing a user-friendly and robust method for large-scale matrix completion.

The paper tackles the problem of large-scale matrix completion for applications like seismic data interpolation by proposing an algorithm, LR-BPDN, that allows users to specify a target error level, simplifying practical use. It demonstrates advantages on collaborative filtering datasets and achieves high-quality reconstructions for seismic data, even with data contamination.

Recent SVD-free matrix factorization formulations have enabled rank minimization for systems with millions of rows and columns, paving the way for matrix completion in extremely large-scale applications, such as seismic data interpolation. In this paper, we consider matrix completion formulations designed to hit a target data-fitting error level provided by the user, and propose an algorithm called LR-BPDN that is able to exploit factorized formulations to solve the corresponding optimization problem. Since practitioners typically have strong prior knowledge about target error level, this innovation makes it easy to apply the algorithm in practice, leaving only the factor rank to be determined. Within the established framework, we propose two extensions that are highly relevant to solving practical challenges of data interpolation. First, we propose a weighted extension that allows known subspace information to improve the results of matrix completion formulations. We show how this weighting can be used in the context of frequency continuation, an essential aspect to seismic data interpolation. Second, we propose matrix completion formulations that are robust to large measurement errors in the available data. We illustrate the advantages of LR-BPDN on the collaborative filtering problem using the MovieLens 1M, 10M, and Netflix 100M datasets. Then, we use the new method, along with its robust and subspace re-weighted extensions, to obtain high-quality reconstructions for large scale seismic interpolation problems with real data, even in the presence of data contamination.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes