NA LG Feb 28, 2018

NETT: Solving Inverse Problems with Deep Neural Networks

Housen Li, Johannes Schwab, Stephan Antholzer, Markus Haltmeier

arXiv:1803.00092v3 27.8282 citations

Originality: Incremental advance

AI Analysis

The paper provides a theoretical framework for deep learning methods in inverse problems, addressing a gap for researchers in computational imaging and machine learning. The advance is incremental in that it builds on existing Tikhonov regularization concepts.

The paper tackles the lack of theoretical foundations for deep learning in inverse problems by establishing a complete convergence analysis for the NETT approach, which uses neural network-based regularizers. Numerical tests on a tomographic sparse data problem show good performance, even for unknowns that differ in type from the training data.

Recovering a function or high-dimensional parameter vector from indirect measurements is a central task in various scientific areas. Several methods for solving such inverse problems are well developed and well understood. Recently, novel algorithms using deep learning and neural networks for inverse problems have appeared. While still in their infancy, these techniques show astonishing performance for applications like low-dose CT or various sparse data problems. However, there are few theoretical results for deep learning in inverse problems. In this paper, we establish a complete convergence analysis for the proposed NETT (Network Tikhonov) approach to inverse problems. NETT considers data-consistent solutions having a small value of a regularizer defined by a trained neural network. We derive well-posedness results and quantitative error estimates, and propose a possible strategy for training the regularizer. Our theoretical results and framework are different from any previous work using neural networks for solving inverse problems. A possible data-driven regularizer is proposed. Numerical results are presented for a tomographic sparse data problem, which demonstrate good performance of NETT even for unknowns of a different type from the training data. To derive the convergence and convergence-rate results, we introduce a new framework based on the absolute Bregman distance, generalizing the standard Bregman distance from the convex to the non-convex case.
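
The idea in the abstract can be illustrated with a minimal sketch, which is not the authors' implementation. It assumes a linear forward operator `A`, a squared-norm data-consistency term, and a tiny fixed random "network" `W` standing in for a trained regularizer; all names, sizes, and the plain gradient descent with backtracking are hypothetical choices made for illustration. The sketch minimizes a Tikhonov-type objective (data consistency plus `alpha` times the regularizer) and also defines an absolute Bregman distance as the absolute value of the usual Bregman expression, which is how the abstract's generalization to non-convex regularizers is read here.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical toy setup (not from the paper): an underdetermined linear
# forward operator A and a small fixed "network" with random weights W.
A = rng.normal(size=(20, 40))
W = rng.normal(size=(30, 40)) / np.sqrt(40)

def regularizer(x):
    """R(x) = 0.5 * ||tanh(W x)||^2, a non-convex stand-in for a trained network."""
    return 0.5 * np.sum(np.tanh(W @ x) ** 2)

def regularizer_grad(x):
    """Gradient of R via the chain rule: W^T (tanh(Wx) * (1 - tanh(Wx)^2))."""
    t = np.tanh(W @ x)
    return W.T @ (t * (1.0 - t ** 2))

def nett_objective(x, y, alpha):
    """Data consistency (assumed squared norm) plus alpha times the regularizer."""
    return 0.5 * np.sum((A @ x - y) ** 2) + alpha * regularizer(x)

def nett_gradient(x, y, alpha):
    return A.T @ (A @ x - y) + alpha * regularizer_grad(x)

def minimize_nett(y, alpha, n_iter=200):
    """Gradient descent with Armijo backtracking, started from zero."""
    x = np.zeros(A.shape[1])
    for _ in range(n_iter):
        g = nett_gradient(x, y, alpha)
        f = nett_objective(x, y, alpha)
        step = 1.0
        # Halve the step until the sufficient-decrease condition holds.
        while nett_objective(x - step * g, y, alpha) > f - 1e-4 * step * (g @ g):
            step *= 0.5
            if step < 1e-12:
                return x  # no further progress possible
        x = x - step * g
    return x

def absolute_bregman(R, grad_R, x_tilde, x):
    """|R(x_tilde) - R(x) - <grad R(x), x_tilde - x>|."""
    return abs(R(x_tilde) - R(x) - grad_R(x) @ (x_tilde - x))

# Simulated noisy data from a hypothetical ground truth.
x_true = rng.normal(size=40)
y_noisy = A @ x_true + 0.01 * rng.normal(size=20)
x_rec = minimize_nett(y_noisy, alpha=0.1)
```

For a convex regularizer the absolute value changes nothing, so `absolute_bregman` reduces to the standard Bregman distance; with `R(x) = 0.5 * ||x||^2` it equals `0.5 * ||x_tilde - x||^2`. In practice the regularizer would be a trained network and the optimizer would likely be chosen to suit it; both are left open here.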
