LGNAAug 25, 2022

CAS4DL: Christoffel Adaptive Sampling for function approximation via Deep Learning

arXiv:2208.12190v114 citationsh-index: 33
Originality Incremental advance
AI Analysis

This work addresses sample efficiency in scientific computing applications like uncertainty quantification, where each sample is costly, by introducing an incremental adaptive method for deep learning-based function approximation.

The paper tackles the problem of approximating smooth multivariate functions from expensive PDE solutions by proposing CAS4DL, an adaptive sampling strategy for deep learning that uses the Christoffel function to select samples, resulting in substantial savings in sample numbers compared to Monte Carlo sampling, particularly with smooth activation functions.

The problem of approximating smooth, multivariate functions from sample points arises in many applications in scientific computing, e.g., in computational Uncertainty Quantification (UQ) for science and engineering. In these applications, the target function may represent a desired quantity of interest of a parameterized Partial Differential Equation (PDE). Due to the large cost of solving such problems, where each sample is computed by solving a PDE, sample efficiency is a key concerning these applications. Recently, there has been increasing focus on the use of Deep Neural Networks (DNN) and Deep Learning (DL) for learning such functions from data. In this work, we propose an adaptive sampling strategy, CAS4DL (Christoffel Adaptive Sampling for Deep Learning) to increase the sample efficiency of DL for multivariate function approximation. Our novel approach is based on interpreting the second to last layer of a DNN as a dictionary of functions defined by the nodes on that layer. With this viewpoint, we then define an adaptive sampling strategy motivated by adaptive sampling schemes recently proposed for linear approximation schemes, wherein samples are drawn randomly with respect to the Christoffel function of the subspace spanned by this dictionary. We present numerical experiments comparing CAS4DL with standard Monte Carlo (MC) sampling. Our results demonstrate that CAS4DL often yields substantial savings in the number of samples required to achieve a given accuracy, particularly in the case of smooth activation functions, and it shows a better stability in comparison to MC. These results therefore are a promising step towards fully adapting DL towards scientific computing applications.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes