LGGEO-PHNov 16, 2022

Using explainability to design physics-aware CNNs for solving subsurface inverse problems

arXiv:2211.08651v213 citationsh-index: 33
Originality Incremental advance
AI Analysis

This work addresses the challenge of developing interpretable neural networks for geophysical inverse problems, though it is incremental as it applies existing explainability methods to a specific domain.

The authors tackled the problem of designing physics-aware convolutional neural networks (CNNs) for shallow subsurface imaging by using explainability techniques (Score-CAM and Deep SHAP) to select hyperparameters, resulting in a shallow CNN with two convolutional layers and a 3x1 kernel that achieved comparable predictive accuracy while improving descriptive accuracy.

We present a novel method of using explainability techniques to design physics-aware neural networks. We demonstrate our approach by developing a convolutional neural network (CNN) for solving an inverse problem for shallow subsurface imaging. Although CNNs have gained popularity in recent years across many fields, the development of CNNs remains an art, as there are no clear guidelines regarding the selection of hyperparameters that will yield the best network. While optimization algorithms may be used to select hyperparameters automatically, these methods focus on developing networks with high predictive accuracy while disregarding model explainability (descriptive accuracy). However, the field of Explainable Artificial Intelligence (XAI) addresses the absence of model explainability by providing tools that allow developers to evaluate the internal logic of neural networks. In this study, we use the explainability methods Score-CAM and Deep SHAP to select hyperparameters, such as kernel sizes and network depth, to develop a physics-aware CNN for shallow subsurface imaging. We begin with a relatively deep Encoder-Decoder network, which uses surface wave dispersion images as inputs and generates 2D shear wave velocity subsurface images as outputs. Through model explanations, we ultimately find that a shallow CNN using two convolutional layers with an atypical kernel size of 3x1 yields comparable predictive accuracy but with increased descriptive accuracy. We also show that explainability methods can be used to evaluate the network's complexity and decision-making. We believe this method can be used to develop neural networks with high predictive accuracy while also providing inherent explainability.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes