Model Guidance via Explanations Turns Image Classifiers into Segmentation Models
This work aims to reduce annotation costs for image segmentation. The contribution is incremental: it builds on the existing 'Right for the Right Reasons' paradigm and on standard encoder-decoder architectures.
The paper tackles semi-supervised image segmentation by unifying explainable AI heatmaps with segmentation models. It shows that differentiable heatmap architectures achieve competitive results when trained with standard segmentation losses, and that they outperform comparable encoder-decoder models when trained with weak supervision, i.e., image-level labels plus a small number of pixel-level labels.
Heatmaps generated for the inputs of image classification networks via explainable AI methods like Grad-CAM and LRP have been observed to resemble segmentations of the input images in many cases. Consequently, heatmaps have also been leveraged for weakly supervised segmentation with image-level supervision. On the other hand, losses can be imposed on differentiable heatmaps, which has been shown to serve (1)~improving heatmaps to be more human-interpretable, (2)~regularizing networks towards better generalization, (3)~training diverse ensembles of networks, and (4)~explicitly ignoring confounding input features. Because of the last use case, the paradigm of imposing losses on heatmaps is often referred to as "Right for the Right Reasons". We unify these two lines of research by investigating semi-supervised segmentation as a novel use case for the Right for the Right Reasons paradigm. First, we show formal parallels between differentiable heatmap architectures and standard encoder-decoder architectures for image segmentation. Second, we show that such differentiable heatmap architectures yield competitive results when trained with standard segmentation losses. Third, we show that such architectures allow for training with weak supervision in the form of image-level labels and small numbers of pixel-level labels, outperforming comparable encoder-decoder models. Code is available at \url{https://github.com/Kainmueller-Lab/TW-autoencoder}.
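To make the idea of imposing a segmentation loss on a heatmap concrete, the following is a minimal sketch and not the implementation from the repository above. It assumes a toy linear classifier, for which the input-times-gradient heatmap of class c at pixel i is simply w[c][i] * x[i], and it assumes single-label images. All function names and the weighting factor `lam` are hypothetical. Every image contributes a classification loss from its image-level label; images that also carry pixel-level labels add a pixel-wise cross-entropy computed directly on the class heatmaps.

```python
import math


def heatmaps(weights, pixels):
    """Input-times-gradient heatmaps of a linear classifier (toy assumption).

    weights[c][i] is the weight of class c for pixel i, so the logit of
    class c is sum_i weights[c][i] * pixels[i]. The heatmap value
    weights[c][i] * pixels[i] is pixel i's contribution to that logit.
    """
    return [[w * x for w, x in zip(row, pixels)] for row in weights]


def log_softmax(scores):
    """Numerically stable log-softmax over a list of scores."""
    m = max(scores)
    lse = m + math.log(sum(math.exp(s - m) for s in scores))
    return [s - lse for s in scores]


def classification_loss(weights, pixels, image_label):
    """Cross-entropy on the logits; each logit is the sum of its heatmap."""
    logits = [sum(row) for row in heatmaps(weights, pixels)]
    return -log_softmax(logits)[image_label]


def heatmap_loss(weights, pixels, pixel_labels):
    """Pixel-wise cross-entropy that treats the class heatmaps as
    per-pixel segmentation scores, averaged over pixels."""
    h = heatmaps(weights, pixels)
    total = 0.0
    for i, label in enumerate(pixel_labels):
        per_class = [h[c][i] for c in range(len(h))]
        total -= log_softmax(per_class)[label]
    return total / len(pixel_labels)


def semi_supervised_loss(weights, pixels, image_label,
                         pixel_labels=None, lam=1.0):
    """Image-level loss for every image; the heatmap loss is added only
    when pixel-level labels exist. `lam` is a hypothetical weight."""
    loss = classification_loss(weights, pixels, image_label)
    if pixel_labels is not None:
        loss += lam * heatmap_loss(weights, pixels, pixel_labels)
    return loss


# Two classes, two pixels: class 0 attends to pixel 0, class 1 to pixel 1.
weights = [[1.0, 0.0], [0.0, 1.0]]
pixels = [1.0, 1.0]
weak = semi_supervised_loss(weights, pixels, image_label=0)
full = semi_supervised_loss(weights, pixels, image_label=0,
                            pixel_labels=[0, 1])
```

In this sketch both loss terms depend on the same weights, so a gradient step on the heatmap term also changes the classifier. That shared dependence is the property the Right for the Right Reasons paradigm relies on; a deep network would need a differentiable attribution method in place of the closed-form linear heatmap assumed here.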