CVSep 7, 2023

BroadCAM: Outcome-agnostic Class Activation Mapping for Small-scale Weakly Supervised Applications

Jiatai Lin, Guoqiang Han, Xuemiao Xu, Changhong Liang, Tien-Tsin Wong, C. L. Philip Chen, Zaiyi Liu, Chu Han

arXiv:2309.03509v12.81 citationsh-index: 51Has Code

Originality Highly original

AI Analysis

This addresses the challenge of noisy CAM seeds in weakly supervised semantic segmentation and object localization for applications with limited data, offering a more robust visualization technique.

The paper tackles the problem of unreliable class activation mapping (CAM) in small-scale weakly supervised learning by proposing BroadCAM, an outcome-agnostic approach that avoids dependence on unstable model outcomes. It demonstrates superior performance over existing methods on datasets like VOC2012 and BCSS-WSSS with small-scale data (less than 5%) and achieves state-of-the-art results with large-scale data.

Class activation mapping~(CAM), a visualization technique for interpreting deep learning models, is now commonly used for weakly supervised semantic segmentation~(WSSS) and object localization~(WSOL). It is the weighted aggregation of the feature maps by activating the high class-relevance ones. Current CAM methods achieve it relying on the training outcomes, such as predicted scores~(forward information), gradients~(backward information), etc. However, when with small-scale data, unstable training may lead to less effective model outcomes and generate unreliable weights, finally resulting in incorrect activation and noisy CAM seeds. In this paper, we propose an outcome-agnostic CAM approach, called BroadCAM, for small-scale weakly supervised applications. Since broad learning system (BLS) is independent to the model learning, BroadCAM can avoid the weights being affected by the unreliable model outcomes when with small-scale data. By evaluating BroadCAM on VOC2012 (natural images) and BCSS-WSSS (medical images) for WSSS and OpenImages30k for WSOL, BroadCAM demonstrates superior performance than existing CAM methods with small-scale data (less than 5\%) in different CNN architectures. It also achieves SOTA performance with large-scale training data. Extensive qualitative comparisons are conducted to demonstrate how BroadCAM activates the high class-relevance feature maps and generates reliable CAMs when with small-scale training data.

View on arXiv PDF Code

Similar