CVAug 6, 2024

Dual-View Pyramid Pooling in Deep Neural Networks for Improved Medical Image Classification and Confidence Calibration

arXiv:2408.02906v23 citationsh-index: 16
Originality Incremental advance
AI Analysis

This work addresses confidence calibration and classification issues in medical image analysis, which is critical for reliable diagnosis, but it is incremental as it builds on existing pooling methods.

The paper tackled the problem of spatial pooling losing subtle features and cross-channel pooling ignoring salient features, which leads to miscalibration and suboptimal classification in medical imaging, by proposing dual-view pyramid pooling (DVPP) that improved classification accuracy and confidence calibration across six 2D/3D medical image tasks, surpassing state-of-the-art methods.

Spatial pooling (SP) and cross-channel pooling (CCP) operators have been applied to aggregate spatial features and pixel-wise features from feature maps in deep neural networks (DNNs), respectively. Their main goal is to reduce computation and memory overhead without visibly weakening the performance of DNNs. However, SP often faces the problem of losing the subtle feature representations, while CCP has a high possibility of ignoring salient feature representations, which may lead to both miscalibration of confidence issues and suboptimal medical classification results. To address these problems, we propose a novel dual-view framework, the first to systematically investigate the relative roles of SP and CCP by analyzing the difference between spatial features and pixel-wise features. Based on this framework, we propose a new pooling method, termed dual-view pyramid pooling (DVPP), to aggregate multi-scale dual-view features. DVPP aims to boost both medical image classification and confidence calibration performance by fully leveraging the merits of SP and CCP operators from a dual-axis perspective. Additionally, we discuss how to fulfill DVPP with five parameter-free implementations. Extensive experiments on six 2D/3D medical image classification tasks show that our DVPP surpasses state-of-the-art pooling methods in terms of medical image classification results and confidence calibration across different DNNs.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes