CV | Jun 22, 2022

Feature Re-calibration based Multiple Instance Learning for Whole Slide Image Classification

Philip Chikontwe, Soo Jeong Nam, Heounjeong Go, Meejeong Kim, Hyun Jung Sung, Sang Hyun Park

arXiv:2206.10878v25.726 citations | h-index: 35 | Has Code

Originality: Incremental advance

AI Analysis

The paper addresses the scarcity of labeled data in medical imaging, where annotation depends on pathologists' time. The advance is incremental: it builds on existing MIL methods and adds new techniques on top of them.

The paper tackles whole slide image classification by proposing a feature re-calibration method using max-instance statistics and balanced-batch sampling to improve multiple instance learning, showing effectiveness and improvements over state-of-the-art methods on benchmark datasets.

Whole slide image (WSI) classification is a fundamental task for the diagnosis and treatment of diseases, but curation of accurate labels is time-consuming and limits the application of fully-supervised methods. To address this, multiple instance learning (MIL) is a popular method that poses classification as a weakly supervised learning task with slide-level labels only. While current MIL methods apply variants of the attention mechanism to re-weight instance features with stronger models, scant attention is paid to the properties of the data distribution. In this work, we propose to re-calibrate the distribution of a WSI bag (instances) by using the statistics of the max-instance (critical) feature. We assume that in binary MIL, positive bags have larger feature magnitudes than negatives; thus, we can push the model to maximize the discrepancy between bags with a metric feature loss that models positive bags as out-of-distribution. To achieve this, unlike existing MIL methods that use single-batch training modes, we propose balanced-batch sampling to use the feature loss effectively, i.e., to train on positive and negative (+/-) bags simultaneously. Further, we employ a position encoding module (PEM) to model spatial/morphological information, and perform pooling by multi-head self-attention (PSMA) with a Transformer encoder. Experimental results on existing benchmark datasets show our approach is effective and improves over state-of-the-art MIL methods.
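
The two training ideas in the abstract, balanced-batch sampling and a feature loss driven by the magnitude of the max-instance feature, can be illustrated with a minimal Python sketch. This is not the authors' implementation. The function names, picking the critical instance by L2 norm, the hinge form of the loss, and the `margin` parameter are all assumptions made for illustration. The abstract only states that positive bags are assumed to have larger feature magnitudes than negative bags and that both kinds of bag are used in the same step.

```python
import math
import random
from typing import Iterator, List, Sequence, Tuple

Bag = Sequence[Sequence[float]]  # one bag = a list of instance feature vectors


def balanced_batches(labels: Sequence[int], seed: int = 0) -> Iterator[Tuple[int, int]]:
    """Yield (positive_idx, negative_idx) pairs so every step sees both classes."""
    rng = random.Random(seed)
    pos: List[int] = [i for i, y in enumerate(labels) if y == 1]
    neg: List[int] = [i for i, y in enumerate(labels) if y == 0]
    rng.shuffle(pos)
    rng.shuffle(neg)
    # Assumption: truncate to the smaller class rather than oversample.
    yield from zip(pos, neg)


def l2_norm(vec: Sequence[float]) -> float:
    """Euclidean magnitude of one instance feature vector."""
    return math.sqrt(sum(v * v for v in vec))


def max_instance(bag: Bag) -> Sequence[float]:
    """Assumed stand-in for the critical instance: the largest-magnitude feature."""
    return max(bag, key=l2_norm)


def magnitude_loss(pos_bag: Bag, neg_bag: Bag, margin: float = 1.0) -> float:
    """Hypothetical hinge-style loss on max-instance magnitudes.

    Penalizes a positive bag whose max-instance magnitude is below `margin`
    and penalizes any magnitude in the negative bag.
    """
    pos_mag = l2_norm(max_instance(pos_bag))
    neg_mag = l2_norm(max_instance(neg_bag))
    return max(0.0, margin - pos_mag) + neg_mag


if __name__ == "__main__":
    labels = [1, 0, 1, 0, 0]
    bags = [[[random.random() for _ in range(4)] for _ in range(3)] for _ in labels]
    for p, n in balanced_batches(labels):
        print(p, n, magnitude_loss(bags[p], bags[n]))
```

Pairing one positive with one negative bag per step is what lets a loss of this kind compare the two magnitudes directly; with single-bag batches there would be nothing to contrast against.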
