CV LGAug 18, 2024

From Correlation to Causation: Max-Pooling-Based Multi-Instance Learning Leads to More Robust Whole Slide Image Classification

Xin Liu, Weijia Zhang, Wei Tang, Thuc Duy Le, Jiuyong Li, Lin Liu, Min-Ling Zhang

arXiv:2408.09449v36.53 citationsh-index: 27

Originality Incremental advance

AI Analysis

This addresses the problem of unreliable tumor localization in medical imaging for pathologists, but it is incremental as it builds on existing max-pooling methods.

The paper tackled the problem of attention-based multi-instance learning models being susceptible to spurious correlations and degrading under domain shift in whole slide image analysis, and the result was that FocusMIL, which couples max-pooling with a variational information bottleneck, showed significant advantages in out-of-distribution scenarios and tumor region localization tasks.

In whole slide images (WSIs) analysis, attention-based multi-instance learning (MIL) models are susceptible to spurious correlations and degrade under domain shift. These methods may assign high attention weights to non-tumor regions, such as staining biases or artifacts, leading to unreliable tumor region localization. In this paper, we revisit max-pooling-based MIL methods from a causal perspective. Under mild assumptions, our theoretical results demonstrate that max-pooling encourages the model to focus on causal factors while ignoring bias-related factors. Furthermore, we discover that existing max-pooling-based methods may overfit the training set through rote memorization of instance features and fail to learn meaningful patterns. To address these issues, we propose FocusMIL, which couples max-pooling with an instance-level variational information bottleneck (VIB) to learn compact, predictive latent representations, and employs a multi-bag mini-batch scheme to stabilize optimization. We conduct comprehensive experiments on three real-world datasets and one semi-synthetic dataset. The results show that, by capturing causal factors, FocusMIL exhibits significant advantages in out-of-distribution scenarios and instance-level tumor region localization tasks.

View on arXiv PDF

Similar