CVLGJun 8, 2023

Multi-level Multiple Instance Learning with Transformer for Whole Slide Image Classification

arXiv:2306.05029v225 citationsh-index: 142Has Code
Originality Incremental advance
AI Analysis

This work addresses weakly-supervised high-resolution image analysis for computer-assisted diagnosis, representing an incremental improvement in domain-specific methods.

The paper tackles the challenge of whole slide image classification by proposing a multi-level multiple instance learning scheme with a Transformer model, achieving state-of-the-art results such as 96.80% test AUC on CAMELYON16 and 99.04% test AUC on TCGA-NSCLC.

Whole slide image (WSI) refers to a type of high-resolution scanned tissue image, which is extensively employed in computer-assisted diagnosis (CAD). The extremely high resolution and limited availability of region-level annotations make employing deep learning methods for WSI-based digital diagnosis challenging. Recently integrating multiple instance learning (MIL) and Transformer for WSI analysis shows very promising results. However, designing effective Transformers for this weakly-supervised high-resolution image analysis is an underexplored yet important problem. In this paper, we propose a Multi-level MIL (MMIL) scheme by introducing a hierarchical structure to MIL, which enables efficient handling of MIL tasks involving a large number of instances. Based on MMIL, we instantiated MMIL-Transformer, an efficient Transformer model with windowed exact self-attention for large-scale MIL tasks. To validate its effectiveness, we conducted a set of experiments on WSI classification tasks, where MMIL-Transformer demonstrate superior performance compared to existing state-of-the-art methods, i.e., 96.80% test AUC and 97.67% test accuracy on the CAMELYON16 dataset, 99.04% test AUC and 94.37% test accuracy on the TCGA-NSCLC dataset, respectively. All code and pre-trained models are available at: https://github.com/hustvl/MMIL-Transformer

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes