CVSep 15, 2025

Multiple Instance Learning Framework with Masked Hard Instance Mining for Gigapixel Histopathology Image Analysis

Wenhao Tang, Sheng Huang, Heng Fang, Fengtao Zhou, Bo Liu, Qingshan Liu

arXiv:2509.11526v13.62 citationsh-index: 6Has CodeInt J Comput Vis

Originality Incremental advance

AI Analysis

This work addresses a critical bottleneck in computational pathology for medical professionals by improving MIL performance on gigapixel whole slide images, though it is an incremental advancement over existing MIL methods.

The paper tackles the problem of bias toward easy-to-classify instances in Multiple Instance Learning (MIL) for gigapixel histopathology images by proposing a novel framework with masked hard instance mining (MHIM-MIL), which outperforms the latest methods on 12 benchmarks for cancer diagnosis, subtyping, and survival analysis tasks.

Digitizing pathological images into gigapixel Whole Slide Images (WSIs) has opened new avenues for Computational Pathology (CPath). As positive tissue comprises only a small fraction of gigapixel WSIs, existing Multiple Instance Learning (MIL) methods typically focus on identifying salient instances via attention mechanisms. However, this leads to a bias towards easy-to-classify instances while neglecting challenging ones. Recent studies have shown that hard examples are crucial for accurately modeling discriminative boundaries. Applying such an idea at the instance level, we elaborate a novel MIL framework with masked hard instance mining (MHIM-MIL), which utilizes a Siamese structure with a consistency constraint to explore the hard instances. Using a class-aware instance probability, MHIM-MIL employs a momentum teacher to mask salient instances and implicitly mine hard instances for training the student model. To obtain diverse, non-redundant hard instances, we adopt large-scale random masking while utilizing a global recycle network to mitigate the risk of losing key features. Furthermore, the student updates the teacher using an exponential moving average, which identifies new hard instances for subsequent training iterations and stabilizes optimization. Experimental results on cancer diagnosis, subtyping, survival analysis tasks, and 12 benchmarks demonstrate that MHIM-MIL outperforms the latest methods in both performance and efficiency. The code is available at: https://github.com/DearCaat/MHIM-MIL.

View on arXiv PDF Code

Similar