CVJun 18, 2024

AEM: Attention Entropy Maximization for Multiple Instance Learning based Whole Slide Image Classification

arXiv:2406.15303v310 citationsHas Code
Originality Incremental advance
AI Analysis

This addresses overfitting issues in medical image analysis for pathologists, though it is incremental as it builds on existing MIL methods with a new regularization approach.

The paper tackles overfitting in Multiple Instance Learning for whole slide image classification by introducing Attention Entropy Maximization, a regularization technique that penalizes excessive attention concentration, leading to superior performance across diverse setups.

Multiple Instance Learning (MIL) effectively analyzes whole slide images but faces overfitting due to attention over-concentration. While existing solutions rely on complex architectural modifications or additional processing steps, we introduce Attention Entropy Maximization (AEM), a simple yet effective regularization technique. Our investigation reveals the positive correlation between attention entropy and model performance. Building on this insight, we integrate AEM regularization into the MIL framework to penalize excessive attention concentration. To address sensitivity to the AEM weight parameter, we implement Cosine Weight Annealing, reducing parameter dependency. Extensive evaluations demonstrate AEM's superior performance across diverse feature extractors, MIL frameworks, attention mechanisms, and augmentation techniques. Here is our anonymous code: https://github.com/dazhangyu123/AEM.

Code Implementations3 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes