CVAIJan 6, 2025

Label-free Concept Based Multiple Instance Learning for Gigapixel Histopathology

arXiv:2501.02922v19 citationsh-index: 55
Originality Incremental advance
AI Analysis

This addresses the need for human-interpretable AI in high-stakes medical domains like pathology, offering an incremental improvement over traditional methods by eliminating costly annotations.

The paper tackles the problem of interpretability in multiple instance learning for gigapixel histopathology by proposing a concept-based model that uses vision-language models to predict pathology concepts without human annotations, achieving AUC and accuracy over 0.9 on Camelyon16 and PANDA datasets and showing high alignment with pathologist concepts in a user study.

Multiple Instance Learning (MIL) methods allow for gigapixel Whole-Slide Image (WSI) analysis with only slide-level annotations. Interpretability is crucial for safely deploying such algorithms in high-stakes medical domains. Traditional MIL methods offer explanations by highlighting salient regions. However, such spatial heatmaps provide limited insights for end users. To address this, we propose a novel inherently interpretable WSI-classification approach that uses human-understandable pathology concepts to generate explanations. Our proposed Concept MIL model leverages recent advances in vision-language models to directly predict pathology concepts based on image features. The model's predictions are obtained through a linear combination of the concepts identified on the top-K patches of a WSI, enabling inherent explanations by tracing each concept's influence on the prediction. In contrast to traditional concept-based interpretable models, our approach eliminates the need for costly human annotations by leveraging the vision-language model. We validate our method on two widely used pathology datasets: Camelyon16 and PANDA. On both datasets, Concept MIL achieves AUC and accuracy scores over 0.9, putting it on par with state-of-the-art models. We further find that 87.1\% (Camelyon16) and 85.3\% (PANDA) of the top 20 patches fall within the tumor region. A user study shows that the concepts identified by our model align with the concepts used by pathologists, making it a promising strategy for human-interpretable WSI classification.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes