CVJun 18, 2022

Attention-based Dynamic Subspace Learners for Medical Image Analysis

arXiv:2206.09068v11 citationsh-index: 40
Originality Incremental advance
AI Analysis

This work addresses the challenge of generalizing similarity learning in medical image analysis for tasks like clustering, retrieval, and segmentation, offering an incremental improvement over existing multi-learner methods.

The paper tackles the problem of learning similarities in medical images by proposing attention-based dynamic subspace learners, which dynamically exploit multiple learners without needing a predefined number and integrate an attention mechanism for interpretability. It achieves competitive results in clustering and retrieval on three benchmark datasets and improves segmentation accuracy by up to 15% in Dice scores compared to state-of-the-art techniques.

Learning similarity is a key aspect in medical image analysis, particularly in recommendation systems or in uncovering the interpretation of anatomical data in images. Most existing methods learn such similarities in the embedding space over image sets using a single metric learner. Images, however, have a variety of object attributes such as color, shape, or artifacts. Encoding such attributes using a single metric learner is inadequate and may fail to generalize. Instead, multiple learners could focus on separate aspects of these attributes in subspaces of an overarching embedding. This, however, implies the number of learners to be found empirically for each new dataset. This work, Dynamic Subspace Learners, proposes to dynamically exploit multiple learners by removing the need of knowing apriori the number of learners and aggregating new subspace learners during training. Furthermore, the visual interpretability of such subspace learning is enforced by integrating an attention module into our method. This integrated attention mechanism provides a visual insight of discriminative image features that contribute to the clustering of image sets and a visual explanation of the embedding features. The benefits of our attention-based dynamic subspace learners are evaluated in the application of image clustering, image retrieval, and weakly supervised segmentation. Our method achieves competitive results with the performances of multiple learners baselines and significantly outperforms the classification network in terms of clustering and retrieval scores on three different public benchmark datasets. Moreover, our attention maps offer a proxy-labels, which improves the segmentation accuracy up to 15% in Dice scores when compared to state-of-the-art interpretation techniques.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes