IVAICVOct 20, 2023

Skin Lesion Segmentation Improved by Transformer-based Networks with Inter-scale Dependency Modeling

arXiv:2310.13604v113 citationsh-index: 4Has Code
Originality Incremental advance
AI Analysis

This work addresses the challenge of semantic gaps in medical image segmentation for skin lesion analysis, offering an incremental improvement over existing Transformer-based U-Net methods.

The paper tackles the problem of accurate skin lesion segmentation for melanoma diagnosis by proposing a U-shaped hierarchical Transformer-based network with an Inter-scale Context Fusion (ISCF) method to mitigate semantic gaps and improve feature re-usability, achieving results validated on two benchmarks with publicly available code.

Melanoma, a dangerous type of skin cancer resulting from abnormal skin cell growth, can be treated if detected early. Various approaches using Fully Convolutional Networks (FCNs) have been proposed, with the U-Net architecture being prominent To aid in its diagnosis through automatic skin lesion segmentation. However, the symmetrical U-Net model's reliance on convolutional operations hinders its ability to capture long-range dependencies crucial for accurate medical image segmentation. Several Transformer-based U-Net topologies have recently been created to overcome this limitation by replacing CNN blocks with different Transformer modules to capture local and global representations. Furthermore, the U-shaped structure is hampered by semantic gaps between the encoder and decoder. This study intends to increase the network's feature re-usability by carefully building the skip connection path. Integrating an already calculated attention affinity within the skip connection path improves the typical concatenation process utilized in the conventional skip connection path. As a result, we propose a U-shaped hierarchical Transformer-based structure for skin lesion segmentation and an Inter-scale Context Fusion (ISCF) method that uses attention correlations in each stage of the encoder to adaptively combine the contexts from each stage to mitigate semantic gaps. The findings from two skin lesion segmentation benchmarks support the ISCF module's applicability and effectiveness. The code is publicly available at \url{https://github.com/saniaesk/skin-lesion-segmentation}

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes