CVJun 1, 2022

Dynamic Linear Transformer for 3D Biomedical Image Segmentation

arXiv:2206.00771v217 citationsh-index: 18Has Code
Originality Incremental advance
AI Analysis

This work addresses 3D medical image segmentation for biomedical applications, offering an incremental improvement by adapting transformers to 3D with reduced computational cost.

The paper tackles 3D biomedical image segmentation by proposing a transformer architecture with linear complexity and dynamic tokens to address the quadratic complexity challenge, achieving promising segmentation performance and accurate uncertainty quantification on CT pancreas datasets.

Transformer-based neural networks have surpassed promising performance on many biomedical image segmentation tasks due to a better global information modeling from the self-attention mechanism. However, most methods are still designed for 2D medical images while ignoring the essential 3D volume information. The main challenge for 3D transformer-based segmentation methods is the quadratic complexity introduced by the self-attention mechanism \cite{vaswani2017attention}. In this paper, we propose a novel transformer architecture for 3D medical image segmentation using an encoder-decoder style architecture with linear complexity. Furthermore, we newly introduce a dynamic token concept to further reduce the token numbers for self-attention calculation. Taking advantage of the global information modeling, we provide uncertainty maps from different hierarchy stages. We evaluate this method on multiple challenging CT pancreas segmentation datasets. Our promising results show that our novel 3D Transformer-based segmentor could provide promising highly feasible segmentation performance and accurate uncertainty quantification using single annotation. Code is available https://github.com/freshman97/LinTransUNet.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes