CV · Aug 1, 2024

SegStitch: Multidimensional Transformer for Robust and Efficient Medical Imaging Segmentation

arXiv:2408.00496v1 · 18 citations · h-index: 23 · Has Code
Originality: Incremental advance
AI Analysis

This work addresses efficiency and accuracy issues in medical imaging segmentation for clinical applications, representing an incremental advancement over existing transformer-based methods.

The paper tackles two weaknesses of plain vision transformers in 3D medical imaging segmentation, neglect of local features and high computational complexity, by proposing SegStitch, which integrates transformers with denoising ODE blocks. It reports mDSC improvements of up to 11.48% on BTCV and 6.71% on ACDC, with 36.7% fewer parameters than UNETR.

Medical imaging segmentation plays a significant role in the automatic recognition and analysis of lesions. State-of-the-art methods, particularly those utilizing transformers, have been widely adopted in 3D semantic segmentation due to their superior scalability and generalizability. However, plain vision transformers neglect local features and incur high computational complexity. To address these challenges, we make three key contributions. First, we propose SegStitch, an architecture that integrates transformers with denoising ODE blocks. Instead of taking whole 3D volumes as inputs, we adapt axial patches and customize patch-wise queries to ensure semantic consistency. Second, we conduct extensive experiments on the BTCV and ACDC datasets, achieving mDSC improvements of up to 11.48% and 6.71%, respectively, compared to state-of-the-art methods. Lastly, our proposed method is efficient, reducing the number of parameters by 36.7% and the number of FLOPs by 10.7% compared to UNETR. This advancement holds promising potential for adapting our method to real-world clinical practice. The code will be available at https://github.com/goblin327/SegStitch
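The abstract reports its accuracy gains in mDSC, the mean Dice similarity coefficient across segmentation classes. The following is a minimal sketch of how that metric is commonly computed, not the authors' evaluation code. The function names `dice_score` and `mean_dice`, the flattened-label input format, and the convention of scoring a class absent from both masks as 1.0 are all assumptions made for illustration.

```python
from typing import Sequence


def dice_score(pred: Sequence[int], target: Sequence[int], label: int) -> float:
    """Dice coefficient for one class over flattened voxel labels.

    Dice = 2 * |P ∩ T| / (|P| + |T|), where P and T are the sets of voxels
    assigned `label` in the prediction and the ground truth.
    """
    if len(pred) != len(target):
        raise ValueError("pred and target must have the same number of voxels")
    pred_count = sum(1 for p in pred if p == label)
    target_count = sum(1 for t in target if t == label)
    overlap = sum(1 for p, t in zip(pred, target) if p == label and t == label)
    denom = pred_count + target_count
    if denom == 0:
        # Assumption: a class absent from both masks counts as perfect agreement.
        return 1.0
    return 2.0 * overlap / denom


def mean_dice(pred: Sequence[int], target: Sequence[int], labels: Sequence[int]) -> float:
    """Mean Dice over the given foreground labels (background is usually excluded)."""
    scores = [dice_score(pred, target, c) for c in labels]
    return sum(scores) / len(scores)


# Toy example: 8 voxels, two foreground classes (1 and 2), background 0.
pred = [0, 1, 1, 2, 2, 2, 0, 0]
target = [0, 1, 1, 1, 2, 2, 0, 2]
print(dice_score(pred, target, 1))
print(mean_dice(pred, target, [1, 2]))
```

In practice the metric is computed per 3D volume on array data and then averaged over the test set. The flat lists here only keep the sketch dependency-free.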

Code Implementations: 1 repo