IV CVNov 26, 2021

A Robust Volumetric Transformer for Accurate 3D Tumor Segmentation

Himashi Peiris, Munawar Hayat, Zhaolin Chen, Gary Egan, Mehrtash Harandi

arXiv:2111.13300v225.3216 citationsHas Code

Originality Synthesis-oriented

AI Analysis

This work addresses accurate segmentation of brain tumors in medical imaging, representing an incremental improvement with domain-specific impact.

The authors tackled 3D tumor segmentation by proposing a Transformer architecture that balances local and global cues, achieving competitive results on the BraTS benchmark with robustness to data corruptions.

We propose a Transformer architecture for volumetric segmentation, a challenging task that requires keeping a complex balance in encoding local and global spatial cues, and preserving information along all axes of the volume. Encoder of the proposed design benefits from self-attention mechanism to simultaneously encode local and global cues, while the decoder employs a parallel self and cross attention formulation to capture fine details for boundary refinement. Empirically, we show that the proposed design choices result in a computationally efficient model, with competitive and promising results on the Medical Segmentation Decathlon (MSD) brain tumor segmentation (BraTS) Task. We further show that the representations learned by our model are robust against data corruptions. \href{https://github.com/himashi92/VT-UNet}{Our code implementation is publicly available}.

View on arXiv PDF Code

Similar