IVCVNov 26, 2021

A Robust Volumetric Transformer for Accurate 3D Tumor Segmentation

arXiv:2111.13300v2216 citationsHas Code
Originality Synthesis-oriented
AI Analysis

This work addresses accurate segmentation of brain tumors in medical imaging, representing an incremental improvement with domain-specific impact.

The authors tackled 3D tumor segmentation by proposing a Transformer architecture that balances local and global cues, achieving competitive results on the BraTS benchmark with robustness to data corruptions.

We propose a Transformer architecture for volumetric segmentation, a challenging task that requires keeping a complex balance in encoding local and global spatial cues, and preserving information along all axes of the volume. Encoder of the proposed design benefits from self-attention mechanism to simultaneously encode local and global cues, while the decoder employs a parallel self and cross attention formulation to capture fine details for boundary refinement. Empirically, we show that the proposed design choices result in a computationally efficient model, with competitive and promising results on the Medical Segmentation Decathlon (MSD) brain tumor segmentation (BraTS) Task. We further show that the representations learned by our model are robust against data corruptions. \href{https://github.com/himashi92/VT-UNet}{Our code implementation is publicly available}.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes