IVCVAug 13, 2022

Recent Progress in Transformer-based Medical Image Analysis

arXiv:2208.06643v4100 citationsh-index: 18
Originality Synthesis-oriented
AI Analysis

It provides a comprehensive overview for the medical image analysis community, but is incremental as it reviews existing progress rather than introducing new methods.

This review paper surveys the application of transformer models to medical image analysis, showing that transformer-based methods outperform existing approaches across various tasks and modalities, as evidenced by numerous experiments and comparisons.

The transformer is primarily used in the field of natural language processing. Recently, it has been adopted and shows promise in the computer vision (CV) field. Medical image analysis (MIA), as a critical branch of CV, also greatly benefits from this state-of-the-art technique. In this review, we first recap the core component of the transformer, the attention mechanism, and the detailed structures of the transformer. After that, we depict the recent progress of the transformer in the field of MIA. We organize the applications in a sequence of different tasks, including classification, segmentation, captioning, registration, detection, enhancement, localization, and synthesis. The mainstream classification and segmentation tasks are further divided into eleven medical image modalities. A large number of experiments studied in this review illustrate that the transformer-based method outperforms existing methods through comparisons with multiple evaluation metrics. Finally, we discuss the open challenges and future opportunities in this field. This task-modality review with the latest contents, detailed information, and comprehensive comparison may greatly benefit the broad MIA community.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes