CV AIOct 13, 2024

Large-Scale 3D Medical Image Pre-training with Geometric Context Priors

arXiv:2410.09890v121.534 citationsh-index: 7Has CodeIEEE Trans Pattern Anal Mach Intell

Originality Incremental advance

AI Analysis

This work addresses label efficiency for medical imaging researchers and practitioners, offering a novel method with broad applicability but incremental in its adaptation of contrastive learning to medical data.

The paper tackles the challenge of annotation scarcity in 3D medical image analysis by introducing a self-supervised pre-training framework called VoCo that leverages geometric context priors, achieving superior performance across 48 medical tasks with the largest pre-training dataset PreCT-160K.

The scarcity of annotations poses a significant challenge in medical image analysis. Large-scale pre-training has emerged as a promising label-efficient solution, owing to the utilization of large-scale data, large models, and advanced pre-training techniques. However, its development in medical images remains underexplored. The primary challenge lies in harnessing large-scale unlabeled data and learning high-level semantics without annotations. We observe that 3D medical images exhibit consistent geometric context, i.e., consistent geometric relations between different organs, which leads to a promising way for learning consistent representations. Motivated by this, we introduce a simple-yet-effective Volume Contrast (VoCo) framework to leverage geometric context priors for self-supervision. Given an input volume, we extract base crops from different regions to construct positive and negative pairs for contrastive learning. Then we predict the contextual position of a random crop by contrasting its similarity to the base crops. In this way, VoCo encodes the inherent geometric context into model representations, facilitating high-level semantic learning without annotations. Specifically, we (1) introduce the largest medical pre-training dataset PreCT-160K; (2) investigate scaling laws and propose guidelines for tailoring different model sizes to various medical tasks; (3) build a benchmark encompassing 48 medical tasks. Extensive experiments highlight the superiority of VoCo. Codes at https://github.com/Luffy03/Large-Scale-Medical.

View on arXiv PDF Code

Similar