CVApr 17, 2023

CLIP-Lung: Textual Knowledge-Guided Lung Nodule Malignancy Prediction

arXiv:2304.08013v132 citationsh-index: 36
Originality Incremental advance
AI Analysis

This work addresses lung nodule malignancy prediction for medical imaging, offering a domain-specific improvement by integrating textual knowledge.

The paper tackled the problem of distinguishing lung nodules with similar malignancy labels by incorporating clinical text annotations into training, resulting in improved classification performance and interpretability on the LIDC-IDRI dataset.

Lung nodule malignancy prediction has been enhanced by advanced deep-learning techniques and effective tricks. Nevertheless, current methods are mainly trained with cross-entropy loss using one-hot categorical labels, which results in difficulty in distinguishing those nodules with closer progression labels. Interestingly, we observe that clinical text information annotated by radiologists provides us with discriminative knowledge to identify challenging samples. Drawing on the capability of the contrastive language-image pre-training (CLIP) model to learn generalized visual representations from text annotations, in this paper, we propose CLIP-Lung, a textual knowledge-guided framework for lung nodule malignancy prediction. First, CLIP-Lung introduces both class and attribute annotations into the training of the lung nodule classifier without any additional overheads in inference. Second, we designed a channel-wise conditional prompt (CCP) module to establish consistent relationships between learnable context prompts and specific feature maps. Third, we align image features with both class and attribute features via contrastive learning, rectifying false positives and false negatives in latent space. The experimental results on the benchmark LIDC-IDRI dataset have demonstrated the superiority of CLIP-Lung, both in classification performance and interpretability of attention maps.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes