CVAICLIRLGJan 15, 2025

Benchmarking Robustness of Contrastive Learning Models for Medical Image-Report Retrieval

arXiv:2501.09134v13 citationsh-index: 3
Originality Synthesis-oriented
AI Analysis

This work addresses the problem of unreliable cross-domain retrieval in medical applications, but it is incremental as it benchmarks existing models without proposing new methods.

The study benchmarked the robustness of four contrastive learning models (CLIP, CXR-RePaiR, MedCLIP, CXR-CLIP) for medical image-report retrieval under image corruption, finding all models sensitive to out-of-distribution data with performance decreasing proportionally to occlusion levels, and MedCLIP slightly more robust but still lagging behind others.

Medical images and reports offer invaluable insights into patient health. The heterogeneity and complexity of these data hinder effective analysis. To bridge this gap, we investigate contrastive learning models for cross-domain retrieval, which associates medical images with their corresponding clinical reports. This study benchmarks the robustness of four state-of-the-art contrastive learning models: CLIP, CXR-RePaiR, MedCLIP, and CXR-CLIP. We introduce an occlusion retrieval task to evaluate model performance under varying levels of image corruption. Our findings reveal that all evaluated models are highly sensitive to out-of-distribution data, as evidenced by the proportional decrease in performance with increasing occlusion levels. While MedCLIP exhibits slightly more robustness, its overall performance remains significantly behind CXR-CLIP and CXR-RePaiR. CLIP, trained on a general-purpose dataset, struggles with medical image-report retrieval, highlighting the importance of domain-specific training data. The evaluation of this work suggests that more effort needs to be spent on improving the robustness of these models. By addressing these limitations, we can develop more reliable cross-domain retrieval models for medical applications.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes