IVCVFeb 7, 2025

Chest X-ray Foundation Model with Global and Local Representations Integration

arXiv:2502.05142v28 citationsh-index: 68Has CodeIEEE Transactions on Medical Imaging
AI Analysis

This work addresses the problem of developing a generalizable and efficient chest X-ray classification model for the medical community, particularly for tasks with limited labeled data.

The authors tackled the problem of limited scope and generalizability of task-specific chest X-ray classification models, and achieved state-of-the-art performance in classifying 40 disease findings with their CheXFound model, outperforming existing models. CheXFound also demonstrated superior label efficiency and strong generalization capabilities across different datasets and tasks.

Chest X-ray (CXR) is the most frequently ordered imaging test, supporting diverse clinical tasks from thoracic disease detection to postoperative monitoring. However, task-specific classification models are limited in scope, require costly labeled data, and lack generalizability to out-of-distribution datasets. To address these challenges, we introduce CheXFound, a self-supervised vision foundation model that learns robust CXR representations and generalizes effectively across a wide range of downstream tasks. We pretrain CheXFound on a curated CXR-1M dataset, comprising over one million unique CXRs from publicly available sources. We propose a Global and Local Representations Integration (GLoRI) module for downstream adaptations, by incorporating disease-specific local features with global image features for enhanced performance in multilabel classification. Our experimental results show that CheXFound outperforms state-of-the-art models in classifying 40 disease findings across different prevalence levels on the CXR-LT 24 dataset and exhibits superior label efficiency on downstream tasks with limited training data. Additionally, CheXFound achieved significant improvements on new tasks with out-of-distribution datasets, including opportunistic cardiovascular disease risk estimation and mortality prediction. These results highlight CheXFound's strong generalization capabilities, enabling diverse adaptations with improved label efficiency. The project source code is publicly available at https://github.com/RPIDIAL/CheXFound.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes