Using Backbone Foundation Model for Evaluating Fairness in Chest Radiography Without Demographic Data
This work addresses fairness assessment in medical diagnostics for scenarios where demographic data is unavailable, though it is incremental as it builds on existing foundation models and bias mitigation techniques.
This study tackled the problem of evaluating fairness in chest radiography without demographic data by using a backbone foundation model as an embedding extractor to create groups representing protected attributes like gender and age, resulting in a reduction of gender-based performance differences by 4.44% in-distribution and 6.16% out-of-distribution, though it lacked robustness for age attributes.
Ensuring consistent performance across diverse populations and incorporating fairness into machine learning models are crucial for advancing medical image diagnostics and promoting equitable healthcare. However, many databases do not provide protected attributes or contain unbalanced representations of demographic groups, complicating the evaluation of model performance across different demographics and the application of bias mitigation techniques that rely on these attributes. This study aims to investigate the effectiveness of using the backbone of Foundation Models as an embedding extractor for creating groups that represent protected attributes, such as gender and age. We propose utilizing these groups in different stages of bias mitigation, including pre-processing, in-processing, and evaluation. Using databases in and out-of-distribution scenarios, it is possible to identify that the method can create groups that represent gender in both databases and reduce in 4.44% the difference between the gender attribute in-distribution and 6.16% in out-of-distribution. However, the model lacks robustness in handling age attributes, underscoring the need for more fundamentally fair and robust Foundation models. These findings suggest a role in promoting fairness assessment in scenarios where we lack knowledge of attributes, contributing to the development of more equitable medical diagnostics.