CV LGAug 28, 2024

Does Data-Efficient Generalization Exacerbate Bias in Foundation Models?

Dilermando Queiroz, Anderson Carlos, Maíra Fatoretto, Luis Filipe Nakayama, André Anjos, Lilian Berton

arXiv:2408.16154v23.71 citationsh-index: 3

Originality Synthesis-oriented

AI Analysis

This highlights fairness risks in deploying foundation models in medical imaging with limited data, which is an incremental but important concern for healthcare applications.

The study investigated whether data-efficient generalization in foundation models exacerbates bias, finding that while RetFound reduced fairness gaps in AUC across gender and age groups compared to supervised learning, bias increased when data amounts decreased during fine-tuning.

Foundation models have emerged as robust models with label efficiency in diverse domains. In medical imaging, these models contribute to the advancement of medical diagnoses due to the difficulty in obtaining labeled data. However, it is unclear whether using a large amount of unlabeled data, biased by the presence of sensitive attributes during pre-training, influences the fairness of the model. This research examines the bias in the Foundation model (RetFound) when it is applied to fine-tune the Brazilian Multilabel Ophthalmological Dataset (BRSET), which has a different population than the pre-training dataset. The model evaluation, in comparison with supervised learning, shows that the Foundation Model has the potential to reduce the gap between the maximum AUC and minimum AUC evaluations across gender and age groups. However, in a data-efficient generalization, the model increases the bias when the data amount decreases. These findings suggest that when deploying a Foundation Model in real-life scenarios with limited data, the possibility of fairness issues should be considered.

View on arXiv PDF

Similar