LGIVMLApr 28, 2025

AI Alignment in Medical Imaging: Unveiling Hidden Biases Through Counterfactual Analysis

arXiv:2504.19621v11 citationsh-index: 2Has Code
Originality Incremental advance
AI Analysis

This work addresses bias risks in ML diagnostic systems for healthcare, offering a tool to improve generalization across demographic groups, though it is incremental as it builds on existing counterfactual fairness concepts.

The paper tackles the problem of biases in medical imaging ML models by introducing a statistical framework to evaluate dependency on sensitive attributes like demographics, demonstrating that their approach outperforms standard baselines on datasets such as CheXpert and MIMIC-CXR.

Machine learning (ML) systems for medical imaging have demonstrated remarkable diagnostic capabilities, but their susceptibility to biases poses significant risks, since biases may negatively impact generalization performance. In this paper, we introduce a novel statistical framework to evaluate the dependency of medical imaging ML models on sensitive attributes, such as demographics. Our method leverages the concept of counterfactual invariance, measuring the extent to which a model's predictions remain unchanged under hypothetical changes to sensitive attributes. We present a practical algorithm that combines conditional latent diffusion models with statistical hypothesis testing to identify and quantify such biases without requiring direct access to counterfactual data. Through experiments on synthetic datasets and large-scale real-world medical imaging datasets, including \textsc{cheXpert} and MIMIC-CXR, we demonstrate that our approach aligns closely with counterfactual fairness principles and outperforms standard baselines. This work provides a robust tool to ensure that ML diagnostic systems generalize well, e.g., across demographic groups, offering a critical step towards AI safety in healthcare. Code: https://github.com/Neferpitou3871/AI-Alignment-Medical-Imaging.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes