AIOct 23, 2025

Bias by Design? How Data Practices Shape Fairness in AI Healthcare Systems

Anna Arias-Duart, Maria Eugenia Cardello, Atia Cortés

arXiv:2510.20332v13.3h-index: 5

Originality Synthesis-oriented

AI Analysis

This addresses fairness issues in AI healthcare systems for clinicians and patients, but it is incremental as it builds on existing bias detection methods.

The paper tackled the problem of biased data collection practices limiting AI integration in healthcare, identifying multiple bias types in clinical data and providing practical recommendations to improve fairness.

Artificial intelligence (AI) holds great promise for transforming healthcare. However, despite significant advances, the integration of AI solutions into real-world clinical practice remains limited. A major barrier is the quality and fairness of training data, which is often compromised by biased data collection practices. This paper draws on insights from the AI4HealthyAging project, part of Spain's national R&D initiative, where our task was to detect biases during clinical data collection. We identify several types of bias across multiple use cases, including historical, representation, and measurement biases. These biases manifest in variables such as sex, gender, age, habitat, socioeconomic status, equipment, and labeling. We conclude with practical recommendations for improving the fairness and robustness of clinical problem design and data collection. We hope that our findings and experience contribute to guiding future projects in the development of fairer AI systems in healthcare.

View on arXiv PDF

Similar