SE LGDec 20, 2024

Data Preparation for Fairness-Performance Trade-Offs: A Practitioner-Friendly Alternative?

Gianmario Voria, Rebecca Di Matteo, Giammaria Giordano, Gemma Catolino, Fabio Palomba

arXiv:2412.15920v11.8h-index: 18

Originality Incremental advance

AI Analysis

This work addresses fairness and bias issues in ML systems for practitioners, but it appears incremental as it builds on existing data preparation and pre-processing approaches.

The paper tackles the problem of bias in machine learning by proposing fairness-aware data preparation practices as a more accessible alternative to standard pre-processing methods, aiming to enhance both fairness and performance through an optimization technique called FATE.

As machine learning (ML) systems are increasingly adopted across industries, addressing fairness and bias has become essential. While many solutions focus on ethical challenges in ML, recent studies highlight that data itself is a major source of bias. Pre-processing techniques, which mitigate bias before training, are effective but may impact model performance and pose integration difficulties. In contrast, fairness-aware Data Preparation practices are both familiar to practitioners and easier to implement, providing a more accessible approach to reducing bias. Objective. This registered report proposes an empirical evaluation of how optimally selected fairness-aware practices, applied in early ML lifecycle stages, can enhance both fairness and performance, potentially outperforming standard pre-processing bias mitigation methods. Method. To this end, we will introduce FATE, an optimization technique for selecting 'Data Preparation' pipelines that optimize fairness and performance. Using FATE, we will analyze the fairness-performance trade-off, comparing pipelines selected by FATE with results by pre-processing bias mitigation techniques.

View on arXiv PDF

Similar