ML LGOct 27, 2025

Understanding Fairness and Prediction Error through Subspace Decomposition and Influence Analysis

Enze Shi, Pankaj Bhagwat, Zhixian Yang, Linglong Kong, Bei Jiang

arXiv:2510.23935v1h-index: 8

Originality Incremental advance

AI Analysis

This work addresses fairness issues in ML models for applications where biased data leads to unfair predictions, offering a principled approach that is incremental in refining existing methods.

The paper tackles the problem of unfair outcomes in machine learning by proposing a framework that adjusts data representations to balance predictive utility and fairness, using subspace decomposition and influence analysis, with experiments showing effective fairness improvements while preserving performance.

Machine learning models have achieved widespread success but often inherit and amplify historical biases, resulting in unfair outcomes. Traditional fairness methods typically impose constraints at the prediction level, without addressing underlying biases in data representations. In this work, we propose a principled framework that adjusts data representations to balance predictive utility and fairness. Using sufficient dimension reduction, we decompose the feature space into target-relevant, sensitive, and shared components, and control the fairness-utility trade-off by selectively removing sensitive information. We provide a theoretical analysis of how prediction error and fairness gaps evolve as shared subspaces are added, and employ influence functions to quantify their effects on the asymptotic behavior of parameter estimates. Experiments on both synthetic and real-world datasets validate our theoretical insights and show that the proposed method effectively improves fairness while preserving predictive performance.

View on arXiv PDF

Similar