LGAIOct 24, 2024

From Efficiency to Equity: Measuring Fairness in Preference Learning

arXiv:2410.18841v13 citationsh-index: 7
Originality Incremental advance
AI Analysis

This work addresses the problem of potential epistemic injustices in AI systems for developers and ethicists, offering a domain-specific framework for improving fairness in preference learning.

The paper tackles the problem of ensuring fairness in AI preference learning models by introducing a framework to evaluate epistemic fairness using metrics adapted from economic inequality measures, and it reveals variations in model performance across users while exploring mitigation techniques.

As AI systems, particularly generative models, increasingly influence decision-making, ensuring that they are able to fairly represent diverse human preferences becomes crucial. This paper introduces a novel framework for evaluating epistemic fairness in preference learning models inspired by economic theories of inequality and Rawlsian justice. We propose metrics adapted from the Gini Coefficient, Atkinson Index, and Kuznets Ratio to quantify fairness in these models. We validate our approach using two datasets: a custom visual preference dataset (AI-EDI-Space) and the Jester Jokes dataset. Our analysis reveals variations in model performance across users, highlighting potential epistemic injustices. We explore pre-processing and in-processing techniques to mitigate these inequalities, demonstrating a complex relationship between model efficiency and fairness. This work contributes to AI ethics by providing a framework for evaluating and improving epistemic fairness in preference learning models, offering insights for developing more inclusive AI systems in contexts where diverse human preferences are crucial.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes