CY LGOct 13, 2023

Identifying and examining machine learning biases on Adult dataset

arXiv:2310.09373v12.34 citationsh-index: 3

Originality Synthesis-oriented

AI Analysis

This work addresses fairness and transparency issues in AI for data-driven societies, though it is incremental as it applies existing ensemble methods to a known bias problem.

The study tackled bias in machine learning models on the Adult dataset, revealing a gender-based wage prediction disparity where predicted wages dropped from $902.91 for males to $774.31 for females, with Kullback-Leibler divergence scores above 0.13 indicating bias.

This research delves into the reduction of machine learning model bias through Ensemble Learning. Our rigorous methodology comprehensively assesses bias across various categorical variables, ultimately revealing a pronounced gender attribute bias. The empirical evidence unveils a substantial gender-based wage prediction disparity: wages predicted for males, initially at \$902.91, significantly decrease to \$774.31 when the gender attribute is alternated to females. Notably, Kullback-Leibler divergence scores point to gender bias, with values exceeding 0.13, predominantly within tree-based models. Employing Ensemble Learning elucidates the quest for fairness and transparency. Intriguingly, our findings reveal that the stacked model aligns with individual models, confirming the resilience of model bias. This study underscores ethical considerations and advocates the implementation of hybrid models for a data-driven society marked by impartiality and inclusivity.

View on arXiv PDF

Similar