LGCYJun 17, 2025

Fair for a few: Improving Fairness in Doubly Imbalanced Datasets

arXiv:2506.14306v1h-index: 24
Originality Incremental advance
AI Analysis

This addresses fairness issues in decision-making for datasets with dual imbalances, but appears incremental as it builds on existing debiasing methods.

The paper tackles fairness in doubly imbalanced datasets, where data is imbalanced in both labels and sensitive attributes, by proposing a multi-criteria solution to improve fairness and classification accuracy, though no concrete numbers are provided.

Fairness has been identified as an important aspect of Machine Learning and Artificial Intelligence solutions for decision making. Recent literature offers a variety of approaches for debiasing, however many of them fall short when the data collection is imbalanced. In this paper, we focus on a particular case, fairness in doubly imbalanced datasets, such that the data collection is imbalanced both for the label and the groups in the sensitive attribute. Firstly, we present an exploratory analysis to illustrate limitations in debiasing on a doubly imbalanced dataset. Then, a multi-criteria based solution is proposed for finding the most suitable sampling and distribution for label and sensitive attribute, in terms of fairness and classification accuracy

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes