LG CYDec 4, 2024

Data Acquisition for Improving Model Fairness using Reinforcement Learning

arXiv:2412.03009v12.6h-index: 1

Originality Incremental advance

AI Analysis

This addresses fairness issues in critical decision-making domains like healthcare and finance, but it is incremental as it builds on existing data valuation and bias mitigation techniques.

The paper tackles the problem of improving fairness in machine learning models by proposing DataSift, a reinforcement learning-based data acquisition framework that orders data points to acquire additional labeled data efficiently, showing significant fairness improvements with few data points on real-world and synthetic datasets.

Machine learning systems are increasingly being used in critical decision making such as healthcare, finance, and criminal justice. Concerns around their fairness have resulted in several bias mitigation techniques that emphasize the need for high-quality data to ensure fairer decisions. However, the role of earlier stages of machine learning pipelines in mitigating model bias has not been explored well. In this paper, we focus on the task of acquiring additional labeled data points for training the downstream machine learning model to rapidly improve its fairness. Since not all data points in a data pool are equally beneficial to the task of fairness, we generate an ordering in which data points should be acquired. We present DataSift, a data acquisition framework based on the idea of data valuation that relies on partitioning and multi-armed bandits to determine the most valuable data points to acquire. Over several iterations, DataSift selects a partition and randomly samples a batch of data points from the selected partition, evaluates the benefit of acquiring the batch on model fairness, and updates the utility of partitions depending on the benefit. To further improve the effectiveness and efficiency of evaluating batches, we leverage influence functions that estimate the effect of acquiring a batch without retraining the model. We empirically evaluate DataSift on several real-world and synthetic datasets and show that the fairness of a machine learning model can be significantly improved even while acquiring a few data points.

View on arXiv PDF

Similar