LGApr 8, 2023

Last-Layer Fairness Fine-tuning is Simple and Effective for Neural Networks

Yuzhen Mao, Zhun Deng, Huaxiu Yao, Ting Ye, Kenji Kawaguchi, James Zou

arXiv:2304.03935v216.528 citationsh-index: 54Has Code

Originality Incremental advance

AI Analysis

This provides an efficient and inexpensive method for training fair neural networks, addressing a key open question in algorithmic fairness for deep learning applications.

The paper tackles the challenge of severe over-fitting to fairness criteria when adding fairness constraints to deep neural networks, and finds that last-layer fine-tuning alone can effectively promote fairness in these models.

As machine learning has been deployed ubiquitously across applications in modern data science, algorithmic fairness has become a great concern. Among them, imposing fairness constraints during learning, i.e. in-processing fair training, has been a popular type of training method because they don't require accessing sensitive attributes during test time in contrast to post-processing methods. While this has been extensively studied in classical machine learning models, their impact on deep neural networks remains unclear. Recent research has shown that adding fairness constraints to the objective function leads to severe over-fitting to fairness criteria in large models, and how to solve this challenge is an important open question. To tackle this, we leverage the wisdom and power of pre-training and fine-tuning and develop a simple but novel framework to train fair neural networks in an efficient and inexpensive way -- last-layer fine-tuning alone can effectively promote fairness in deep neural networks. This framework offers valuable insights into representation learning for training fair neural networks.

View on arXiv PDF Code

Similar