LGApr 8, 2023

Last-Layer Fairness Fine-tuning is Simple and Effective for Neural Networks

arXiv:2304.03935v228 citationsh-index: 54
Originality Incremental advance
AI Analysis

This provides an efficient and inexpensive method for training fair neural networks, addressing a key open question in algorithmic fairness for deep learning applications.

The paper tackles the challenge of severe over-fitting to fairness criteria when adding fairness constraints to deep neural networks, and finds that last-layer fine-tuning alone can effectively promote fairness in these models.

As machine learning has been deployed ubiquitously across applications in modern data science, algorithmic fairness has become a great concern. Among them, imposing fairness constraints during learning, i.e. in-processing fair training, has been a popular type of training method because they don't require accessing sensitive attributes during test time in contrast to post-processing methods. While this has been extensively studied in classical machine learning models, their impact on deep neural networks remains unclear. Recent research has shown that adding fairness constraints to the objective function leads to severe over-fitting to fairness criteria in large models, and how to solve this challenge is an important open question. To tackle this, we leverage the wisdom and power of pre-training and fine-tuning and develop a simple but novel framework to train fair neural networks in an efficient and inexpensive way -- last-layer fine-tuning alone can effectively promote fairness in deep neural networks. This framework offers valuable insights into representation learning for training fair neural networks.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes