LGAIJul 27, 2021

Adversarial Stacked Auto-Encoders for Fair Representation Learning

arXiv:2107.12826v14 citations
Originality Incremental advance
AI Analysis

This work addresses fairness in machine learning for applications where biased data can lead to discriminatory outcomes, representing an incremental improvement over prior methods.

The paper tackles the problem of machine learning models perpetuating biases by proposing a method to learn fair latent representations that improve fairness metrics while maintaining usability for downstream tasks. The results show that stacking auto-encoders and enforcing fairness at multiple latent spaces leads to improved fairness compared to existing approaches.

Training machine learning models with the only accuracy as a final goal may promote prejudices and discriminatory behaviors embedded in the data. One solution is to learn latent representations that fulfill specific fairness metrics. Different types of learning methods are employed to map data into the fair representational space. The main purpose is to learn a latent representation of data that scores well on a fairness metric while maintaining the usability for the downstream task. In this paper, we propose a new fair representation learning approach that leverages different levels of representation of data to tighten the fairness bounds of the learned representation. Our results show that stacking different auto-encoders and enforcing fairness at different latent spaces result in an improvement of fairness compared to other existing approaches.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes