CV | AI | Nov 1, 2021

Using Synthetic Images To Uncover Population Biases In Facial Landmarks Detection

arXiv:2111.01683v1 | 1 citation
Originality: Incremental advance
AI Analysis

The work addresses bias detection in data-hungry applications such as facial landmark detection and offers a practical step toward improving model fairness. It is rated incremental because it builds on existing synthetic data methods.

The paper addresses the detection of population biases in trained models when real test data is scarce. It proposes synthetic test sets and shows, on facial landmark detection, that the biases observed on real datasets also appear on a carefully designed synthetic one, so a model's weak spots can be identified efficiently.

To analyze a trained model's performance and identify its weak spots, one has to set aside a portion of the data for testing. The test set has to be large enough to detect statistically significant biases with respect to all the relevant sub-groups in the target population. This requirement may be difficult to satisfy, especially in data-hungry applications. We propose to overcome this difficulty by generating a synthetic test set. We use the face landmarks detection task to validate our proposal by showing that all the biases observed on real datasets are also seen on a carefully designed synthetic dataset. This shows that synthetic test sets can efficiently detect a model's weak spots and overcome the limitations of real test sets in terms of quantity and/or diversity.
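The requirement that a test set be large enough to reveal statistically significant sub-group differences can be illustrated with a small sketch. The code below is not the paper's procedure. It assumes a two-sided permutation test on per-image landmark error, and the group names, error values, and sample sizes are hypothetical, drawn from a seeded random generator purely for illustration.

```python
import random


def mean(values):
    """Arithmetic mean of a non-empty sequence of floats."""
    return sum(values) / len(values)


def permutation_p_value(errors_a, errors_b, n_permutations=2000, seed=0):
    """Two-sided permutation test for a difference in mean error
    between two sub-groups (an assumed choice of test, not the paper's)."""
    rng = random.Random(seed)
    observed = abs(mean(errors_a) - mean(errors_b))
    pooled = list(errors_a) + list(errors_b)
    n_a = len(errors_a)
    hits = 0
    for _ in range(n_permutations):
        # Randomly reassign group labels and recompute the gap.
        rng.shuffle(pooled)
        diff = abs(mean(pooled[:n_a]) - mean(pooled[n_a:]))
        if diff >= observed:
            hits += 1
    # Add-one smoothing keeps the estimated p-value strictly positive.
    return (hits + 1) / (n_permutations + 1)


# Hypothetical per-image normalized landmark errors for three sub-groups.
data_rng = random.Random(42)
group_a = [data_rng.gauss(0.040, 0.010) for _ in range(200)]
group_b = [data_rng.gauss(0.050, 0.010) for _ in range(200)]  # higher mean error
group_c = [data_rng.gauss(0.040, 0.010) for _ in range(200)]  # same mean as group_a

p_biased = permutation_p_value(group_a, group_b)
p_unbiased = permutation_p_value(group_a, group_c)
print(f"group_a vs group_b: p = {p_biased:.4f}")
print(f"group_a vs group_c: p = {p_unbiased:.4f}")
```

With fewer images per sub-group, the same gap in mean error becomes harder to distinguish from chance, which is the limitation a larger synthetic test set is meant to relieve.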

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.
