LGSTJun 7, 2022

How does overparametrization affect performance on minority groups?

arXiv:2206.03515v12 citationsh-index: 24
Originality Incremental advance
AI Analysis

This addresses fairness and performance disparities in machine learning for minority subgroups, providing theoretical insights into an empirical phenomenon.

The paper investigates how overparameterization affects model performance on minority groups, showing theoretically that overparameterization always improves minority group performance when regression functions differ between groups.

The benefits of overparameterization for the overall performance of modern machine learning (ML) models are well known. However, the effect of overparameterization at a more granular level of data subgroups is less understood. Recent empirical studies demonstrate encouraging results: (i) when groups are not known, overparameterized models trained with empirical risk minimization (ERM) perform better on minority groups; (ii) when groups are known, ERM on data subsampled to equalize group sizes yields state-of-the-art worst-group-accuracy in the overparameterized regime. In this paper, we complement these empirical studies with a theoretical investigation of the risk of overparameterized random feature models on minority groups. In a setting in which the regression functions for the majority and minority groups are different, we show that overparameterization always improves minority group performance.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes