LGAICVCYITJun 29, 2024

Towards Formalizing Spuriousness of Biased Datasets Using Partial Information Decomposition

arXiv:2407.00482v25 citationsHas Code
AI Analysis

This work addresses the issue of dataset bias for machine learning practitioners by providing a tool to anticipate and measure spuriousness before model training, though it is incremental as it builds on existing information theory concepts.

The authors tackled the problem of spurious associations in datasets by proposing a framework to preemptively disentangle these associations using Partial Information Decomposition, and they demonstrated that their measure of dataset spuriousness correlates with post-training model generalization metrics like worst-group accuracy across 6 benchmark datasets.

Spuriousness arises when there is an association between two or more variables in a dataset that are not causally related. In this work, we propose an explainability framework to preemptively disentangle the nature of such spurious associations in a dataset before model training. We leverage a body of work in information theory called Partial Information Decomposition (PID) to decompose the total information about the target into four non-negative quantities, namely unique information (in core and spurious features, respectively), redundant information, and synergistic information. Our framework helps anticipate when the core or spurious feature is indispensable, when either suffices, and when both are jointly needed for an optimal classifier trained on the dataset. Next, we leverage this decomposition to propose a novel measure of the spuriousness of a dataset. We arrive at this measure systematically by examining several candidate measures, and demonstrating what they capture and miss through intuitive canonical examples and counterexamples. Our framework Spurious Disentangler consists of segmentation, dimensionality reduction, and estimation modules, with capabilities to specifically handle high-dimensional image data efficiently. Finally, we also perform empirical evaluation to demonstrate the trends of unique, redundant, and synergistic information, as well as our proposed spuriousness measure across $6$ benchmark datasets under various experimental settings. We observe an agreement between our preemptive measure of dataset spuriousness and post-training model generalization metrics such as worst-group accuracy, further supporting our proposition. The code is available at https://github.com/Barproda/spuriousness-disentangler.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes