LGOct 26, 2021

On the Effects of Artificial Data Modification

arXiv:2110.13968v23 citations
Originality Synthesis-oriented
AI Analysis

This work addresses biases in evaluation methods for vision models, which is important for researchers in computer vision and machine learning, though it is incremental as it corrects existing approaches rather than introducing a new paradigm.

The paper investigates the assumptions that data modification artifacts are detrimental in training and negligible in evaluation, showing that current shape bias and occlusion robustness measures are biased and proposing a fairer alternative for robustness, arguing that artifacts should be understood and exploited rather than eliminated.

Data distortion is commonly applied in vision models during both training (e.g methods like MixUp and CutMix) and evaluation (e.g. shape-texture bias and robustness). This data modification can introduce artificial information. It is often assumed that the resulting artefacts are detrimental to training, whilst being negligible when analysing models. We investigate these assumptions and conclude that in some cases they are unfounded and lead to incorrect results. Specifically, we show current shape bias identification methods and occlusion robustness measures are biased and propose a fairer alternative for the latter. Subsequently, through a series of experiments we seek to correct and strengthen the community's perception of how augmenting affects learning of vision models. Based on our empirical results we argue that the impact of the artefacts must be understood and exploited rather than eliminated.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes