LGJan 12, 2022

Preventing Manifold Intrusion with Locality: Local Mixup

arXiv:2201.04368v117 citations
Originality Incremental advance
AI Analysis

This addresses issues in data augmentation for computer vision, but it is incremental as it modifies an existing method.

The paper tackles the problem of Mixup producing out-of-distribution samples and contradictions by introducing Local Mixup, which weights down distant samples in loss computation, and shows it improves test accuracy on standardized computer vision benchmarks.

Mixup is a data-dependent regularization technique that consists in linearly interpolating input samples and associated outputs. It has been shown to improve accuracy when used to train on standard machine learning datasets. However, authors have pointed out that Mixup can produce out-of-distribution virtual samples and even contradictions in the augmented training set, potentially resulting in adversarial effects. In this paper, we introduce Local Mixup in which distant input samples are weighted down when computing the loss. In constrained settings we demonstrate that Local Mixup can create a trade-off between bias and variance, with the extreme cases reducing to vanilla training and classical Mixup. Using standardized computer vision benchmarks , we also show that Local Mixup can improve test accuracy.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes