MLLGAug 18, 2021

Semantic Perturbations with Normalizing Flows for Improved Generalization

arXiv:2108.07958v113 citations
Originality Highly original
AI Analysis

This work addresses the need for domain-agnostic data augmentation to improve generalization in machine learning, offering a novel approach that outperforms existing methods, particularly in data-scarce scenarios.

The paper tackled the problem of overfitting in deep neural networks by introducing an unsupervised data augmentation method using normalizing flows for on-manifold perturbations in latent space, achieving 96.6% test accuracy on CIFAR-10 with ResNet-18 and 10-25% relative improvement in low data regimes.

Data augmentation is a widely adopted technique for avoiding overfitting when training deep neural networks. However, this approach requires domain-specific knowledge and is often limited to a fixed set of hard-coded transformations. Recently, several works proposed to use generative models for generating semantically meaningful perturbations to train a classifier. However, because accurate encoding and decoding are critical, these methods, which use architectures that approximate the latent-variable inference, remained limited to pilot studies on small datasets. Exploiting the exactly reversible encoder-decoder structure of normalizing flows, we perform on-manifold perturbations in the latent space to define fully unsupervised data augmentations. We demonstrate that such perturbations match the performance of advanced data augmentation techniques -- reaching 96.6% test accuracy for CIFAR-10 using ResNet-18 and outperform existing methods, particularly in low data regimes -- yielding 10--25% relative improvement of test accuracy from classical training. We find that our latent adversarial perturbations adaptive to the classifier throughout its training are most effective, yielding the first test accuracy improvement results on real-world datasets -- CIFAR-10/100 -- via latent-space perturbations.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes