LGCVJan 15, 2021

Counterfactual Generative Networks

arXiv:2101.06046v1147 citations
Originality Incremental advance
AI Analysis

This work addresses the issue of robust and interpretable classifiers for image classification by exposing causal structures, though it builds incrementally on existing generative modeling advances.

The paper tackled the problem of neural networks learning spurious correlations by proposing a generative model that decomposes images into independent causal mechanisms for shape, texture, and background, enabling counterfactual image generation. The result showed improved out-of-distribution robustness with only a marginal performance drop on original tasks, demonstrated on MNIST and ImageNet datasets.

Neural networks are prone to learning shortcuts -- they often model simple correlations, ignoring more complex ones that potentially generalize better. Prior works on image classification show that instead of learning a connection to object shape, deep classifiers tend to exploit spurious correlations with low-level texture or the background for solving the classification task. In this work, we take a step towards more robust and interpretable classifiers that explicitly expose the task's causal structure. Building on current advances in deep generative modeling, we propose to decompose the image generation process into independent causal mechanisms that we train without direct supervision. By exploiting appropriate inductive biases, these mechanisms disentangle object shape, object texture, and background; hence, they allow for generating counterfactual images. We demonstrate the ability of our model to generate such images on MNIST and ImageNet. Further, we show that the counterfactual images can improve out-of-distribution robustness with a marginal drop in performance on the original classification task, despite being synthetic. Lastly, our generative model can be trained efficiently on a single GPU, exploiting common pre-trained models as inductive biases.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes