CLLGMay 22, 2023

Improving Classifier Robustness through Active Generation of Pairwise Counterfactuals

arXiv:2305.13535v11 citations
Originality Highly original
AI Analysis

This addresses the problem of high labeling costs in counterfactual data augmentation for NLP practitioners, offering a scalable solution with significant performance gains.

The paper tackles the challenge of efficiently generating and labeling counterfactuals for data augmentation to improve classifier robustness, achieving an 18-20% robustness improvement and 14-21% error reduction on out-of-domain datasets with only 10% human-annotated data.

Counterfactual Data Augmentation (CDA) is a commonly used technique for improving robustness in natural language classifiers. However, one fundamental challenge is how to discover meaningful counterfactuals and efficiently label them, with minimal human labeling cost. Most existing methods either completely rely on human-annotated labels, an expensive process which limits the scale of counterfactual data, or implicitly assume label invariance, which may mislead the model with incorrect labels. In this paper, we present a novel framework that utilizes counterfactual generative models to generate a large number of diverse counterfactuals by actively sampling from regions of uncertainty, and then automatically label them with a learned pairwise classifier. Our key insight is that we can more correctly label the generated counterfactuals by training a pairwise classifier that interpolates the relationship between the original example and the counterfactual. We demonstrate that with a small amount of human-annotated counterfactual data (10%), we can generate a counterfactual augmentation dataset with learned labels, that provides an 18-20% improvement in robustness and a 14-21% reduction in errors on 6 out-of-domain datasets, comparable to that of a fully human-annotated counterfactual dataset for both sentiment classification and question paraphrase tasks.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes