CLApr 24, 2020

Syntactic Data Augmentation Increases Robustness to Inference Heuristics

arXiv:2004.11999v11057 citations
AI Analysis

This addresses robustness issues in NLP models for tasks requiring syntactic understanding, though it is incremental as it builds on existing data augmentation techniques.

The paper tackled the problem of pretrained models like BERT lacking sensitivity to word order in natural language inference by augmenting training data with syntactically generated examples, resulting in improved accuracy on controlled challenge sets from 0.28 to 0.73 without harming standard test performance.

Pretrained neural models such as BERT, when fine-tuned to perform natural language inference (NLI), often show high accuracy on standard datasets, but display a surprising lack of sensitivity to word order on controlled challenge sets. We hypothesize that this issue is not primarily caused by the pretrained model's limitations, but rather by the paucity of crowdsourced NLI examples that might convey the importance of syntactic structure at the fine-tuning stage. We explore several methods to augment standard training sets with syntactically informative examples, generated by applying syntactic transformations to sentences from the MNLI corpus. The best-performing augmentation method, subject/object inversion, improved BERT's accuracy on controlled examples that diagnose sensitivity to word order from 0.28 to 0.73, without affecting performance on the MNLI test set. This improvement generalized beyond the particular construction used for data augmentation, suggesting that augmentation causes BERT to recruit abstract syntactic representations.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes