CLJun 6, 2021

Embracing Ambiguity: Shifting the Training Target of NLI Models

arXiv:2106.03020v1715 citations
Originality Incremental advance
AI Analysis

This work addresses ambiguity in NLI for NLP researchers, offering an incremental improvement by shifting training targets to better capture linguistic uncertainty.

The paper tackles the problem of ambiguous labels in Natural Language Inference datasets by training models directly on estimated label distributions instead of gold-labels, showing that this approach reduces ChaosNLI divergence scores and improves performance and representations for downstream tasks.

Natural Language Inference (NLI) datasets contain examples with highly ambiguous labels. While many research works do not pay much attention to this fact, several recent efforts have been made to acknowledge and embrace the existence of ambiguity, such as UNLI and ChaosNLI. In this paper, we explore the option of training directly on the estimated label distribution of the annotators in the NLI task, using a learning loss based on this ambiguity distribution instead of the gold-labels. We prepare AmbiNLI, a trial dataset obtained from readily available sources, and show it is possible to reduce ChaosNLI divergence scores when finetuning on this data, a promising first step towards learning how to capture linguistic ambiguity. Additionally, we show that training on the same amount of data but targeting the ambiguity distribution instead of gold-labels can result in models that achieve higher performance and learn better representations for downstream tasks.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes