CLNov 28, 2022

Contrastive Novelty-Augmented Learning: Anticipating Outliers with Large Language Models

arXiv:2211.15718v2223 citationsh-index: 42
Originality Incremental advance
AI Analysis

This addresses the issue of overconfidence in selective prediction for text classification, enabling models to better abstain on novel classes, though it is incremental as it builds on existing methods for OOD detection.

The paper tackled the problem of text classification models being overly confident on unseen classes by introducing Contrastive Novelty-Augmented Learning (CoNAL), a method that generates out-of-distribution examples using large language models and trains classifiers to reduce confidence on them, resulting in improvements of 2.3% AUAC and 5.5% AUROC across four NLP datasets.

In many task settings, text classification models are likely to encounter examples from novel classes on which they cannot predict correctly. Selective prediction, in which models abstain on low-confidence examples, provides a possible solution, but existing models are often overly confident on unseen classes. To remedy this overconfidence, we introduce Contrastive Novelty-Augmented Learning (CoNAL), a two-step method that generates OOD examples representative of novel classes, then trains to decrease confidence on them. First, we generate OOD examples by prompting a large language model twice: we prompt it to enumerate relevant novel classes, then generate examples from each novel class matching the task format. Second, we train a classifier with a novel contrastive objective that encourages lower confidence on generated OOD examples than training examples. When trained with CoNAL, classifiers improve in their ability to detect and abstain on novel class examples over prior methods by an average of 2.3% in terms of accuracy under the accuracy-coverage curve (AUAC) and 5.5% AUROC across 4 NLP datasets, with no cost to in-distribution accuracy.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes