CLLGOct 28, 2020

Handling Class Imbalance in Low-Resource Dialogue Systems by Combining Few-Shot Classification and Interpolation

arXiv:2010.15090v16 citations
Originality Incremental advance
AI Analysis

This work addresses data imbalance issues in dialogue systems for low-resource settings, presenting an incremental improvement over existing methods.

The paper tackles class imbalance in low-resource dialogue systems by developing a pairwise learning framework that combines few-shot classification and interpolation of utterance representations, resulting in significant improvements in macro-F1 scores across multiple neural architectures and datasets.

Utterance classification performance in low-resource dialogue systems is constrained by an inevitably high degree of data imbalance in class labels. We present a new end-to-end pairwise learning framework that is designed specifically to tackle this phenomenon by inducing a few-shot classification capability in the utterance representations and augmenting data through an interpolation of utterance representations. Our approach is a general purpose training methodology, agnostic to the neural architecture used for encoding utterances. We show significant improvements in macro-F1 score over standard cross-entropy training for three different neural architectures, demonstrating improvements on a Virtual Patient dialogue dataset as well as a low-resourced emulation of the Switchboard dialogue act classification dataset.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes