CLAug 27, 2021

ProtoInfoMax: Prototypical Networks with Mutual Information Maximization for Out-of-Domain Detection

arXiv:2108.12229v5662 citations
Originality Incremental advance
AI Analysis

This addresses a critical requirement for reliable NLP systems, such as dialogue systems, to prevent failures from unsupported inputs, though it appears incremental as it builds on existing Prototypical Networks.

The paper tackles the problem of detecting out-of-domain (OOD) inputs in NLP applications like intent classification, where zero OOD training data is available, by proposing ProtoInfoMax, which improves OOD detection performance by up to 20% in low-resource text classification settings.

The ability to detect Out-of-Domain (OOD) inputs has been a critical requirement in many real-world NLP applications. For example, intent classification in dialogue systems. The reason is that the inclusion of unsupported OOD inputs may lead to catastrophic failure of systems. However, it remains an empirical question whether current methods can tackle such problems reliably in a realistic scenario where zero OOD training data is available. In this study, we propose ProtoInfoMax, a new architecture that extends Prototypical Networks to simultaneously process in-domain and OOD sentences via Mutual Information Maximization (InfoMax) objective. Experimental results show that our proposed method can substantially improve performance up to 20% for OOD detection in low resource settings of text classification. We also show that ProtoInfoMax is less prone to typical overconfidence errors of Neural Networks, leading to more reliable prediction results.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes