CLAIJun 28, 2021

Enhancing the Generalization for Intent Classification and Out-of-Domain Detection in SLU

arXiv:2106.14464v1718 citations
Originality Incremental advance
AI Analysis

This addresses the practical problem of detecting unsupported utterances in SLU systems without costly extra data collection, representing a strong domain-specific improvement.

The paper tackles intent classification and out-of-domain detection in spoken language understanding by proposing a domain-regularized module that reduces classifier overconfidence, achieving state-of-the-art performance on four datasets using only in-domain training data.

Intent classification is a major task in spoken language understanding (SLU). Since most models are built with pre-collected in-domain (IND) training utterances, their ability to detect unsupported out-of-domain (OOD) utterances has a critical effect in practical use. Recent works have shown that using extra data and labels can improve the OOD detection performance, yet it could be costly to collect such data. This paper proposes to train a model with only IND data while supporting both IND intent classification and OOD detection. Our method designs a novel domain-regularized module (DRM) to reduce the overconfident phenomenon of a vanilla classifier, achieving a better generalization in both cases. Besides, DRM can be used as a drop-in replacement for the last layer in any neural network-based intent classifier, providing a low-cost strategy for a significant improvement. The evaluation on four datasets shows that our method built on BERT and RoBERTa models achieves state-of-the-art performance against existing approaches and the strong baselines we created for the comparisons.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes