CLAug 24, 2021

Density-Based Dynamic Curriculum Learning for Intent Detection

arXiv:2108.10674v120 citations
Originality Incremental advance
AI Analysis

This work addresses a specific issue in intent detection for natural language processing applications, representing an incremental improvement over existing methods.

The paper tackles the problem of pre-trained language models overfitting simple samples and failing to learn complex ones in intent detection by proposing a density-based dynamic curriculum learning model that assigns difficulty levels based on eigenvector density and adjusts sample proportions during training, resulting in significant improvements over strong baselines on three open datasets.

Pre-trained language models have achieved noticeable performance on the intent detection task. However, due to assigning an identical weight to each sample, they suffer from the overfitting of simple samples and the failure to learn complex samples well. To handle this problem, we propose a density-based dynamic curriculum learning model. Our model defines the sample's difficulty level according to their eigenvectors' density. In this way, we exploit the overall distribution of all samples' eigenvectors simultaneously. Then we apply a dynamic curriculum learning strategy, which pays distinct attention to samples of various difficulty levels and alters the proportion of samples during the training process. Through the above operation, simple samples are well-trained, and complex samples are enhanced. Experiments on three open datasets verify that the proposed density-based algorithm can distinguish simple and complex samples significantly. Besides, our model obtains obvious improvement over the strong baselines.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes