CLLGAug 9, 2023

Slot Induction via Pre-trained Language Model Probing and Multi-level Contrastive Learning

AmazonSalesforce
arXiv:2308.04712v1193 citationsh-index: 32
Originality Incremental advance
AI Analysis

This work addresses the challenge of reducing annotation costs for slot labeling in natural language understanding, which is incremental as it builds on existing methods but introduces novel techniques for unsupervised learning.

The paper tackles the problem of slot induction in task-oriented dialogue systems without token-level annotations by leveraging unsupervised pre-trained language model probing and multi-level contrastive learning, achieving competitive performance that bridges gaps with supervised models on benchmark datasets.

Recent advanced methods in Natural Language Understanding for Task-oriented Dialogue (TOD) Systems (e.g., intent detection and slot filling) require a large amount of annotated data to achieve competitive performance. In reality, token-level annotations (slot labels) are time-consuming and difficult to acquire. In this work, we study the Slot Induction (SI) task whose objective is to induce slot boundaries without explicit knowledge of token-level slot annotations. We propose leveraging Unsupervised Pre-trained Language Model (PLM) Probing and Contrastive Learning mechanism to exploit (1) unsupervised semantic knowledge extracted from PLM, and (2) additional sentence-level intent label signals available from TOD. Our approach is shown to be effective in SI task and capable of bridging the gaps with token-level supervised models on two NLU benchmark datasets. When generalized to emerging intents, our SI objectives also provide enhanced slot label representations, leading to improved performance on the Slot Filling tasks.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes