CLFeb 3, 2025

Main Predicate and Their Arguments as Explanation Signals For Intent Classification

arXiv:2502.01270v111 citationsh-index: 3NAACL
Originality Incremental advance
AI Analysis

This addresses the problem of explainability in intent classification for conversational agents, though it is incremental as it builds on existing datasets and methods.

The paper tackles the lack of explainability in intent classification for chatbots by automatically augmenting datasets with explanation signals based on main predicates and their arguments, creating a 21k-instance dataset and showing that guiding models with these signals improves plausibility scores by 3-4%.

Intent classification is crucial for conversational agents (chatbots), and deep learning models perform well in this area. However, little research has been done on the explainability of intent classification due to the absence of suitable benchmark data. Human annotation of explanation signals in text samples is time-consuming and costly. However, from inspection of data on intent classification, we see that, more often than not, the main verb denotes the action, and the direct object indicates the domain of conversation, serving as explanation signals for intent. This observation enables us to hypothesize that the main predicate in the text utterances, along with the arguments of the main predicate, can serve as explanation signals. Leveraging this, we introduce a new technique to automatically augment text samples from intent classification datasets with word-level explanations. We mark main predicates (primarily verbs) and their arguments (dependency relations) as explanation signals in benchmark intent classification datasets ATIS and SNIPS, creating a unique 21k-instance dataset for explainability. Further, we experiment with deep learning and language models. We observe that models that work well for classification do not perform well in explainability metrics like plausibility and faithfulness. We also observe that guiding models to focus on explanation signals from our dataset during training improves the plausibility Token F1 score by 3-4%, improving the model's reasoning.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes