CLAIIRLGSDASOct 21, 2020

Contextualized Attention-based Knowledge Transfer for Spoken Conversational Question Answering

arXiv:2010.11066v442 citations
Originality Incremental advance
AI Analysis

This work addresses a domain-specific problem for SCQA systems by mitigating ASR noise, but it is incremental as it builds on existing distillation and attention methods.

The paper tackles performance degradation in spoken conversational question answering (SCQA) due to noisy ASR transcriptions by proposing CADNet, a contextualized attention-based distillation approach, which achieves remarkable performance improvements on the Spoken-CoQA dataset.

Spoken conversational question answering (SCQA) requires machines to model complex dialogue flow given the speech utterances and text corpora. Different from traditional text question answering (QA) tasks, SCQA involves audio signal processing, passage comprehension, and contextual understanding. However, ASR systems introduce unexpected noisy signals to the transcriptions, which result in performance degradation on SCQA. To overcome the problem, we propose CADNet, a novel contextualized attention-based distillation approach, which applies both cross-attention and self-attention to obtain ASR-robust contextualized embedding representations of the passage and dialogue history for performance improvements. We also introduce the spoken conventional knowledge distillation framework to distill the ASR-robust knowledge from the estimated probabilities of the teacher model to the student. We conduct extensive experiments on the Spoken-CoQA dataset and demonstrate that our approach achieves remarkable performance in this task.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes