CLASOct 21, 2019

Text Matters but Speech Influences: A Computational Analysis of Syntactic Ambiguity Resolution

arXiv:1910.09275v32 citations
Originality Incremental advance
AI Analysis

This work addresses a challenging problem in linguistics and spoken language understanding systems, though it is incremental in applying existing attention mechanisms to a specific domain.

The paper tackled syntactic ambiguity resolution in spoken language understanding by developing attentive recurrent neural networks that integrate acoustic and textual features, achieving significantly superior performance with co-attention frameworks on a Korean speech corpus.

Analyzing how human beings resolve syntactic ambiguity has long been an issue of interest in the field of linguistics. It is, at the same time, one of the most challenging issues for spoken language understanding (SLU) systems as well. As syntactic ambiguity is intertwined with issues regarding prosody and semantics, the computational approach toward speech intention identification is expected to benefit from the observations of the human language processing mechanism. In this regard, we address the task with attentive recurrent neural networks that exploit acoustic and textual features simultaneously and reveal how the modalities interact with each other to derive sentence meaning. Utilizing a speech corpus recorded on Korean scripts of syntactically ambiguous utterances, we revealed that co-attention frameworks, namely multi-hop attention and cross-attention, show significantly superior performance in disambiguating speech intention. With further analysis, we demonstrate that the computational models reflect the internal relationship between auditory and linguistic processes.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes