CLSep 3, 2017

Disentangling ASR and MT Errors in Speech Translation

arXiv:1709.00678v1617 citations
Originality Synthesis-oriented
AI Analysis

This work addresses error analysis in speech translation for researchers and practitioners, but it is incremental as it builds on existing methods for error detection.

The paper tackled automatic quality assessment for spoken language translation by investigating errors from transcription (ASR) and translation (MT) modules, proposing a single classifier with joint features and achieving evaluation on 2-class and 3-class labeling tasks.

The main aim of this paper is to investigate automatic quality assessment for spoken language translation (SLT). More precisely, we investigate SLT errors that can be due to transcription (ASR) or to translation (MT) modules. This paper investigates automatic detection of SLT errors using a single classifier based on joint ASR and MT features. We evaluate both 2-class (good/bad) and 3-class (good/badASR/badMT ) labeling tasks. The 3-class problem necessitates to disentangle ASR and MT errors in the speech translation output and we propose two label extraction methods for this non trivial step. This enables - as a by-product - qualitative analysis on the SLT errors and their origin (are they due to transcription or to translation step?) on our large in-house corpus for French-to-English speech translation.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes