CLSep 2, 2021

Coarse-To-Fine And Cross-Lingual ASR Transfer

arXiv:2109.00916v1
Originality Incremental advance
AI Analysis

This work addresses the challenge of ASR for low-resource languages like Czech by enabling efficient cross-lingual transfer, though it is incremental as it builds on existing transfer learning techniques.

The paper tackled the problem of training end-to-end neural automatic speech recognition (ASR) systems for low-resource languages by proposing a transfer learning method from English to Czech using an intermediate alphabet without accents, which reduced training time and word error rate (WER).

End-to-end neural automatic speech recognition systems achieved recently state-of-the-art results, but they require large datasets and extensive computing resources. Transfer learning has been proposed to overcome these difficulties even across languages, e.g., German ASR trained from an English model. We experiment with much less related languages, reusing an English model for Czech ASR. To simplify the transfer, we propose to use an intermediate alphabet, Czech without accents, and document that it is a highly effective strategy. The technique is also useful on Czech data alone, in the style of coarse-to-fine training. We achieve substantial eductions in training time as well as word error rate (WER).

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes