CLLGSep 30, 2020

Cross-lingual Spoken Language Understanding with Regularized Representation Alignment

arXiv:2009.14510v11001 citations
Originality Incremental advance
AI Analysis

This work addresses cross-lingual spoken language understanding, offering incremental improvements for multilingual NLP applications.

The paper tackles the problem of imperfect cross-lingual representation alignment in spoken language understanding systems, proposing a regularization approach that improves performance in few-shot and zero-shot scenarios, achieving comparable results to full supervised training with only 3% of target language data.

Despite the promising results of current cross-lingual models for spoken language understanding systems, they still suffer from imperfect cross-lingual representation alignments between the source and target languages, which makes the performance sub-optimal. To cope with this issue, we propose a regularization approach to further align word-level and sentence-level representations across languages without any external resource. First, we regularize the representation of user utterances based on their corresponding labels. Second, we regularize the latent variable model (Liu et al., 2019) by leveraging adversarial training to disentangle the latent variables. Experiments on the cross-lingual spoken language understanding task show that our model outperforms current state-of-the-art methods in both few-shot and zero-shot scenarios, and our model, trained on a few-shot setting with only 3\% of the target language training data, achieves comparable performance to the supervised training with all the training data.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes