CL LGSep 30, 2020

Cross-lingual Spoken Language Understanding with Regularized Representation Alignment

Zihan Liu, Genta Indra Winata, Peng Xu, Zhaojiang Lin, Pascale Fung

arXiv:2009.14510v131.11001 citationsHas Code

Originality Incremental advance

AI Analysis

This work addresses cross-lingual spoken language understanding, offering incremental improvements for multilingual NLP applications.

The paper tackles the problem of imperfect cross-lingual representation alignment in spoken language understanding systems, proposing a regularization approach that improves performance in few-shot and zero-shot scenarios, achieving comparable results to full supervised training with only 3% of target language data.

Despite the promising results of current cross-lingual models for spoken language understanding systems, they still suffer from imperfect cross-lingual representation alignments between the source and target languages, which makes the performance sub-optimal. To cope with this issue, we propose a regularization approach to further align word-level and sentence-level representations across languages without any external resource. First, we regularize the representation of user utterances based on their corresponding labels. Second, we regularize the latent variable model (Liu et al., 2019) by leveraging adversarial training to disentangle the latent variables. Experiments on the cross-lingual spoken language understanding task show that our model outperforms current state-of-the-art methods in both few-shot and zero-shot scenarios, and our model, trained on a few-shot setting with only 3\% of the target language training data, achieves comparable performance to the supervised training with all the training data.

View on arXiv PDF Code

Similar