CL AI LGApr 9, 2021

TransWiC at SemEval-2021 Task 2: Transformer-based Multilingual and Cross-lingual Word-in-Context Disambiguation

Hansi Hettiarachchi, Tharindu Ranasinghe

arXiv:2104.04632v131.5715 citations

Originality Synthesis-oriented

AI Analysis

This addresses the challenge of generalizing word sense disambiguation across languages for applications like question answering, though it is incremental as it builds on existing transformer methods.

The paper tackled the problem of word-in-context disambiguation across languages by using pretrained transformer models without language-specific resources, achieving 0.90 accuracy for English-English, close to the best result of 0.93.

Identifying whether a word carries the same meaning or different meaning in two contexts is an important research area in natural language processing which plays a significant role in many applications such as question answering, document summarisation, information retrieval and information extraction. Most of the previous work in this area rely on language-specific resources making it difficult to generalise across languages. Considering this limitation, our approach to SemEval-2021 Task 2 is based only on pretrained transformer models and does not use any language-specific processing and resources. Despite that, our best model achieves 0.90 accuracy for English-English subtask which is very compatible compared to the best result of the subtask; 0.93 accuracy. Our approach also achieves satisfactory results in other monolingual and cross-lingual language pairs as well.

View on arXiv PDF

Similar