CLMay 9, 2020

LinCE: A Centralized Benchmark for Linguistic Code-switching Evaluation

arXiv:2005.04322v11024 citations
AI Analysis

This addresses the problem of fragmented research and limited generalizability in code-switching evaluation for the NLP community, though it is incremental as it builds on existing corpora and models.

The paper tackles the lack of a centralized benchmark for linguistic code-switching in NLP by proposing LinCE, which combines ten corpora across four language pairs and four tasks, resulting in an online platform for real-time result comparison and baseline scores from models like LSTM, ELMo, and multilingual BERT.

Recent trends in NLP research have raised an interest in linguistic code-switching (CS); modern approaches have been proposed to solve a wide range of NLP tasks on multiple language pairs. Unfortunately, these proposed methods are hardly generalizable to different code-switched languages. In addition, it is unclear whether a model architecture is applicable for a different task while still being compatible with the code-switching setting. This is mainly because of the lack of a centralized benchmark and the sparse corpora that researchers employ based on their specific needs and interests. To facilitate research in this direction, we propose a centralized benchmark for Linguistic Code-switching Evaluation (LinCE) that combines ten corpora covering four different code-switched language pairs (i.e., Spanish-English, Nepali-English, Hindi-English, and Modern Standard Arabic-Egyptian Arabic) and four tasks (i.e., language identification, named entity recognition, part-of-speech tagging, and sentiment analysis). As part of the benchmark centralization effort, we provide an online platform at ritual.uh.edu/lince, where researchers can submit their results while comparing with others in real-time. In addition, we provide the scores of different popular models, including LSTM, ELMo, and multilingual BERT so that the NLP community can compare against state-of-the-art systems. LinCE is a continuous effort, and we will expand it with more low-resource languages and tasks.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes