CL · Oct 15, 2021

Span Detection for Aspect-Based Sentiment Analysis in Vietnamese

arXiv:2110.07833v1583 citations · Has Code
Originality: Synthesis-oriented
AI Analysis

The paper addresses a neglected sub-task in Vietnamese NLP that has practical applications, but its method is incremental.

The paper tackles the problem of detecting user opinion spans in Vietnamese aspect-based sentiment analysis by introducing a new dataset (UIT-ViSD4SA) with 35,396 annotated spans and proposing a BiLSTM-CRF system that achieves a 62.76% F1 score.

Aspect-based sentiment analysis plays an essential role in natural language processing and artificial intelligence. Recent research has focused on aspect detection and sentiment classification while ignoring the sub-task of detecting user opinion spans, which has enormous potential in practical applications. In this paper, we present a new Vietnamese dataset (UIT-ViSD4SA) consisting of 35,396 human-annotated spans on 11,122 feedback comments for evaluating span detection in aspect-based sentiment analysis. We also propose a novel system using Bidirectional Long Short-Term Memory (BiLSTM) with a Conditional Random Field (CRF) layer (BiLSTM-CRF) for the span detection task in Vietnamese aspect-based sentiment analysis. The best result is a 62.76% macro F1 score for span detection, obtained by BiLSTM-CRF with an embedding fusion of syllable embeddings, character embeddings, and contextual embeddings from XLM-RoBERTa. In future work, span detection will be extended to many NLP tasks such as constructive detection, emotion recognition, complaint analysis, and opinion mining. Our dataset is freely available at https://github.com/kimkim00/UIT-ViSD4SA for research purposes.
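
To make the reported metric concrete, here is a minimal sketch of how a span-level macro F1 could be computed from sequence-labeling output. It assumes BIO-style tags and exact-match scoring of (label, start, end) spans; the abstract does not state the tagging scheme or the matching criterion, so both are assumptions, and the function names `bio_to_spans` and `macro_f1` as well as the labels `X` and `Y` are hypothetical.

```python
from collections import defaultdict


def bio_to_spans(tags):
    """Convert a BIO tag sequence into a set of (label, start, end) spans.

    `end` is exclusive. An I- tag that does not continue an open span of
    the same label is ignored (an assumption, not the paper's stated rule).
    """
    spans = set()
    start, label = None, None
    # A trailing "O" sentinel closes any span still open at the end.
    for i, tag in enumerate(list(tags) + ["O"]):
        continues = tag.startswith("I-") and label == tag[2:]
        if start is not None and not continues:
            spans.add((label, start, i))
            start, label = None, None
        if tag.startswith("B-"):
            start, label = i, tag[2:]
    return spans


def macro_f1(gold_seqs, pred_seqs):
    """Exact-match span F1 per label, averaged with equal weight per label."""
    tp, fp, fn = defaultdict(int), defaultdict(int), defaultdict(int)
    for gold_tags, pred_tags in zip(gold_seqs, pred_seqs):
        gold, pred = bio_to_spans(gold_tags), bio_to_spans(pred_tags)
        for label, _, _ in gold & pred:
            tp[label] += 1
        for label, _, _ in pred - gold:
            fp[label] += 1
        for label, _, _ in gold - pred:
            fn[label] += 1
    labels = set(tp) | set(fp) | set(fn)
    scores = []
    for label in labels:
        denom = 2 * tp[label] + fp[label] + fn[label]
        scores.append(2 * tp[label] / denom if denom else 0.0)
    return sum(scores) / len(scores) if scores else 0.0


# Hypothetical example: the X span is recovered exactly, the Y span is missed.
gold = [["B-X", "I-X", "O", "B-Y"]]
pred = [["B-X", "I-X", "O", "O"]]
score = macro_f1(gold, pred)
```

In this example label `X` scores an F1 of 1.0 and label `Y` scores 0.0, so the macro average is 0.5. Macro averaging weights every label equally, which means rare labels pull the score down as much as frequent ones.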

Code Implementations: 1 repo