LGCLMLNov 26, 2016

Structural Correspondence Learning for Cross-lingual Sentiment Classification with One-to-many Mappings

arXiv:1611.08737v1
Originality Incremental advance
AI Analysis

This addresses cross-lingual sentiment analysis for languages like English and Chinese, offering an incremental improvement over existing structural correspondence learning methods.

The paper tackled the problem of cross-lingual sentiment classification by proposing a method that learns one-to-many mappings for pivot words using distributed representations, achieving more competitive results than state-of-the-art methods on the NLP&CC 2013 dataset.

Structural correspondence learning (SCL) is an effective method for cross-lingual sentiment classification. This approach uses unlabeled documents along with a word translation oracle to automatically induce task specific, cross-lingual correspondences. It transfers knowledge through identifying important features, i.e., pivot features. For simplicity, however, it assumes that the word translation oracle maps each pivot feature in source language to exactly only one word in target language. This one-to-one mapping between words in different languages is too strict. Also the context is not considered at all. In this paper, we propose a cross-lingual SCL based on distributed representation of words; it can learn meaningful one-to-many mappings for pivot words using large amounts of monolingual data and a small dictionary. We conduct experiments on NLP\&CC 2013 cross-lingual sentiment analysis dataset, employing English as source language, and Chinese as target language. Our method does not rely on the parallel corpora and the experimental results show that our approach is more competitive than the state-of-the-art methods in cross-lingual sentiment classification.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes