LGCLIRMLSep 1, 2019

Transfer Learning Between Related Tasks Using Expected Label Proportions

arXiv:1909.00430v1997 citations
Originality Incremental advance
AI Analysis

This work addresses data scarcity in deep learning for NLP tasks, offering an incremental method to leverage related tasks for improved performance.

The paper tackles the problem of limited labeled data for deep learning by applying expectation regularization (XR) to transfer learning between related tasks, using a model trained on task A to estimate label proportions for task B and training a model for B with an XR loss, which improves upon fully supervised systems and combines with LM-based pretraining in aspect-based sentiment classification.

Deep learning systems thrive on abundance of labeled training data but such data is not always available, calling for alternative methods of supervision. One such method is expectation regularization (XR) (Mann and McCallum, 2007), where models are trained based on expected label proportions. We propose a novel application of the XR framework for transfer learning between related tasks, where knowing the labels of task A provides an estimation of the label proportion of task B. We then use a model trained for A to label a large corpus, and use this corpus with an XR loss to train a model for task B. To make the XR framework applicable to large-scale deep-learning setups, we propose a stochastic batched approximation procedure. We demonstrate the approach on the task of Aspect-based Sentiment classification, where we effectively use a sentence-level sentiment predictor to train accurate aspect-based predictor. The method improves upon fully supervised neural system trained on aspect-level data, and is also cumulative with LM-based pretraining, as we demonstrate by improving a BERT-based Aspect-based Sentiment model.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes