CLLGNov 13, 2019

Unsupervised Domain Adaptation on Reading Comprehension

arXiv:1911.06137v542 citations
Originality Incremental advance
AI Analysis

This addresses the domain adaptation challenge for reading comprehension systems, which is incremental as it builds on existing methods like BERT and adversarial learning.

The paper tackles the problem of poor generalization in reading comprehension models across domains by proposing an unsupervised domain adaptation method, achieving comparable accuracy to supervised models on multiple large-scale benchmarks.

Reading comprehension (RC) has been studied in a variety of datasets with the boosted performance brought by deep neural networks. However, the generalization capability of these models across different domains remains unclear. To alleviate this issue, we are going to investigate unsupervised domain adaptation on RC, wherein a model is trained on labeled source domain and to be applied to the target domain with only unlabeled samples. We first show that even with the powerful BERT contextual representation, the performance is still unsatisfactory when the model trained on one dataset is directly applied to another target dataset. To solve this, we provide a novel conditional adversarial self-training method (CASe). Specifically, our approach leverages a BERT model fine-tuned on the source dataset along with the confidence filtering to generate reliable pseudo-labeled samples in the target domain for self-training. On the other hand, it further reduces domain distribution discrepancy through conditional adversarial learning across domains. Extensive experiments show our approach achieves comparable accuracy to supervised models on multiple large-scale benchmark datasets.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes