CL LG Jan 31, 2021

Mixup Regularized Adversarial Networks for Multi-Domain Text Classification

arXiv:2102.00467v1, 12 citations
Originality: Incremental advance
AI Analysis

This work addresses multi-domain text classification and represents an incremental advance for natural language processing applications.

The paper tackles two problems in multi-domain text classification, insufficient domain-invariant features and mismatching caused by aligning marginal distributions, by proposing a mixup regularized adversarial network (MRAN). It reports average accuracies of 87.64% and 89.0% on two benchmarks.

Using the shared-private paradigm and adversarial training has significantly improved the performance of multi-domain text classification (MDTC) models. However, existing methods have two issues. First, instances from the multiple domains are not sufficient for domain-invariant feature extraction. Second, aligning on the marginal distributions may lead to fatal mismatching. In this paper, we propose a mixup regularized adversarial network (MRAN) to address these two issues. More specifically, domain and category mixup regularizations are introduced to enrich the intrinsic features in the shared latent space and enforce consistent predictions in-between training instances, such that the learned features can be more domain-invariant and discriminative. We conduct experiments on two benchmarks: the Amazon review dataset and the FDU-MTL dataset. Our approach yields average accuracies of 87.64% and 89.0% on these two datasets respectively, outperforming all relevant baselines.
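To make the idea of "consistent predictions in-between training instances" concrete, here is a minimal sketch of generic mixup interpolation. It is not the paper's MRAN implementation: the function name `mixup`, the Beta concentration `alpha`, the toy feature shapes, and the use of NumPy are all assumptions chosen for illustration. The same convex combination could in principle be applied to category labels or to domain labels.

```python
import numpy as np


def mixup(features_a, features_b, targets_a, targets_b, lam):
    """Convexly combine two batches and their (soft) targets with weight lam.

    Hypothetical helper for illustration; not taken from the paper.
    """
    mixed_features = lam * features_a + (1.0 - lam) * features_b
    mixed_targets = lam * targets_a + (1.0 - lam) * targets_b
    return mixed_features, mixed_targets


rng = np.random.default_rng(0)
alpha = 0.2                   # assumed Beta concentration, not from the paper
lam = rng.beta(alpha, alpha)  # mixing coefficient in [0, 1]

# Toy shared-space features: two batches of 4 instances with 8 dimensions each.
feat_a = rng.normal(size=(4, 8))
feat_b = rng.normal(size=(4, 8))

# One-hot category labels (2 classes) for each batch.
labels_a = np.eye(2)[[0, 1, 0, 1]]
labels_b = np.eye(2)[[1, 1, 0, 0]]

mixed_feat, mixed_labels = mixup(feat_a, feat_b, labels_a, labels_b, lam)
```

Training a classifier to predict `mixed_labels` from `mixed_feat` is what pushes its outputs to vary linearly between instances. Each row of `mixed_labels` still sums to 1, so it remains a valid soft target.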
