CLJun 21, 2021

Empower Distantly Supervised Relation Extraction with Collaborative Adversarial Training

arXiv:2106.10835v124 citations
Originality Incremental advance
AI Analysis

This addresses a key bottleneck for researchers in relation extraction, though it is incremental as it builds on existing multi-instance learning methods.

The paper tackles the low data utilization problem in distantly supervised relation extraction with multi-instance learning by proposing collaborative adversarial training, which improves the state-of-the-art by about 5 absolute AUC score.

With recent advances in distantly supervised (DS) relation extraction (RE), considerable attention is attracted to leverage multi-instance learning (MIL) to distill high-quality supervision from the noisy DS. Here, we go beyond label noise and identify the key bottleneck of DS-MIL to be its low data utilization: as high-quality supervision being refined by MIL, MIL abandons a large amount of training instances, which leads to a low data utilization and hinders model training from having abundant supervision. In this paper, we propose collaborative adversarial training to improve the data utilization, which coordinates virtual adversarial training (VAT) and adversarial training (AT) at different levels. Specifically, since VAT is label-free, we employ the instance-level VAT to recycle instances abandoned by MIL. Besides, we deploy AT at the bag-level to unleash the full potential of the high-quality supervision got by MIL. Our proposed method brings consistent improvements (~ 5 absolute AUC score) to the previous state of the art, which verifies the importance of the data utilization issue and the effectiveness of our method.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes