CLJul 9, 2024

Consistent Document-Level Relation Extraction via Counterfactuals

arXiv:2407.06699v224 citationsh-index: 13
Originality Incremental advance
AI Analysis

This addresses inconsistency issues in document-level relation extraction for NLP researchers, though it is incremental as it builds on existing bias mitigation techniques.

The paper tackles the problem of factual biases in document-level relation extraction models by introducing CovEReD, a counterfactual data generation method using entity replacement, and shows that training on this data maintains consistency with minimal performance impact, releasing a pipeline and dataset for evaluation.

Many datasets have been developed to train and evaluate document-level relation extraction (RE) models. Most of these are constructed using real-world data. It has been shown that RE models trained on real-world data suffer from factual biases. To evaluate and address this issue, we present CovEReD, a counterfactual data generation approach for document-level relation extraction datasets using entity replacement. We first demonstrate that models trained on factual data exhibit inconsistent behavior: while they accurately extract triples from factual data, they fail to extract the same triples after counterfactual modification. This inconsistency suggests that models trained on factual data rely on spurious signals such as specific entities and external knowledge $\unicode{x2013}$ rather than on the input context $\unicode{x2013}$ to extract triples. We show that by generating document-level counterfactual data with CovEReD and training models on them, consistency is maintained with minimal impact on RE performance. We release our CovEReD pipeline as well as Re-DocRED-CF, a dataset of counterfactual RE documents, to assist in evaluating and addressing inconsistency in document-level RE.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes