CLAIMay 18, 2023

Uncertainty Guided Label Denoising for Document-level Distant Relation Extraction

arXiv:2305.11029v2225 citations
Originality Incremental advance
AI Analysis

This work addresses a specific challenge in document-level relation extraction for researchers, but it is incremental as it builds on existing pseudo-label denoising methods.

The paper tackles the problem of noise in pseudo labels for document-level distant relation extraction by introducing uncertainty estimation to guide label denoising, resulting in performance improvements of 1.91 F1 and 2.28 Ign F1 on the RE-DocRED dataset.

Document-level relation extraction (DocRE) aims to infer complex semantic relations among entities in a document. Distant supervision (DS) is able to generate massive auto-labeled data, which can improve DocRE performance. Recent works leverage pseudo labels generated by the pre-denoising model to reduce noise in DS data. However, unreliable pseudo labels bring new noise, e.g., adding false pseudo labels and losing correct DS labels. Therefore, how to select effective pseudo labels to denoise DS data is still a challenge in document-level distant relation extraction. To tackle this issue, we introduce uncertainty estimation technology to determine whether pseudo labels can be trusted. In this work, we propose a Document-level distant Relation Extraction framework with Uncertainty Guided label denoising, UGDRE. Specifically, we propose a novel instance-level uncertainty estimation method, which measures the reliability of the pseudo labels with overlapping relations. By further considering the long-tail problem, we design dynamic uncertainty thresholds for different types of relations to filter high-uncertainty pseudo labels. We conduct experiments on two public datasets. Our framework outperforms strong baselines by 1.91 F1 and 2.28 Ign F1 on the RE-DocRED dataset.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes