CLApr 17, 2021

Learning from Noisy Labels for Entity-Centric Information Extraction

arXiv:2104.08656v2666 citations
Originality Incremental advance
AI Analysis

This addresses performance degradation in information extraction due to noisy labels, which is an incremental improvement for the field.

The paper tackles the problem of deep neural models overfitting noisy labels in entity-centric information extraction by proposing a co-regularization framework with multiple models regularized for agreement, achieving effectiveness on benchmarks like TACRED and CoNLL03.

Recent information extraction approaches have relied on training deep neural models. However, such models can easily overfit noisy labels and suffer from performance degradation. While it is very costly to filter noisy labels in large learning resources, recent studies show that such labels take more training steps to be memorized and are more frequently forgotten than clean labels, therefore are identifiable in training. Motivated by such properties, we propose a simple co-regularization framework for entity-centric information extraction, which consists of several neural models with identical structures but different parameter initialization. These models are jointly optimized with the task-specific losses and are regularized to generate similar predictions based on an agreement loss, which prevents overfitting on noisy labels. Extensive experiments on two widely used but noisy benchmarks for information extraction, TACRED and CoNLL03, demonstrate the effectiveness of our framework. We release our code to the community for future research.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes