AI CL IR
Sep 3, 2018

DeFactoNLP: Fact Verification using Entity Recognition, TFIDF Vector Comparison and Decomposable Attention

Aniketh Janardhan Reddy, Gil Rocha, Diego Esteves

arXiv:1809.00509v157.81094 citations
Has Code

Originality Synthesis-oriented

AI Analysis

The work addresses misinformation detection in NLP, but its contribution is incremental: it combines existing methods to build a system for a specific shared task.

The paper tackles the problem of automated fact verification by developing DeFactoNLP, a system that assesses claim veracity and retrieves supporting evidence from Wikipedia, achieving a 0.3833 FEVER score, 0.5136 label accuracy, and 0.4277 evidence F1-score.

In this paper, we describe DeFactoNLP, the system we designed for the FEVER 2018 Shared Task. The aim of this task was to conceive a system that can not only automatically assess the veracity of a claim but also retrieve evidence supporting this assessment from Wikipedia. In our approach, the Wikipedia documents whose Term Frequency-Inverse Document Frequency (TFIDF) vectors are most similar to the vector of the claim and those documents whose names are similar to those of the named entities (NEs) mentioned in the claim are identified as the documents which might contain evidence. The sentences in these documents are then supplied to a textual entailment recognition module. This module calculates the probability of each sentence supporting the claim, contradicting the claim or not providing any relevant information to assess the veracity of the claim. Various features computed using these probabilities are finally used by a Random Forest classifier to determine the overall truthfulness of the claim. The sentences which support this classification are returned as evidence. Our approach achieved a 0.4277 evidence F1-score, a 0.5136 label accuracy and a 0.3833 FEVER score.
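The document-retrieval step described above, ranking documents by how similar their TFIDF vectors are to the claim's vector, can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the function names (`tokenize`, `tfidf_vectors`, `cosine`, `rank_documents`), the whitespace tokenizer, the smoothed IDF formula, and the use of cosine similarity are all choices made here for the example. The named-entity matching, textual entailment, and Random Forest stages are omitted.

```python
import math
from collections import Counter

PUNCT = ".,;:!?()\"'"


def tokenize(text):
    """Lowercase, split on whitespace, strip surrounding punctuation (a simplification)."""
    tokens = (tok.strip(PUNCT) for tok in text.lower().split())
    return [tok for tok in tokens if tok]


def tfidf_vectors(docs):
    """Return (idf, vectors), where vectors holds one sparse TF-IDF dict per document."""
    tokenized = [tokenize(d) for d in docs]
    n_docs = len(docs)
    doc_freq = Counter(term for toks in tokenized for term in set(toks))
    # Smoothed IDF (an assumption of this sketch) keeps every weight positive.
    idf = {t: math.log((1 + n_docs) / (1 + df)) + 1.0 for t, df in doc_freq.items()}
    vectors = [{t: c * idf[t] for t, c in Counter(toks).items()} for toks in tokenized]
    return idf, vectors


def cosine(a, b):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(w * b.get(t, 0.0) for t, w in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def rank_documents(claim, docs, top_k=2):
    """Return up to top_k (doc_index, similarity) pairs, most similar first."""
    idf, doc_vecs = tfidf_vectors(docs)
    # Claim terms that never occur in the corpus carry no weight and are dropped.
    counts = Counter(tokenize(claim))
    claim_vec = {t: c * idf[t] for t, c in counts.items() if t in idf}
    scored = [(cosine(claim_vec, vec), i) for i, vec in enumerate(doc_vecs)]
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return [(i, score) for score, i in scored[:top_k]]
```

Calling `rank_documents(claim, docs)` on a small list of document strings returns the indices of the candidates whose sentences would then be passed to the entailment module. A real system over Wikipedia would need an inverted index or a sparse-matrix library rather than per-document dicts.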
