CLMar 18, 2024

A Closer Look at Claim Decomposition

Miriam Wanner, Seth Ebner, Zhengping Jiang, Mark Dredze, Benjamin Van Durme

arXiv:2403.11903v121.244 citationsh-index: 21STARSEM

Originality Incremental advance

AI Analysis

This work addresses the problem of accurately evaluating text support for generated content, which is incremental by refining decomposition methods in existing metrics.

The paper investigates how claim decomposition methods, particularly LLM-based ones, affect evaluation metrics like FActScore, finding sensitivity due to attribution errors, and proposes DecompScore and a novel LLM-based approach inspired by logical atomism for improved decomposition quality.

As generated text becomes more commonplace, it is increasingly important to evaluate how well-supported such text is by external knowledge sources. Many approaches for evaluating textual support rely on some method for decomposing text into its individual subclaims which are scored against a trusted reference. We investigate how various methods of claim decomposition -- especially LLM-based methods -- affect the result of an evaluation approach such as the recently proposed FActScore, finding that it is sensitive to the decomposition method used. This sensitivity arises because such metrics attribute overall textual support to the model that generated the text even though error can also come from the metric's decomposition step. To measure decomposition quality, we introduce an adaptation of FActScore, which we call DecompScore. We then propose an LLM-based approach to generating decompositions inspired by Bertrand Russell's theory of logical atomism and neo-Davidsonian semantics and demonstrate its improved decomposition quality over previous methods.

View on arXiv PDF

Similar