CLOct 3, 2022

Probing of Quantitative Values in Abstractive Summarization Models

arXiv:2210.00667v1h-index: 4

Originality Incremental advance

AI Analysis

This addresses the issue of data hallucination for users relying on accurate quantitative summaries, but it is incremental as it focuses on evaluation rather than solving the problem.

The paper tackled the problem of quantitative data hallucination in abstractive summarization models by proposing probing tests to evaluate how well these models represent numerical values from input text, finding that most state-of-the-art model encoders struggle to adequately embed quantitative data, with some even underperforming random representations.

Abstractive text summarization has recently become a popular approach, but data hallucination remains a serious problem, including with quantitative data. We propose a set of probing tests to evaluate the efficacy of abstract summarization models' modeling of quantitative values found in the input text. Our results show that in most cases, the encoders of recent SOTA-performing models struggle to provide embeddings that adequately represent quantitative values in the input compared to baselines, and in particular, they outperform random representations in some, but surprisingly not all, cases. Under our assumptions, this suggests that the encoder's performance contributes to the quantity hallucination problem. One model type in particular, DistilBART-CDM, was observed to underperform randomly initialized representations for several experiments, and performance versus BERT suggests that standard pretraining and fine-tuning approaches for the summarization task may play a role in underperformance for some encoders.

View on arXiv PDF

Similar