CLAIMay 6

Generating Query-Focused Summarization Datasets from Query-Free Summarization Datasets

arXiv:2605.0539214.6h-index: 4
Predicted impact top 67% in CL · last 90 daysOriginality Synthesis-oriented
AI Analysis

This work addresses the problem of lacking query-annotated datasets for QFS by enabling the use of query-free datasets, though the improvement is incremental.

The paper proposes an evidence-based model to generate query keywords from query-free summarization datasets for Query-Focused Summarization (QFS). Experiments show that summaries generated using these queries achieve competitive ROUGE scores compared to those using original queries.

Large-scale datasets are widely used to perform summarization tasks, but they may not include queries alongside documents and summaries. In the search for suitable datasets for Query-Focused Summarization (QFS), we identify two research questions: Is it possible to automatically generate evidence-based query keywords from query-free datasets? Does evidence-based query generation support the QFS task? This paper proposes an evidence-based model to generate queries from query-free datasets. To evaluate our model intrinsically, we compare the similarity between the original queries and the system-generated queries of two QFS datasets. We also perform summarization tasks using different pre-trained models, as well as a state-of-the-art (SOTA) QFS model, to measure the extrinsic performance of our query generation approach. Experimental results indicate that summaries generated using evidence-based queries achieve competitive ROUGE scores compared to those generated from the original queries.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes