AI CLJul 15, 2024

IDEAL: Leveraging Infinite and Dynamic Characterizations of Large Language Models for Query-focused Summarization

Jie Cao, Dian Jiao, Qiang Yan, Wenqiao Zhang, Siliang Tang, Yueting Zhuang

arXiv:2407.10486v22 citationsh-index: 27Has Code

Originality Incremental advance

AI Analysis

This work addresses query-focused summarization for users needing personalized summaries, but it appears incremental as it builds on existing LLM capabilities.

The paper tackles query-focused summarization by proposing two modules (Query-aware HyperExpert and Query-focused Infini-attention) to handle lengthy documents and align queries with LLMs, achieving effectiveness and generalizability on existing benchmarks.

Query-focused summarization (QFS) aims to produce summaries that answer particular questions of interest, enabling greater user control and personalization. With the advent of large language models (LLMs), shows their impressive capability of textual understanding through large-scale pretraining, which implies the great potential of extractive snippet generation. In this paper, we systematically investigated two indispensable characteristics that the LLMs-based QFS models should be harnessed, Lengthy Document Summarization and Efficiently Fine-grained Query-LLM Alignment, respectively. Correspondingly, we propose two modules called Query-aware HyperExpert and Query-focused Infini-attention to access the aforementioned characteristics. These innovations pave the way for broader application and accessibility in the field of QFS technology. Extensive experiments conducted on existing QFS benchmarks indicate the effectiveness and generalizability of the proposed approach. Our code is publicly available at https://github.com/DCDmllm/IDEAL_Summary.

View on arXiv PDF Code

Similar