CLAILGMay 1, 2024

RST-LoRA: A Discourse-Aware Low-Rank Adaptation for Long Document Abstractive Summarization

arXiv:2405.00657v233 citationsh-index: 6NAACL
Originality Incremental advance
AI Analysis

This work addresses the challenge of efficiently summarizing long documents for NLP applications, though it is incremental as it builds on existing LoRA methods.

The paper tackled the problem of long document abstractive summarization by integrating rhetorical structure theory (RST) into a low-rank adaptation (LoRA) model, resulting in a variant that outperformed vanilla LoRA, full-parameter fine-tuning, and previous state-of-the-art methods in evaluations.

For long document summarization, discourse structure is important to discern the key content of the text and the differences in importance level between sentences. Unfortunately, the integration of rhetorical structure theory (RST) into parameter-efficient fine-tuning strategies for long document summarization remains unexplored. Therefore, this paper introduces RST-LoRA and proposes four RST-aware variants to explicitly incorporate RST into the LoRA model. Our empirical evaluation demonstrates that incorporating the type and uncertainty of rhetorical relations can complementarily enhance the performance of LoRA in summarization tasks. Furthermore, the best-performing variant we introduced outperforms the vanilla LoRA and full-parameter fine-tuning models, as confirmed by multiple automatic and human evaluations, and even surpasses previous state-of-the-art methods.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes