CLJan 7, 2025

Progressive Document-level Text Simplification via Large Language Models

Dengzhao Fang, Jipeng Qiang, Yi Zhu, Yunhao Yuan, Wei Li, Yan Liu

arXiv:2501.03857v19.613 citationsh-index: 15New generation computing

Originality Incremental advance

AI Analysis

This addresses the challenge of simplifying long documents for improved readability, though it is incremental as it builds on existing LLM capabilities.

The paper tackled the problem of document-level text simplification, which is under-explored compared to lexical and sentence-level approaches, by proposing a progressive method (ProgDS) that hierarchically decomposes the task using large language models, and it significantly outperformed existing methods, advancing the state-of-the-art.

Research on text simplification has primarily focused on lexical and sentence-level changes. Long document-level simplification (DS) is still relatively unexplored. Large Language Models (LLMs), like ChatGPT, have excelled in many natural language processing tasks. However, their performance on DS tasks is unsatisfactory, as they often treat DS as merely document summarization. For the DS task, the generated long sequences not only must maintain consistency with the original document throughout, but complete moderate simplification operations encompassing discourses, sentences, and word-level simplifications. Human editors employ a hierarchical complexity simplification strategy to simplify documents. This study delves into simulating this strategy through the utilization of a multi-stage collaboration using LLMs. We propose a progressive simplification method (ProgDS) by hierarchically decomposing the task, including the discourse-level, topic-level, and lexical-level simplification. Experimental results demonstrate that ProgDS significantly outperforms existing smaller models or direct prompting with LLMs, advancing the state-of-the-art in the document simplification task.

View on arXiv PDF

Similar