CLSep 21, 2025

Prompt-Based Simplification for Plain Language using Spanish Language Models

arXiv:2509.17209v11 citationsh-index: 9
Originality Synthesis-oriented
AI Analysis

This work addresses the challenge of making text more accessible in Spanish, though it is incremental as it applies existing methods to a specific domain and dataset.

The paper tackled the problem of adapting text to plain language in Spanish using prompt-based simplification with language models, achieving first place in semantic similarity (SIM=0.75) but fourth in readability (FH=69.72).

This paper describes the participation of HULAT-UC3M in CLEARS 2025 Subtask 1: Adaptation of Text to Plain Language (PL) in Spanish. We explored strategies based on models trained on Spanish texts, including a zero-shot configuration using prompt engineering and a fine-tuned version with Low-Rank Adaptation (LoRA). Different strategies were evaluated on representative internal subsets of the training data, using the official task metrics, cosine similarity (SIM) and the Fernández-Huerta readability index (FH) to guide the selection of the optimal model and prompt combination. The final system was selected for its balanced and consistent performance, combining normalization steps, the RigoChat-7B-v2 model, and a dedicated PL-oriented prompt. It ranked first in semantic similarity (SIM = 0.75), however, fourth in readability (FH = 69.72). We also discuss key challenges related to training data heterogeneity and the limitations of current evaluation metrics in capturing both linguistic clarity and content preservation.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes