CLSep 30, 2024

Language Resources in Spanish for Automatic Text Simplification across Domains

arXiv:2409.20466v11 citationsh-index: 11
Originality Synthesis-oriented
AI Analysis

This work addresses the need for accessible text simplification tools in Spanish across specific domains, but it is incremental as it builds on existing resources and methods.

The authors tackled the problem of automatic text simplification for Spanish by developing language resources and models across finance, medicine, and history domains, resulting in publicly available corpora, guidelines, lexicons, datasets, and two simplification tools.

This work describes the language resources and models developed for automatic simplification of Spanish texts in three domains: Finance, Medicine and History studies. We created several corpora in each domain, annotation and simplification guidelines, a lexicon of technical and simplified medical terms, datasets used in shared tasks for the financial domain, and two simplification tools. The methodology, resources and companion publications are shared publicly on the web-site: https://clara-nlp.uned.es/.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes