Language Resources in Spanish for Automatic Text Simplification across Domains
This work addresses the need for accessible text simplification tools in Spanish across specific domains, but it is incremental as it builds on existing resources and methods.
The authors tackled the problem of automatic text simplification for Spanish by developing language resources and models across finance, medicine, and history domains, resulting in publicly available corpora, guidelines, lexicons, datasets, and two simplification tools.
This work describes the language resources and models developed for automatic simplification of Spanish texts in three domains: Finance, Medicine and History studies. We created several corpora in each domain, annotation and simplification guidelines, a lexicon of technical and simplified medical terms, datasets used in shared tasks for the financial domain, and two simplification tools. The methodology, resources and companion publications are shared publicly on the web-site: https://clara-nlp.uned.es/.