CLApr 27

Zero-shot Large Language Models for Automatic Readability Assessment

arXiv:2604.2447086.8Has Code

AI Analysis

This work provides a practical zero-shot solution for readability assessment, benefiting educators and content creators needing to match text difficulty to audience.

The authors propose a zero-shot prompting methodology for automatic readability assessment using LLMs, outperforming prior methods on 13 of 14 datasets. Their combined approach LAURAE robustly beats prior methods across languages and text lengths.

Unsupervised automatic readability assessment (ARA) methods have important practical and research applications (e.g., ensuring medical or educational materials are suitable for their target audiences). In this paper, we propose a new zero-shot prompting methodology for ARA and present the first comprehensive evaluation of using large language models (LLMs) as an unsupervised ARA method by testing 10 diverse open-source LLMs (e.g., different sizes and developers) on 14 diverse datasets (e.g., different text lengths and languages). Our findings show that our proposed prompting methodology outperforms prior methods on 13 of the 14 datasets. Furthermore, we propose LAURAE, which combines LLM and readability formula scores to improve robustness by capturing both contextual and shallow (e.g., sentence length) features of readability. Our evaluation demonstrates that LAURAE robustly outperforms prior methods across languages, text lengths, and amounts of technical language.

View on arXiv PDF Code

Similar