Exploring the Feasibility of Multilingual Grammatical Error Correction with a Single LLM up to 9B parameters: A Comparative Study of 17 Models
This work addresses the problem of efficient multilingual grammatical error correction for users needing text correction in multiple languages, but it is incremental as it compares existing models without introducing new methods.
The study evaluated 17 language models for multilingual grammatical error correction across English, German, Italian, and Swedish, identifying six models that improved correctness in all languages and found Gemma 9B to be the best performer.
Recent language models can successfully solve various language-related tasks, and many understand inputs stated in different languages. In this paper, we explore the performance of 17 popular models used to correct grammatical issues in texts stated in English, German, Italian, and Swedish when using a single model to correct texts in all those languages. We analyze the outputs generated by these models, focusing on decreasing the number of grammatical errors while keeping the changes small. The conclusions drawn help us understand what problems occur among those models and which models can be recommended for multilingual grammatical error correction tasks. We list six models that improve grammatical correctness in all four languages and show that Gemma 9B is currently the best performing one for the languages considered.