CLMar 8, 2024

To Err Is Human, but Llamas Can Learn It Too

arXiv:2403.05493v225 citationsh-index: 22EMNLP
Originality Incremental advance
AI Analysis

This addresses grammatical error correction for multiple languages, showing incremental improvements over existing methods.

This study tackled grammatical error correction (GEC) by using artificial error generation (AEG) with language models, and the result was outperforming previous state-of-the-art models with gains of 0.8 to 6 F0.5 points across German, Ukrainian, and Estonian.

This study explores enhancing grammatical error correction (GEC) through artificial error generation (AEG) using language models (LMs). Specifically, we fine-tune Llama 2-based LMs for error generation and find that this approach yields synthetic errors akin to human errors. Next, we train GEC Llama models with the help of these artificial errors and outperform previous state-of-the-art error correction models, with gains ranging between 0.8 and 6 F0.5 points across all tested languages (German, Ukrainian, and Estonian). Moreover, we demonstrate that generating errors by fine-tuning smaller sequence-to-sequence models and prompting large commercial LMs (GPT-3.5 and GPT-4) also results in synthetic errors beneficially affecting error generation models.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes