Towards Lithuanian grammatical error correction
This addresses grammatical errors for Lithuanian speakers/writers, but is incremental as it applies existing methods to a new language.
The authors tackled grammatical error correction for Lithuanian by constructing a transformer-based model, achieving F0.5=0.92 and releasing the model and code.
Everyone wants to write beautiful and correct text, yet the lack of language skills, experience, or hasty typing can result in errors. By employing the recent advances in transformer architectures, we construct a grammatical error correction model for Lithuanian, the language rich in archaic features. We compare subword and byte-level approaches and share our best trained model, achieving F$_{0.5}$=0.92, and accompanying code, in an online open-source repository.