CLLGJul 2, 2019

A Neural Grammatical Error Correction System Built On Better Pre-training and Sequential Transfer Learning

arXiv:1907.01256v11113 citations
Originality Incremental advance
AI Analysis

This work addresses grammatical error correction for language learners and writers, but it is incremental as it builds on existing pre-training and transfer learning methods.

The paper tackled the low-resource challenge in grammatical error correction by generating synthetic parallel corpora for pre-training Transformer models and applying sequential transfer learning, achieving competitive results in ACL 2019 BEA Shared Task tracks.

Grammatical error correction can be viewed as a low-resource sequence-to-sequence task, because publicly available parallel corpora are limited. To tackle this challenge, we first generate erroneous versions of large unannotated corpora using a realistic noising function. The resulting parallel corpora are subsequently used to pre-train Transformer models. Then, by sequentially applying transfer learning, we adapt these models to the domain and style of the test set. Combined with a context-aware neural spellchecker, our system achieves competitive results in both restricted and low resource tracks in ACL 2019 BEA Shared Task. We release all of our code and materials for reproducibility.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes