CL AIMar 31, 2016

Neural Language Correction with Character-Based Attention

Ziang Xie, Anand Avati, Naveen Arivazhagan, Dan Jurafsky, Andrew Y. Ng

arXiv:1603.09727v118.4151 citations

Originality Incremental advance

AI Analysis

This work addresses the challenge of flexible language correction for English learners, though it is incremental as it builds on existing neural models.

The paper tackles the problem of natural language correction for language learners by introducing a neural network-based approach that operates at the character level to handle errors like redundancy and orthographic issues, achieving a state-of-the-art F0.5-score on the CoNLL 2014 Shared Task.

Natural language correction has the potential to help language learners improve their writing skills. While approaches with separate classifiers for different error types have high precision, they do not flexibly handle errors such as redundancy or non-idiomatic phrasing. On the other hand, word and phrase-based machine translation methods are not designed to cope with orthographic errors, and have recently been outpaced by neural models. Motivated by these issues, we present a neural network-based approach to language correction. The core component of our method is an encoder-decoder recurrent neural network with an attention mechanism. By operating at the character level, the network avoids the problem of out-of-vocabulary words. We illustrate the flexibility of our approach on dataset of noisy, user-generated text collected from an English learner forum. When combined with a language model, our method achieves a state-of-the-art $F_{0.5}$-score on the CoNLL 2014 Shared Task. We further demonstrate that training the network on additional data with synthesized errors can improve performance.

View on arXiv PDF

Similar