CL AINov 23, 2024

Enhancing Grammatical Error Detection using BERT with Cleaned Lang-8 Dataset

arXiv:2411.15523v13 citationsh-index: 1Has Code

Originality Synthesis-oriented

AI Analysis

This addresses the problem of grammatical error detection for applications like language learning and writing assistance, but it is incremental as it builds on existing transformer methods with data improvements.

The paper tackled grammatical error detection by fine-tuning BERT on a cleaned Lang-8 dataset, achieving an F1 score of 0.91 and accuracy up to 98.49% on training data, demonstrating the impact of data cleaning.

This paper presents an improved LLM based model for Grammatical Error Detection (GED), which is a very challenging and equally important problem for many applications. The traditional approach to GED involved hand-designed features, but recently, Neural Networks (NN) have automated the discovery of these features, improving performance in GED. Traditional rule-based systems have an F1 score of 0.50-0.60 and earlier machine learning models give an F1 score of 0.65-0.75, including decision trees and simple neural networks. Previous deep learning models, for example, Bi-LSTM, have reported F1 scores within the range from 0.80 to 0.90. In our study, we have fine-tuned various transformer models using the Lang8 dataset rigorously cleaned by us. In our experiments, the BERT-base-uncased model gave an impressive performance with an F1 score of 0.91 and accuracy of 98.49% on training data and 90.53% on testing data, also showcasing the importance of data cleaning. Increasing model size using BERT-large-uncased or RoBERTa-large did not give any noticeable improvements in performance or advantage for this task, underscoring that larger models are not always better. Our results clearly show how far rigorous data cleaning and simple transformer-based models can go toward significantly improving the quality of GED.

View on arXiv PDF Code

Similar