CL LG MLSep 18, 2019

Language models and Automated Essay Scoring

Pedro Uria Rodriguez, Amir Jafari, Christopher M. Ormerod

arXiv:1909.09482v13.3112 citations

Originality Synthesis-oriented

AI Analysis

This is an incremental study applying existing NLP models to essay scoring, potentially benefiting educational assessment.

The paper tackled automatic essay scoring by comparing BERT and XLNet language models on a Kaggle dataset, achieving above human-level accuracy.

In this paper, we present a new comparative study on automatic essay scoring (AES). The current state-of-the-art natural language processing (NLP) neural network architectures are used in this work to achieve above human-level accuracy on the publicly available Kaggle AES dataset. We compare two powerful language models, BERT and XLNet, and describe all the layers and network architectures in these models. We elucidate the network architectures of BERT and XLNet using clear notation and diagrams and explain the advantages of transformer architectures over traditional recurrent neural network architectures. Linear algebra notation is used to clarify the functions of transformers and attention mechanisms. We compare the results with more traditional methods, such as bag of words (BOW) and long short term memory (LSTM) networks.

View on arXiv PDF

Similar