CL, LG, ML | Sep 18, 2019

Language models and Automated Essay Scoring

arXiv:1909.09482v1 | 112 citations
AI Analysis

This is an incremental study applying existing NLP models to essay scoring, potentially benefiting educational assessment.

The paper tackles automatic essay scoring by comparing the BERT and XLNet language models on a Kaggle dataset, reporting above human-level accuracy.

In this paper, we present a new comparative study on automatic essay scoring (AES). The current state-of-the-art natural language processing (NLP) neural network architectures are used in this work to achieve above human-level accuracy on the publicly available Kaggle AES dataset. We compare two powerful language models, BERT and XLNet, and describe all the layers and network architectures in these models. We elucidate the network architectures of BERT and XLNet using clear notation and diagrams and explain the advantages of transformer architectures over traditional recurrent neural network architectures. Linear algebra notation is used to clarify the functions of transformers and attention mechanisms. We compare the results with more traditional methods, such as bag-of-words (BOW) and long short-term memory (LSTM) networks.
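
The abstract mentions attention mechanisms expressed in linear algebra notation. As a rough illustration only, the sketch below computes scaled dot-product attention, softmax(Q K^T / sqrt(d_k)) V, in plain Python. This is the standard formulation from the transformer literature and is assumed here; it is not taken from the paper, and the function names and list-of-lists layout are hypothetical choices made for readability.

```python
import math


def softmax(row):
    """Numerically stable softmax over a list of floats."""
    m = max(row)
    exps = [math.exp(x - m) for x in row]
    total = sum(exps)
    return [e / total for e in exps]


def scaled_dot_product_attention(Q, K, V):
    """Assumed shapes: Q is n x d_k, K is m x d_k, V is m x d_v.

    Each matrix is a list of rows. Returns an n x d_v list of rows.
    """
    d_k = len(K[0])
    d_v = len(V[0])
    scale = math.sqrt(d_k)
    output = []
    for q in Q:
        # Similarity of this query to every key, scaled by sqrt(d_k).
        scores = [sum(qi * ki for qi, ki in zip(q, k)) / scale for k in K]
        # Attention weights over the m keys; they sum to 1.
        weights = softmax(scores)
        # Weighted average of the value rows.
        output.append(
            [sum(w * v[j] for w, v in zip(weights, V)) for j in range(d_v)]
        )
    return output
```

Each output row is a weighted average of the value rows, so every position can draw on every other position in a single step. A recurrent network instead has to pass information along the sequence one token at a time.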

Code Implementations: 1 repo
