CLSDASMay 5, 2025

Automatic Proficiency Assessment in L2 English Learners

arXiv:2505.02615v12 citationsh-index: 23
Originality Synthesis-oriented
AI Analysis

This addresses the need for automated, consistent evaluation of English proficiency, reducing reliance on subjective human raters, though it appears incremental by applying existing models to this domain.

This paper tackles the problem of automating proficiency assessment for second language English learners by applying deep learning models to speech and text data, achieving robust results with a pretrained wav2vec 2.0 model on datasets including EFCamDat, ANGLISH, and a private dataset.

Second language proficiency (L2) in English is usually perceptually evaluated by English teachers or expert evaluators, with the inherent intra- and inter-rater variability. This paper explores deep learning techniques for comprehensive L2 proficiency assessment, addressing both the speech signal and its correspondent transcription. We analyze spoken proficiency classification prediction using diverse architectures, including 2D CNN, frequency-based CNN, ResNet, and a pretrained wav2vec 2.0 model. Additionally, we examine text-based proficiency assessment by fine-tuning a BERT language model within resource constraints. Finally, we tackle the complex task of spontaneous dialogue assessment, managing long-form audio and speaker interactions through separate applications of wav2vec 2.0 and BERT models. Results from experiments on EFCamDat and ANGLISH datasets and a private dataset highlight the potential of deep learning, especially the pretrained wav2vec 2.0 model, for robust automated L2 proficiency evaluation.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes