AS SD SP
Dec 1, 2021

Predicting lexical skills from oral reading with acoustic measures

Charvi Vitthal, Shreeharsha B S, Kamini Sabu, Preeti Rao

arXiv:2112.00635v1

Originality: Incremental advance

AI Analysis

The approach gives education administrators a more efficient, language-agnostic tool for assessing literacy, though the advance is incremental because it builds on existing acoustic analysis methods.

The authors tackled the problem of automating literacy assessment by predicting children's word-decoding skills from oral reading recordings using simple acoustic features, achieving performance close to language-dependent automatic speech recognition without its computational and training burdens.

Literacy assessment is an important activity for education administrators across the globe. Typically achieved in a school setting by testing a child's oral reading, it is intensive in human resources. While automatic speech recognition (ASR) is a potential solution to the problem, it tends to be computationally expensive for hand-held devices and also needs language- and accent-specific speech for training. In this work, we propose a system to predict the word-decoding skills of a student based on simple acoustic features derived from the recording. We first identify a meaningful categorization of word-decoding skills by analyzing a manually transcribed data set of children's oral reading recordings. Next, the automatic prediction of the category is attempted with the proposed acoustic features. Pause statistics, syllable rate, and spectral and intensity dynamics are found to be reliable indicators of specific types of oral reading deficits, providing useful feedback by discriminating the different characteristics of beginning readers. This computationally simple and language-agnostic approach is found to provide a performance close to that obtained using a language-dependent ASR that required considerable tuning of its parameters.
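To make the notion of "pause statistics" concrete, the following is a minimal sketch of how such features might be derived from a frame-level energy contour. It is not the authors' implementation: the function name `pause_statistics`, the 10 ms hop, the -40 dB silence threshold, and the 0.2 s minimum pause duration are all assumptions chosen for illustration.

```python
from itertools import groupby


def pause_statistics(frame_energy_db, hop_s=0.01, silence_db=-40.0, min_pause_s=0.2):
    """Summarize pauses in a frame-level energy contour (values in dB).

    All thresholds are illustrative assumptions, not values from the paper.
    """
    # Mark each frame as silent if its energy falls below the threshold.
    silent = [energy < silence_db for energy in frame_energy_db]

    # Measure the length (in frames) of every consecutive run of silent frames.
    run_lengths = [
        sum(1 for _ in run) for is_silent, run in groupby(silent) if is_silent
    ]

    # Keep only silent runs long enough to count as a pause.
    pauses = [n * hop_s for n in run_lengths if n * hop_s >= min_pause_s]

    total_s = len(silent) * hop_s
    return {
        "pause_count": len(pauses),
        "mean_pause_s": sum(pauses) / len(pauses) if pauses else 0.0,
        "pause_fraction": sum(pauses) / total_s if total_s > 0 else 0.0,
    }


# Synthetic contour: speech, a 0.3 s gap, speech, a 0.1 s gap, speech.
contour = [-20.0] * 50 + [-60.0] * 30 + [-20.0] * 50 + [-60.0] * 10 + [-20.0] * 20
stats = pause_statistics(contour)
```

In this synthetic example only the 0.3 s gap exceeds the assumed minimum duration, so it is the only run counted as a pause. Features of this kind need no transcript or language model, which is consistent with the language-agnostic framing of the abstract.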
