CV | Sep 19, 2024

Real-time estimation of overt attention from dynamic features of the face using deep-learning

arXiv:2409.13084v2 | 2 citations | h-index: 4 | Has Code
Originality: Incremental advance
AI Analysis

This paper addresses the loss of visual feedback for teachers in remote learning by enabling real-time attention monitoring, with potential applications in online education. The advance is incremental, since it builds on existing methods for attention estimation.

The paper tackles the problem of estimating student attention in remote learning by using front-facing video to predict attention levels based on eye, head, and face movements, achieving R^2 scores of 0.38 on unseen data and 0.26-0.30 on unseen subjects.

Students often drift in and out of focus during class. Effective teachers recognize this and re-engage them when necessary. With the shift to remote learning, teachers have lost the visual feedback needed to adapt to varying student engagement. We propose using readily available front-facing video to infer attention levels based on movements of the eyes, head, and face. We train a deep learning model to predict a measure of attention based on overt eye movements. Specifically, we measure Inter-Subject Correlation of eye movements in ten-second intervals while students watch the same educational videos. In 3 different experiments (N=83) we show that the trained model predicts this objective metric of attention on unseen data with $R^2$=0.38, and on unseen subjects with $R^2$=0.26-0.30. The deep network relies mostly on a student's eye movements, but to some extent also on movements of the brows, cheeks, and head. In contrast to Inter-Subject Correlation of the eyes, the model can estimate attentional engagement from individual students' movements without needing reference data from an attentive group. This enables a much broader set of online applications. The solution is lightweight and can operate on the client side, which mitigates some of the privacy concerns associated with online attention monitoring. GitHub implementation is available at https://github.com/asortubay/timeISC
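The training target described above, Inter-Subject Correlation (ISC) of eye movements in ten-second intervals, can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes gaze is stored as an array of shape `(n_subjects, n_samples, 2)` (horizontal and vertical position), that a subject's ISC in a window is the Pearson correlation between that subject's trace and the mean trace of all other subjects, and that the two gaze dimensions are averaged. The names `pearson`, `windowed_isc`, `fs`, and `window_s` are hypothetical.

```python
import numpy as np


def pearson(a, b):
    """Pearson correlation of two 1-D arrays; returns 0.0 if either is constant."""
    a = a - a.mean()
    b = b - b.mean()
    denom = np.sqrt((a * a).sum() * (b * b).sum())
    return float((a * b).sum() / denom) if denom > 0 else 0.0


def windowed_isc(gaze, fs, window_s=10.0):
    """Per-subject ISC in non-overlapping windows.

    gaze: array of shape (n_subjects, n_samples, n_dims), an assumed layout.
    fs: sampling rate in Hz.
    Returns an array of shape (n_subjects, n_windows).
    """
    n_subjects, n_samples, n_dims = gaze.shape
    win = int(round(window_s * fs))   # samples per window
    n_windows = n_samples // win      # trailing partial window is dropped
    total = gaze.sum(axis=0)          # sum over subjects, reused below
    isc = np.zeros((n_subjects, n_windows))
    for s in range(n_subjects):
        # Leave-one-out reference: mean gaze of everyone except subject s.
        others_mean = (total - gaze[s]) / (n_subjects - 1)
        for w in range(n_windows):
            sl = slice(w * win, (w + 1) * win)
            # Assumption: average the correlation over gaze dimensions.
            isc[s, w] = np.mean(
                [pearson(gaze[s, sl, d], others_mean[sl, d]) for d in range(n_dims)]
            )
    return isc
```

Under these assumptions, each entry of the returned array is one ten-second ISC value for one subject. A model like the one in the abstract would be trained to predict such values from a single student's video, so that no reference group is needed at inference time.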

Code Implementations: 1 repo
