HCCVLGFeb 5, 2022

VIS-iTrack: Visual Intention through Gaze Tracking using Low-Cost Webcam

arXiv:2202.02587v19 citations
AI Analysis

This work addresses the need for intention-aware interactive interfaces to facilitate human cognition, but it is incremental as it applies existing methods to a new dataset.

The paper tackled the problem of identifying visual intention (text vs. image) from real-time eye gaze data using a low-cost webcam, achieving 92.19% accuracy with an SVM classifier on a dataset of 124 samples from 31 participants.

Human intention is an internal, mental characterization for acquiring desired information. From interactive interfaces containing either textual or graphical information, intention to perceive desired information is subjective and strongly connected with eye gaze. In this work, we determine such intention by analyzing real-time eye gaze data with a low-cost regular webcam. We extracted unique features (e.g., Fixation Count, Eye Movement Ratio) from the eye gaze data of 31 participants to generate a dataset containing 124 samples of visual intention for perceiving textual or graphical information, labeled as either TEXT or IMAGE, having 48.39% and 51.61% distribution, respectively. Using this dataset, we analyzed 5 classifiers, including Support Vector Machine (SVM) (Accuracy: 92.19%). Using the trained SVM, we investigated the variation of visual intention among 30 participants, distributed in 3 age groups, and found out that young users were more leaned towards graphical contents whereas older adults felt more interested in textual ones. This finding suggests that real-time eye gaze data can be a potential source of identifying visual intention, analyzing which intention aware interactive interfaces can be designed and developed to facilitate human cognition.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes