CVSep 27, 2024

AL-GTD: Deep Active Learning for Gaze Target Detection

arXiv:2409.18561v11 citationsh-index: 13Has Code
Originality Incremental advance
AI Analysis

This addresses the high labeling cost problem for computer vision researchers working on gaze analysis, though it appears incremental as it builds on existing active learning methods.

The paper tackles the problem of reducing labeled data requirements for gaze target detection by proposing AL-GTD, an active learning approach that integrates supervised and self-supervised losses with pseudo-labeling. It achieves state-of-the-art AUC results using only 40-50% of training data and reaches satisfactory performance with just 10-20% of data.

Gaze target detection aims at determining the image location where a person is looking. While existing studies have made significant progress in this area by regressing accurate gaze heatmaps, these achievements have largely relied on access to extensive labeled datasets, which demands substantial human labor. In this paper, our goal is to reduce the reliance on the size of labeled training data for gaze target detection. To achieve this, we propose AL-GTD, an innovative approach that integrates supervised and self-supervised losses within a novel sample acquisition function to perform active learning (AL). Additionally, it utilizes pseudo-labeling to mitigate distribution shifts during the training phase. AL-GTD achieves the best of all AUC results by utilizing only 40-50% of the training data, in contrast to state-of-the-art (SOTA) gaze target detectors requiring the entire training dataset to achieve the same performance. Importantly, AL-GTD quickly reaches satisfactory performance with 10-20% of the training data, showing the effectiveness of our acquisition function, which is able to acquire the most informative samples. We provide a comprehensive experimental analysis by adapting several AL methods for the task. AL-GTD outperforms AL competitors, simultaneously exhibiting superior performance compared to SOTA gaze target detectors when all are trained within a low-data regime. Code is available at https://github.com/francescotonini/al-gtd.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes