CVApr 12, 2024

SCOUT+: Towards Practical Task-Driven Drivers' Gaze Prediction

U of Toronto
arXiv:2404.08756v110 citationsh-index: 19Has Code2024 IEEE Intelligent Vehicles Symposium (IV)
Originality Incremental advance
AI Analysis

This work addresses a practical problem for driver monitoring systems by enabling task-aware gaze prediction without needing extensive annotations, though it is incremental as it builds on prior methods.

The paper tackles the challenge of predicting drivers' gaze in safety-critical scenarios by introducing SCOUT+, a model that uses GPS-derived route and map information to improve prediction accuracy, achieving performance comparable to a top-down model that requires privileged ground truth data.

Accurate prediction of drivers' gaze is an important component of vision-based driver monitoring and assistive systems. Of particular interest are safety-critical episodes, such as performing maneuvers or crossing intersections. In such scenarios, drivers' gaze distribution changes significantly and becomes difficult to predict, especially if the task and context information is represented implicitly, as is common in many state-of-the-art models. However, explicit modeling of top-down factors affecting drivers' attention often requires additional information and annotations that may not be readily available. In this paper, we address the challenge of effective modeling of task and context with common sources of data for use in practical systems. To this end, we introduce SCOUT+, a task- and context-aware model for drivers' gaze prediction, which leverages route and map information inferred from commonly available GPS data. We evaluate our model on two datasets, DR(eye)VE and BDD-A, and demonstrate that using maps improves results compared to bottom-up models and reaches performance comparable to the top-down model SCOUT which relies on privileged ground truth information. Code is available at https://github.com/ykotseruba/SCOUT.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes