CVJul 4, 2023

ChildPlay: A New Benchmark for Understanding Children's Gaze Behaviour

arXiv:2307.01630v143 citationsh-index: 48
Originality Incremental advance
AI Analysis

This work addresses a gap in gaze behavior analysis for diagnosing developmental disorders in children, though it is incremental as it extends existing methods to a new domain.

The paper tackles the lack of public datasets for predicting gaze targets in children by introducing the ChildPlay dataset, featuring annotated videos of children interacting in uncontrolled environments, and proposes a geometrically grounded model that achieves state-of-the-art results on benchmarks and ChildPlay, with fine-tuning improving face-looking prediction in children by a significant margin.

Gaze behaviors such as eye-contact or shared attention are important markers for diagnosing developmental disorders in children. While previous studies have looked at some of these elements, the analysis is usually performed on private datasets and is restricted to lab settings. Furthermore, all publicly available gaze target prediction benchmarks mostly contain instances of adults, which makes models trained on them less applicable to scenarios with young children. In this paper, we propose the first study for predicting the gaze target of children and interacting adults. To this end, we introduce the ChildPlay dataset: a curated collection of short video clips featuring children playing and interacting with adults in uncontrolled environments (e.g. kindergarten, therapy centers, preschools etc.), which we annotate with rich gaze information. We further propose a new model for gaze target prediction that is geometrically grounded by explicitly identifying the scene parts in the 3D field of view (3DFoV) of the person, leveraging recent geometry preserving depth inference methods. Our model achieves state of the art results on benchmark datasets and ChildPlay. Furthermore, results show that looking at faces prediction performance on children is much worse than on adults, and can be significantly improved by fine-tuning models using child gaze annotations. Our dataset and models will be made publicly available.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes