HCCVIRMay 12, 2025

A Versatile Dataset of Mouse and Eye Movements on Search Engine Results Pages

arXiv:2507.08003v14 citationsh-index: 3SIGIR
Originality Synthesis-oriented
AI Analysis

This provides a more reliable dataset for researchers studying user behavior on SERPs, though it is incremental as it builds on existing proxy methods by adding eye-tracking.

The authors tackled the problem of inaccurate and biased self-reported ground-truth labels for user attention on Search Engine Result Pages by creating a dataset with objective eye-tracking data, comprising 2,776 queries from 47 participants, and including various data types like HTML, screenshots, and movement data.

We contribute a comprehensive dataset to study user attention and purchasing behavior on Search Engine Result Pages (SERPs). Previous work has relied on mouse movements as a low-cost large-scale behavioral proxy but also has relied on self-reported ground-truth labels, collected at post-task, which can be inaccurate and prone to biases. To address this limitation, we use an eye tracker to construct an objective ground-truth of continuous visual attention. Our dataset comprises 2,776 transactional queries on Google SERPs, collected from 47 participants, and includes: (1) HTML source files, with CSS and images; (2) rendered SERP screenshots; (3) eye movement data; (4) mouse movement data; (5) bounding boxes of direct display and organic advertisements; and (6) scripts for further preprocessing the data. In this paper we provide an overview of the dataset and baseline experiments (classification tasks) that can inspire researchers about the different possibilities for future work.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes