HCAI, Jan 29

Attention Guidance through Video Script: A Case Study of Object Focusing on 360° VR Video Tours

arXiv:2603.16875, 6 citations, h-index: 4
AI Analysis

This paper addresses the challenge of guiding viewers' attention in immersive VR environments, but the contribution is incremental: it applies existing models to a specific domain.

The paper tackles the problem of guiding viewer attention in 360° VR videos, which often lack effective ways to draw focus to specific elements. It combines the Grounding DINO and Segment Anything models to direct attention to objects based on the video script, and experiments on a university tour indicate an improved user experience.

Within the expansive domain of virtual reality (VR), 360° VR videos immerse viewers in a spherical environment, allowing them to explore and interact with the virtual world from all angles. While this video representation offers unparalleled levels of immersion, it often lacks effective methods to guide viewers' attention toward specific elements within the virtual environment. This paper combines the Grounding DINO and Segment Anything (SAM) models to guide attention through object focusing based on video scripts. As a case study, the experiments are conducted on a 360° video tour of the University of Reading. The experimental results show that video scripts can improve the user experience of 360° VR video tours by helping to direct the user's attention.
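The abstract does not say how a detected object becomes a visual cue, so the following is only a minimal sketch under stated assumptions. It assumes the frame is stored in equirectangular projection, that a detector such as Grounding DINO has already returned a pixel bounding box, and that a segmenter such as SAM has already returned a boolean mask. The `Box` type, the three function names, and the dimming factor are hypothetical and are not taken from the paper.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class Box:
    """Pixel-space bounding box (hypothetical stand-in for a detector output)."""
    x_min: float
    y_min: float
    x_max: float
    y_max: float


def box_center_to_yaw_pitch(box: Box, width: int, height: int) -> Tuple[float, float]:
    """Map the box centre in an equirectangular frame to (yaw, pitch) in degrees.

    Yaw runs from -180 (left edge) to +180 (right edge);
    pitch runs from +90 (top edge) to -90 (bottom edge).
    """
    cx = (box.x_min + box.x_max) / 2.0
    cy = (box.y_min + box.y_max) / 2.0
    yaw = (cx / width - 0.5) * 360.0
    pitch = (0.5 - cy / height) * 180.0
    return yaw, pitch


def yaw_offset(view_yaw: float, target_yaw: float) -> float:
    """Signed shortest rotation in degrees from the viewer's yaw to the target, in [-180, 180)."""
    return (target_yaw - view_yaw + 180.0) % 360.0 - 180.0


def dim_outside_mask(frame: List[List[float]], mask: List[List[bool]],
                     factor: float = 0.3) -> List[List[float]]:
    """Darken every pixel outside the mask so the masked object stands out.

    `frame` holds rows of grayscale values; `mask` holds rows of booleans
    (assumed to come from a segmentation model).
    """
    return [
        [pixel if keep else pixel * factor for pixel, keep in zip(row, mask_row)]
        for row, mask_row in zip(frame, mask)
    ]
```

For a 3840×1920 frame, `box_center_to_yaw_pitch(Box(2780, 380, 2980, 580), 3840, 1920)` places the object at yaw 90° and pitch 45°, and `yaw_offset` then gives the direction in which a viewer would need to turn. A box that straddles the left/right seam of the frame would need separate handling, which this sketch leaves out.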

