CVApr 22, 2024

360VOTS: Visual Object Tracking and Segmentation in Omnidirectional Videos

arXiv:2404.13953v28 citationsh-index: 8IEEE Trans Pattern Anal Mach Intell
Originality Incremental advance
AI Analysis

This work addresses challenges in omnidirectional video analysis for computer vision applications, but it is incremental as it builds upon previous tracking work and focuses on dataset creation.

The authors tackled visual object tracking and segmentation in omnidirectional videos by introducing a novel representation (eBFoV) and a comprehensive dataset (360VOS) with 290 sequences, demonstrating effectiveness through benchmarking state-of-the-art approaches.

Visual object tracking and segmentation in omnidirectional videos are challenging due to the wide field-of-view and large spherical distortion brought by 360° images. To alleviate these problems, we introduce a novel representation, extended bounding field-of-view (eBFoV), for target localization and use it as the foundation of a general 360 tracking framework which is applicable for both omnidirectional visual object tracking and segmentation tasks. Building upon our previous work on omnidirectional visual object tracking (360VOT), we propose a comprehensive dataset and benchmark that incorporates a new component called omnidirectional video object segmentation (360VOS). The 360VOS dataset includes 290 sequences accompanied by dense pixel-wise masks and covers a broader range of target categories. To support both the development and evaluation of algorithms in this domain, we divide the dataset into a training subset with 170 sequences and a testing subset with 120 sequences. Furthermore, we tailor evaluation metrics for both omnidirectional tracking and segmentation to ensure rigorous assessment. Through extensive experiments, we benchmark state-of-the-art approaches and demonstrate the effectiveness of our proposed 360 tracking framework and training dataset. Homepage: https://360vots.hkustvgd.com/

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes