SELGJul 11, 2024

Semantic GUI Scene Learning and Video Alignment for Detecting Duplicate Video-based Bug Reports

arXiv:2407.08610v112 citationsh-index: 12
Originality Highly original
AI Analysis

This addresses the problem of managing duplicate video bug reports for developers, offering a significant improvement over existing methods.

The paper tackles duplicate detection for video-based bug reports by introducing JANUS, which uses vision transformers and adaptive video alignment to capture subtle GUI patterns, achieving an mRR/mAP of 89.8%/84.7% and outperforming prior work by around 9% in most tasks.

Video-based bug reports are increasingly being used to document bugs for programs centered around a graphical user interface (GUI). However, developing automated techniques to manage video-based reports is challenging as it requires identifying and understanding often nuanced visual patterns that capture key information about a reported bug. In this paper, we aim to overcome these challenges by advancing the bug report management task of duplicate detection for video-based reports. To this end, we introduce a new approach, called JANUS, that adapts the scene-learning capabilities of vision transformers to capture subtle visual and textual patterns that manifest on app UI screens - which is key to differentiating between similar screens for accurate duplicate report detection. JANUS also makes use of a video alignment technique capable of adaptive weighting of video frames to account for typical bug manifestation patterns. In a comprehensive evaluation on a benchmark containing 7,290 duplicate detection tasks derived from 270 video-based bug reports from 90 Android app bugs, the best configuration of our approach achieves an overall mRR/mAP of 89.8%/84.7%, and for the large majority of duplicate detection tasks, outperforms prior work by around 9% to a statistically significant degree. Finally, we qualitatively illustrate how the scene-learning capabilities provided by Janus benefits its performance.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes