Visual Memorability for Robotic Interestingness via Unsupervised Online Learning
This addresses an underexplored problem crucial for autonomous exploration and decision-making in robotics, with incremental improvements in accuracy.
The paper tackles the problem of predicting interesting scenes for mobile robots by proposing a translation-invariant visual memory and a three-stage architecture, achieving much higher accuracy than state-of-the-art methods on challenging datasets.
In this paper, we explore the problem of interesting scene prediction for mobile robots. This area is currently underexplored but is crucial for many practical applications such as autonomous exploration and decision making. Inspired by industrial demands, we first propose a novel translation-invariant visual memory for recalling and identifying interesting scenes, then design a three-stage architecture of long-term, short-term, and online learning. This enables our system to learn human-like experience, environmental knowledge, and online adaption, respectively. Our approach achieves much higher accuracy than the state-of-the-art algorithms on challenging robotic interestingness datasets.