CVDec 11, 2019

SiamMan: Siamese Motion-aware Network for Visual Tracking

arXiv:1912.05515v23 citations
Originality Incremental advance
AI Analysis

This work addresses the problem of robust visual tracking for applications like surveillance and robotics, presenting an incremental improvement over existing siamese-based methods.

The paper tackles visual tracking by proposing SiamMan, a siamese motion-aware network that introduces a localization branch and global context module to handle target motion patterns, achieving leading accuracy on benchmarks like VOT2018 and OTB100 with high efficiency.

In this paper, we present a novel siamese motion-aware network (SiamMan) for visual tracking, which consists of the siamese feature extraction subnetwork, followed by the classification, regression, and localization branches in parallel. The classification branch is used to distinguish the foreground from background, and the regression branch is adopt to regress the bounding box of target. To reduce the impact of manually designed anchor boxes to adapt to different target motion patterns, we design the localization branch, which aims to coarsely localize the target to help the regression branch to generate accurate results. Meanwhile, we introduce the global context module into the localization branch to capture long-range dependency for more robustness in large displacement of target. In addition, we design a multi-scale learnable attention module to guide these three branches to exploit discriminative features for better performance. The whole network is trained offline in an end-to-end fashion with large-scale image pairs using the standard SGD algorithm with back-propagation. Extensive experiments on five challenging benchmarks, i.e., VOT2016, VOT2018, OTB100, UAV123 and LTB35, demonstrate that SiamMan achieves leading accuracy with high efficiency. Code can be found at https://isrc.iscas.ac.cn/gitlab/research/siamman.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes