CVMar 1, 2021

MFST: Multi-Features Siamese Tracker

arXiv:2103.00810v11 citationsHas Code
Originality Incremental advance
AI Analysis

This work addresses a specific bottleneck in visual object tracking for applications requiring robust performance, representing an incremental improvement over existing Siamese trackers.

The paper tackles the limitation of Siamese trackers relying on single convolutional layers for similarity analysis by proposing MFST, which fuses hierarchical features from multiple layers to achieve higher tracking accuracy and outperform several state-of-the-art trackers.

Siamese trackers have recently achieved interesting results due to their balance between accuracy and speed. This success is mainly due to the fact that deep similarity networks were specifically designed to address the image similarity problem. Therefore, they are inherently more appropriate than classical CNNs for the tracking task. However, Siamese trackers rely on the last convolutional layers for similarity analysis and target search, which restricts their performance. In this paper, we argue that using a single convolutional layer as feature representation is not the optimal choice within the deep similarity framework, as multiple convolutional layers provide several abstraction levels in characterizing an object. Starting from this motivation, we present the Multi-Features Siamese Tracker (MFST), a novel tracking algorithm exploiting several hierarchical feature maps for robust deep similarity tracking. MFST proceeds by fusing hierarchical features to ensure a richer and more efficient representation. Moreover, we handle appearance variation by calibrating deep features extracted from two different CNN models. Based on this advanced feature representation, our algorithm achieves high tracking accuracy, while outperforming several state-of-the-art trackers, including standard Siamese trackers. The code and trained models are available at https://github.com/zhenxili96/MFST.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes