CVNov 26, 2019

Multi-Task Driven Feature Models for Thermal Infrared Tracking

arXiv:1911.11384v173 citationsHas Code
Originality Incremental advance
AI Analysis

This work addresses the challenge of adapting tracking methods from RGB to TIR domains, which is important for applications like surveillance and autonomous systems, but it is incremental as it builds on existing deep tracking approaches.

The paper tackles the problem of thermal infrared (TIR) tracking by developing a multi-task framework to learn TIR-specific discriminative and fine-grained correlation features, achieving a relative gain of 10% over the baseline and performing favorably against state-of-the-art methods on three benchmarks.

Existing deep Thermal InfraRed (TIR) trackers usually use the feature models of RGB trackers for representation. However, these feature models learned on RGB images are neither effective in representing TIR objects nor taking fine-grained TIR information into consideration. To this end, we develop a multi-task framework to learn the TIR-specific discriminative features and fine-grained correlation features for TIR tracking. Specifically, we first use an auxiliary classification network to guide the generation of TIR-specific discriminative features for distinguishing the TIR objects belonging to different classes. Second, we design a fine-grained aware module to capture more subtle information for distinguishing the TIR objects belonging to the same class. These two kinds of features complement each other and recognize TIR objects in the levels of inter-class and intra-class respectively. These two feature models are learned using a multi-task matching framework and are jointly optimized on the TIR tracking task. In addition, we develop a large-scale TIR training dataset to train the network for adapting the model to the TIR domain. Extensive experimental results on three benchmarks show that the proposed algorithm achieves a relative gain of 10% over the baseline and performs favorably against the state-of-the-art methods. Codes and the proposed TIR dataset are available at {https://github.com/QiaoLiuHit/MMNet}.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes