CVMar 21, 2022

Transforming Model Prediction for Tracking

arXiv:2203.11192v1347 citationsh-index: 191
Originality Incremental advance
AI Analysis

This work improves visual object tracking performance for applications like surveillance and autonomous systems, representing an incremental advance by integrating Transformers into an existing framework.

The paper tackles the limited expressivity of optimization-based tracking methods by proposing a Transformer-based model prediction module, achieving a new state-of-the-art AUC of 68.5% on the LaSOT dataset.

Optimization based tracking methods have been widely successful by integrating a target model prediction module, providing effective global reasoning by minimizing an objective function. While this inductive bias integrates valuable domain knowledge, it limits the expressivity of the tracking network. In this work, we therefore propose a tracker architecture employing a Transformer-based model prediction module. Transformers capture global relations with little inductive bias, allowing it to learn the prediction of more powerful target models. We further extend the model predictor to estimate a second set of weights that are applied for accurate bounding box regression. The resulting tracker relies on training and on test frame information in order to predict all weights transductively. We train the proposed tracker end-to-end and validate its performance by conducting comprehensive experiments on multiple tracking datasets. Our tracker sets a new state of the art on three benchmarks, achieving an AUC of 68.5% on the challenging LaSOT dataset.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes