CVMay 8, 2024

TENet: Targetness Entanglement Incorporating with Multi-Scale Pooling and Mutually-Guided Fusion for RGB-E Object Tracking

arXiv:2405.05004v121 citationsh-index: 17Has CodeNeural Networks
Originality Incremental advance
AI Analysis

This work addresses the challenge of robust object tracking in dynamic scenes for applications like robotics and surveillance, representing an incremental advance by adapting existing methods to better handle event data.

The paper tackles the problem of improving visual object tracking by incorporating event camera data with RGB, proposing a method that adapts feature extraction to event data sparsity and achieves state-of-the-art performance, with precision and success rates on the COESOT dataset improved by 4.9% and 5.2%, respectively.

There is currently strong interest in improving visual object tracking by augmenting the RGB modality with the output of a visual event camera that is particularly informative about the scene motion. However, existing approaches perform event feature extraction for RGB-E tracking using traditional appearance models, which have been optimised for RGB only tracking, without adapting it for the intrinsic characteristics of the event data. To address this problem, we propose an Event backbone (Pooler), designed to obtain a high-quality feature representation that is cognisant of the innate characteristics of the event data, namely its sparsity. In particular, Multi-Scale Pooling is introduced to capture all the motion feature trends within event data through the utilisation of diverse pooling kernel sizes. The association between the derived RGB and event representations is established by an innovative module performing adaptive Mutually Guided Fusion (MGF). Extensive experimental results show that our method significantly outperforms state-of-the-art trackers on two widely used RGB-E tracking datasets, including VisEvent and COESOT, where the precision and success rates on COESOT are improved by 4.9% and 5.2%, respectively. Our code will be available at https://github.com/SSSpc333/TENet.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes