CVJan 30, 2025

Track-On: Transformer-based Online Point Tracking with Memory

arXiv:2501.18487v122 citationsh-index: 6ICLR
Originality Highly original
AI Analysis

It addresses the need for robust, real-time point tracking in streaming video applications, offering a scalable solution for diverse domains.

The paper tackles the problem of long-term point tracking in videos under challenging conditions like appearance changes and occlusions, introducing Track-On, a transformer-based online model that sets a new state-of-the-art for online models and achieves competitive results compared to offline approaches on seven datasets.

In this paper, we consider the problem of long-term point tracking, which requires consistent identification of points across multiple frames in a video, despite changes in appearance, lighting, perspective, and occlusions. We target online tracking on a frame-by-frame basis, making it suitable for real-world, streaming scenarios. Specifically, we introduce Track-On, a simple transformer-based model designed for online long-term point tracking. Unlike prior methods that depend on full temporal modeling, our model processes video frames causally without access to future frames, leveraging two memory modules -- spatial memory and context memory -- to capture temporal information and maintain reliable point tracking over long time horizons. At inference time, it employs patch classification and refinement to identify correspondences and track points with high accuracy. Through extensive experiments, we demonstrate that Track-On sets a new state-of-the-art for online models and delivers superior or competitive results compared to offline approaches on seven datasets, including the TAP-Vid benchmark. Our method offers a robust and scalable solution for real-time tracking in diverse applications. Project page: https://kuis-ai.github.io/track_on

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes