CVIVMar 26, 2024

Every Shot Counts: Using Exemplars for Repetition Counting in Videos

arXiv:2403.18074v211 citationsh-index: 43ACCV
Originality Highly original
AI Analysis

This addresses the problem of accurately counting repetitive actions in videos for applications like sports analysis or surveillance, representing a strong specific gain rather than a foundational advance.

The paper tackles video repetition counting by proposing an exemplar-based approach that discovers visual correspondence across repetitions, achieving state-of-the-art performance on RepCount, Countix, and UCFRep datasets.

Video repetition counting infers the number of repetitions of recurring actions or motion within a video. We propose an exemplar-based approach that discovers visual correspondence of video exemplars across repetitions within target videos. Our proposed Every Shot Counts (ESCounts) model is an attention-based encoder-decoder that encodes videos of varying lengths alongside exemplars from the same and different videos. In training, ESCounts regresses locations of high correspondence to the exemplars within the video. In tandem, our method learns a latent that encodes representations of general repetitive motions, which we use for exemplar-free, zero-shot inference. Extensive experiments over commonly used datasets (RepCount, Countix, and UCFRep) showcase ESCounts obtaining state-of-the-art performance across all three datasets. Detailed ablations further demonstrate the effectiveness of our method.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes