CV | AI | Mar 19, 2023

TempT: Temporal consistency for Test-time adaptation

arXiv:2303.10536v2 | 10 citations | h-index: 13
Originality: Incremental advance
AI Analysis

This paper addresses the problem of adapting models to video data at inference time for computer vision tasks such as facial expression recognition. The advance is incremental because it builds on existing test-time adaptation methods.

The paper tackles test-time adaptation for videos by using temporal consistency of predictions as a self-supervision signal, achieving competitive performance on the AffWild2 dataset compared to prior reported results.

We introduce Temporal consistency for Test-time adaptation (TempT), a novel method for test-time adaptation on videos that uses the temporal coherence of predictions across sequential frames as a self-supervision signal. TempT has broad potential applications in computer vision tasks, including facial expression recognition (FER) in videos. We evaluate TempT on the AffWild2 dataset. Our approach focuses solely on the unimodal visual aspect of the data and uses a popular 2D CNN backbone, in contrast to the larger sequential or attention-based models used in other approaches. Our preliminary experimental results show that TempT performs competitively with the results reported in previous years, and its efficacy provides a compelling proof of concept for its use in various real-world applications.
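The abstract does not say how temporal coherence is measured, so the following is only a minimal sketch of the general idea, not the paper's actual objective. It assumes a hypothetical penalty, `temporal_consistency_loss`, defined as the mean squared change in softmax probabilities between adjacent frames. The function names and the example logits are illustrative assumptions and do not come from the paper.

```python
import math
from typing import List, Sequence


def softmax(logits: Sequence[float]) -> List[float]:
    """Convert one frame's logits into class probabilities."""
    peak = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def temporal_consistency_loss(frame_logits: Sequence[Sequence[float]]) -> float:
    """Mean squared change in class probabilities between adjacent frames.

    Hypothetical stand-in for a temporal-coherence objective (an assumption,
    not the paper's stated loss). It uses predictions only, so no labels
    are required.
    """
    probs = [softmax(logits) for logits in frame_logits]
    if len(probs) < 2:
        return 0.0  # a single frame has no temporal neighbour
    total = 0.0
    for prev, curr in zip(probs, probs[1:]):
        total += sum((p - c) ** 2 for p, c in zip(prev, curr))
    return total / (len(probs) - 1)


# Illustrative per-frame logits for a 3-class problem (made-up values).
steady = [[2.0, 0.0, 0.0], [2.0, 0.0, 0.0], [2.0, 0.0, 0.0]]
flicker = [[2.0, 0.0, 0.0], [0.0, 2.0, 0.0], [2.0, 0.0, 0.0]]

print(temporal_consistency_loss(steady))   # identical frames give zero penalty
print(temporal_consistency_loss(flicker))  # flickering predictions are penalised
```

In a test-time adaptation setting, a quantity like this would be minimised with respect to some of the model's parameters on each unlabeled test video. The sketch only computes the value and leaves out the optimisation step, which would need an automatic differentiation framework.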
