CVSep 20, 2022

Mitigating Representation Bias in Action Recognition: Algorithms and Benchmarks

Peking U
arXiv:2209.09393v19 citationsh-index: 87
Originality Incremental advance
AI Analysis

This work tackles bias in video datasets for action recognition, which is incremental as it builds on existing debiasing methods.

The paper addresses the problem of representation bias in action recognition by proposing SMAD for algorithmic debiasing and OmniDebias for dataset debiasing, achieving improved performance on rare scenes with fewer web data.

Deep learning models have achieved excellent recognition results on large-scale video benchmarks. However, they perform poorly when applied to videos with rare scenes or objects, primarily due to the bias of existing video datasets. We tackle this problem from two different angles: algorithm and dataset. From the perspective of algorithms, we propose Spatial-aware Multi-Aspect Debiasing (SMAD), which incorporates both explicit debiasing with multi-aspect adversarial training and implicit debiasing with the spatial actionness reweighting module, to learn a more generic representation invariant to non-action aspects. To neutralize the intrinsic dataset bias, we propose OmniDebias to leverage web data for joint training selectively, which can achieve higher performance with far fewer web data. To verify the effectiveness, we establish evaluation protocols and perform extensive experiments on both re-distributed splits of existing datasets and a new evaluation dataset focusing on the action with rare scenes. We also show that the debiased representation can generalize better when transferred to other datasets and tasks.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes