CVJul 25, 2022

Domain Decorrelation with Potential Energy Ranking

Sen Pei, Jiaxi Sun, Richard Yi Da Xu, Shiming Xiang, Gaofeng Meng

arXiv:2207.12194v33.74 citationsh-index: 60Has Code

Originality Incremental advance

AI Analysis

This addresses domain generalization for computer vision systems, which is crucial for real-world applications where training and test data distributions differ, though it appears incremental as it builds on existing methods for feature decoupling.

The paper tackles the problem of domain shift in deep learning by proposing Potential Energy Ranking (PoER) to decouple object and domain features, improving average top-1 accuracy by at least 1.20% on domain generalization benchmarks and achieving top place in the ECCV 2022 NICO Challenge with a vanilla ResNet-18.

Machine learning systems, especially the methods based on deep learning, enjoy great success in modern computer vision tasks under experimental settings. Generally, these classic deep learning methods are built on the \emph{i.i.d.} assumption, supposing the training and test data are drawn from a similar distribution independently and identically. However, the aforementioned \emph{i.i.d.} assumption is in general unavailable in the real-world scenario, and as a result, leads to sharp performance decay of deep learning algorithms. Behind this, domain shift is one of the primary factors to be blamed. In order to tackle this problem, we propose using \textbf{Po}tential \textbf{E}nergy \textbf{R}anking (PoER) to decouple the object feature and the domain feature (\emph{i.e.,} appearance feature) in given images, promoting the learning of label-discriminative features while filtering out the irrelevant correlations between the objects and the background. PoER helps the neural networks to capture label-related features which contain the domain information first in shallow layers and then distills the label-discriminative representations out progressively, enforcing the neural networks to be aware of the characteristic of objects and background which is vital to the generation of domain-invariant features. PoER reports superior performance on domain generalization benchmarks, improving the average top-1 accuracy by at least 1.20\% compared to the existing methods. Moreover, we use PoER in the ECCV 2022 NICO Challenge\footnote{https://nicochallenge.com}, achieving top place with only a vanilla ResNet-18. The code has been made available at https://github.com/ForeverPs/PoER.

View on arXiv PDF Code

Similar