CVMar 24, 2022

FAMLP: A Frequency-Aware MLP-Like Architecture For Domain Generalization

arXiv:2203.12893v111 citationsh-index: 78
Originality Incremental advance
AI Analysis

This addresses domain generalization for MLP-like models, which is an incremental advance as it adapts existing frequency filtering techniques to a new architecture.

The paper tackles the problem of poor generalization of MLP-like models to heterogeneous tasks by proposing a frequency-aware MLP architecture that filters out domain-specific features in the frequency domain, achieving state-of-the-art performance with improvements of 3%, 4%, and 9% on three benchmarks.

MLP-like models built entirely upon multi-layer perceptrons have recently been revisited, exhibiting the comparable performance with transformers. It is one of most promising architectures due to the excellent trade-off between network capability and efficiency in the large-scale recognition tasks. However, its generalization performance to heterogeneous tasks is inferior to other architectures (e.g., CNNs and transformers) due to the extensive retention of domain information. To address this problem, we propose a novel frequency-aware MLP architecture, in which the domain-specific features are filtered out in the transformed frequency domain, augmenting the invariant descriptor for label prediction. Specifically, we design an adaptive Fourier filter layer, in which a learnable frequency filter is utilized to adjust the amplitude distribution by optimizing both the real and imaginary parts. A low-rank enhancement module is further proposed to rectify the filtered features by adding the low-frequency components from SVD decomposition. Finally, a momentum update strategy is utilized to stabilize the optimization to fluctuation of model parameters and inputs by the output distillation with weighted historical states. To our best knowledge, we are the first to propose a MLP-like backbone for domain generalization. Extensive experiments on three benchmarks demonstrate significant generalization performance, outperforming the state-of-the-art methods by a margin of 3%, 4% and 9%, respectively.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes