CVApr 1, 2025

Mixture-of-Attack-Experts with Class Regularization for Unified Physical-Digital Face Attack Detection

arXiv:2504.00458v111 citationsh-index: 19AAAI
Originality Incremental advance
AI Analysis

This addresses security vulnerabilities in facial recognition systems against diverse attack types, representing a strong domain-specific advancement.

The paper tackles the problem of detecting both physical and digital face attacks in facial recognition systems by proposing FG-MoE-CLIP-CAR, which addresses large intra-class variation in attacks and small inter-class variation between live and fake faces. The method achieves state-of-the-art performance on two unified physical-digital attack datasets.

Facial recognition systems in real-world scenarios are susceptible to both digital and physical attacks. Previous methods have attempted to achieve classification by learning a comprehensive feature space. However, these methods have not adequately accounted for the inherent characteristics of physical and digital attack data, particularly the large intra class variation in attacks and the small inter-class variation between live and fake faces. To address these limitations, we propose the Fine-Grained MoE with Class-Aware Regularization CLIP framework (FG-MoE-CLIP-CAR), incorporating key improvements at both the feature and loss levels. At the feature level, we employ a Soft Mixture of Experts (Soft MoE) architecture to leverage different experts for specialized feature processing. Additionally, we refine the Soft MoE to capture more subtle differences among various types of fake faces. At the loss level, we introduce two constraint modules: the Disentanglement Module (DM) and the Cluster Distillation Module (CDM). The DM enhances class separability by increasing the distance between the centers of live and fake face classes. However, center-to-center constraints alone are insufficient to ensure distinctive representations for individual features. Thus, we propose the CDM to further cluster features around their respective class centers while maintaining separation from other classes. Moreover, specific attacks that significantly deviate from common attack patterns are often overlooked. To address this issue, our distance calculation prioritizes more distant features. Experimental results on two unified physical-digital attack datasets demonstrate that the proposed method achieves state-of-the-art (SOTA) performance.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes