CVAIApr 7, 2025

From Specificity to Generality: Revisiting Generalizable Artifacts in Detecting Face Deepfakes

arXiv:2504.04827v214 citationsh-index: 3
Originality Incremental advance
AI Analysis

This addresses the problem of detecting varied deepfakes for security and media integrity, but it is incremental as it builds on existing artifact-based detection methods.

The paper tackles the challenge of detecting diverse facial deepfakes by proposing a universal detection framework that focuses on general artifacts, Face Inconsistency Artifacts (FIA) and Up-Sampling Artifacts (USA), rather than specific ones. The result is that a standard image classifier trained on pseudo-fake data created with these artifacts generalizes well to unseen deepfakes, though no concrete numbers are provided.

Detecting deepfakes has been an increasingly important topic, especially given the rapid development of AI generation techniques. In this paper, we ask: How can we build a universal detection framework that is effective for most facial deepfakes? One significant challenge is the wide variety of deepfake generators available, resulting in varying forgery artifacts (e.g., lighting inconsistency, color mismatch, etc). But should we ``teach" the detector to learn all these artifacts separately? It is impossible and impractical to elaborate on them all. So the core idea is to pinpoint the more common and general artifacts across different deepfakes. Accordingly, we categorize deepfake artifacts into two distinct yet complementary types: Face Inconsistency Artifacts (FIA) and Up-Sampling Artifacts (USA). FIA arise from the challenge of generating all intricate details, inevitably causing inconsistencies between the complex facial features and relatively uniform surrounding areas. USA, on the other hand, are the inevitable traces left by the generator's decoder during the up-sampling process. This categorization stems from the observation that all existing deepfakes typically exhibit one or both of these artifacts. To achieve this, we propose a new data-level pseudo-fake creation framework that constructs fake samples with only the FIA and USA, without introducing extra less-general artifacts. Specifically, we employ a super-resolution to simulate the USA, while design a Blender module that uses image-level self-blending on diverse facial regions to create the FIA. We surprisingly found that, with this intuitive design, a standard image classifier trained only with our pseudo-fake data can non-trivially generalize well to unseen deepfakes.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes