CV AISep 20, 2025

FakeChain: Exposing Shallow Cues in Multi-Step Deepfake Detection

arXiv:2509.16602v21 citationsh-index: 3Has CodeCIKM

Originality Synthesis-oriented

AI Analysis

This addresses a growing challenge in deepfake detection for security and media integrity, but it is incremental as it primarily exposes a limitation in existing models rather than proposing a new solution.

The paper tackles the problem of detecting multi-step deepfakes, which combine different manipulation methods, and finds that detection performance drops by up to 58.83% in F1-score when the final manipulation type differs from the training distribution, revealing that detectors rely on last-stage artifacts rather than cumulative traces.

Multi-step or hybrid deepfakes, created by sequentially applying different deepfake creation methods such as Face-Swapping, GAN-based generation, and Diffusion methods, can pose an emerging and unforseen technical challenge for detection models trained on single-step forgeries. While prior studies have mainly focused on detecting isolated single manipulation, little is known about the detection model behavior under such compositional, hybrid, and complex manipulation pipelines. In this work, we introduce \textbf{FakeChain}, a large-scale benchmark comprising 1-, 2-, and 3-Step forgeries synthesized using five state-of-the-art representative generators. Using this approach, we analyze detection performance and spectral properties across hybrid manipulation at different step, along with varying generator combinations and quality settings. Surprisingly, our findings reveal that detection performance highly depends on the final manipulation type, with F1-score dropping by up to \textbf{58.83\%} when it differs from training distribution. This clearly demonstrates that detectors rely on last-stage artifacts rather than cumulative manipulation traces, limiting generalization. Such findings highlight the need for detection models to explicitly consider manipulation history and sequences. Our results highlight the importance of benchmarks such as FakeChain, reflecting growing synthesis complexity and diversity in real-world scenarios. Our sample code is available here\footnote{https://github.com/minjihh/FakeChain}.

View on arXiv PDF Code

Similar