CVAIMar 11, 2024

Exploiting Style Latent Flows for Generalizing Deepfake Video Detection

arXiv:2403.06592v387 citationsh-index: 10CVPR
Originality Incremental advance
AI Analysis

This addresses the challenge of generalizing deepfake detection across different datasets and manipulation methods, though it appears incremental as it builds on existing latent vector analysis.

The paper tackles the problem of detecting fake videos by analyzing abnormal temporal changes in style latent vectors, achieving superior performance in cross-dataset and cross-manipulation scenarios.

This paper presents a new approach for the detection of fake videos, based on the analysis of style latent vectors and their abnormal behavior in temporal changes in the generated videos. We discovered that the generated facial videos suffer from the temporal distinctiveness in the temporal changes of style latent vectors, which are inevitable during the generation of temporally stable videos with various facial expressions and geometric transformations. Our framework utilizes the StyleGRU module, trained by contrastive learning, to represent the dynamic properties of style latent vectors. Additionally, we introduce a style attention module that integrates StyleGRU-generated features with content-based features, enabling the detection of visual and temporal artifacts. We demonstrate our approach across various benchmark scenarios in deepfake detection, showing its superiority in cross-dataset and cross-manipulation scenarios. Through further analysis, we also validate the importance of using temporal changes of style latent vectors to improve the generality of deepfake video detection.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes