CVJan 10, 2025

HFMF: Hierarchical Fusion Meets Multi-Stream Models for Deepfake Detection

arXiv:2501.05631v18 citationsh-index: 12025 IEEE/CVF Winter Conference on Applications of Computer Vision Workshops (WACVW)
Originality Incremental advance
AI Analysis

This addresses the challenge of detecting realistic synthetic images and videos for mitigating misinformation, representing an incremental improvement in deepfake detection methods.

The authors tackled the problem of detecting AI-generated deepfakes by proposing HFMF, a two-stage framework using hierarchical cross-modal feature fusion and multi-stream feature extraction, which achieved superior performance across diverse dataset benchmarks.

The rapid progress in deep generative models has led to the creation of incredibly realistic synthetic images that are becoming increasingly difficult to distinguish from real-world data. The widespread use of Variational Models, Diffusion Models, and Generative Adversarial Networks has made it easier to generate convincing fake images and videos, which poses significant challenges for detecting and mitigating the spread of misinformation. As a result, developing effective methods for detecting AI-generated fakes has become a pressing concern. In our research, we propose HFMF, a comprehensive two-stage deepfake detection framework that leverages both hierarchical cross-modal feature fusion and multi-stream feature extraction to enhance detection performance against imagery produced by state-of-the-art generative AI models. The first component of our approach integrates vision Transformers and convolutional nets through a hierarchical feature fusion mechanism. The second component of our framework combines object-level information and a fine-tuned convolutional net model. We then fuse the outputs from both components via an ensemble deep neural net, enabling robust classification performances. We demonstrate that our architecture achieves superior performance across diverse dataset benchmarks while maintaining calibration and interoperability.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes