CVDec 4, 2023

MMFusion: Combining Image Forensic Filters for Visual Manipulation Detection and Localization

arXiv:2312.01790v28 citationsh-index: 6
Originality Incremental advance
AI Analysis

This addresses the need for more robust image manipulation detection, which is crucial for security and media verification, though it appears incremental as it builds on existing filter-based methods.

The paper tackles the problem of detecting and localizing manipulated images by combining multiple forensic filters that capture complementary artifacts, resulting in MMFusion, which outperforms state-of-the-art models across several datasets.

Recent image manipulation localization and detection techniques typically leverage forensic artifacts and traces that are produced by a noise-sensitive filter, such as SRM or Bayar convolution. In this paper, we showcase that different filters commonly used in such approaches excel at unveiling different types of manipulations and provide complementary forensic traces. Thus, we explore ways of combining the outputs of such filters to leverage the complementary nature of the produced artifacts for performing image manipulation localization and detection (IMLD). We assess two distinct combination methods: one that produces independent features from each forensic filter and then fuses them (this is referred to as late fusion) and one that performs early mixing of different modal outputs and produces combined features (this is referred to as early fusion). We use the latter as a feature encoding mechanism, accompanied by a new decoding mechanism that encompasses feature re-weighting, for formulating the proposed MMFusion architecture. We demonstrate that MMFusion achieves competitive performance for both image manipulation localization and detection, outperforming state-of-the-art models across several image and video datasets. We also investigate further the contribution of each forensic filter within MMFusion for addressing different types of manipulations, building on recent AI explainability measures.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes