CV LGDec 30, 2024

HFI: A unified framework for training-free detection and implicit watermarking of latent diffusion model generated images

Sungik Choi, Sungwoo Park, Jaehoon Lee, Seunghyun Kim, Stanley Jungkyu Choi, Moontae Lee

arXiv:2412.20704v18.78 citationsh-index: 7

Originality Incremental advance

AI Analysis

This addresses the challenge of malicious use of AI-generated images by providing a training-free detection solution, though it is incremental as it builds on prior reconstruction-based methods.

The paper tackles the problem of detecting AI-generated images from latent diffusion models without requiring training data, by proposing HFI, a method that measures aliasing in reconstructed images to improve detection accuracy, especially for images with simple backgrounds, and shows it outperforms existing training-free methods and can be used for implicit watermarking.

Dramatic advances in the quality of the latent diffusion models (LDMs) also led to the malicious use of AI-generated images. While current AI-generated image detection methods assume the availability of real/AI-generated images for training, this is practically limited given the vast expressibility of LDMs. This motivates the training-free detection setup where no related data are available in advance. The existing LDM-generated image detection method assumes that images generated by LDM are easier to reconstruct using an autoencoder than real images. However, we observe that this reconstruction distance is overfitted to background information, leading the current method to underperform in detecting images with simple backgrounds. To address this, we propose a novel method called HFI. Specifically, by viewing the autoencoder of LDM as a downsampling-upsampling kernel, HFI measures the extent of aliasing, a distortion of high-frequency information that appears in the reconstructed image. HFI is training-free, efficient, and consistently outperforms other training-free methods in detecting challenging images generated by various generative models. We also show that HFI can successfully detect the images generated from the specified LDM as a means of implicit watermarking. HFI outperforms the best baseline method while achieving magnitudes of

View on arXiv PDF

Similar