CVAINov 1, 2025

Detecting AI-Generated Images via Diffusion Snap-Back Reconstruction: A Forensic Approach

arXiv:2511.00352v1h-index: 1
Originality Incremental advance
AI Analysis

This addresses the challenge of synthetic media forensics for security and verification applications, though it is incremental as it builds on existing diffusion-based methods.

The paper tackles the problem of detecting AI-generated images from modern diffusion models like Stable Diffusion and DALL-E, achieving 0.993 AUROC on a dataset of 4,000 images with robustness to distortions.

The rapid rise of generative diffusion models has made distinguishing authentic visual content from synthetic imagery increasingly challenging. Traditional deepfake detection methods, which rely on frequency or pixel-level artifacts, fail against modern text-to-image systems such as Stable Diffusion and DALL-E that produce photorealistic and artifact-free results. This paper introduces a diffusion-based forensic framework that leverages multi-strength image reconstruction dynamics, termed diffusion snap-back, to identify AI-generated images. By analysing how reconstruction metrics (LPIPS, SSIM, and PSNR) evolve across varying noise strengths, we extract interpretable manifold-based features that differentiate real and synthetic images. Evaluated on a balanced dataset of 4,000 images, our approach achieves 0.993 AUROC under cross-validation and remains robust to common distortions such as compression and noise. Despite using limited data and a single diffusion backbone (Stable Diffusion v1.5), the proposed method demonstrates strong generalization and interpretability, offering a foundation for scalable, model-agnostic synthetic media forensics.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes