CVJan 28

Exploiting the Final Component of Generator Architectures for AI-Generated Image Detection

arXiv:2601.20461v1h-index: 1
Originality Highly original
AI Analysis

This addresses the need for reliable detection of AI-generated images in online environments, offering a novel approach with strong generalization capabilities.

The paper tackles the problem of poor generalization in AI-generated image detection by exploiting common final components in image generators, achieving an average accuracy of 98.83% across 22 unseen generators using only 100 samples per category.

With the rapid proliferation of powerful image generators, accurate detection of AI-generated images has become essential for maintaining a trustworthy online environment. However, existing deepfake detectors often generalize poorly to images produced by unseen generators. Notably, despite being trained under vastly different paradigms, such as diffusion or autoregressive modeling, many modern image generators share common final architectural components that serve as the last stage for converting intermediate representations into images. Motivated by this insight, we propose to "contaminate" real images using the generator's final component and train a detector to distinguish them from the original real images. We further introduce a taxonomy based on generators' final components and categorize 21 widely used generators accordingly, enabling a comprehensive investigation of our method's generalization capability. Using only 100 samples from each of three representative categories, our detector-fine-tuned on the DINOv3 backbone-achieves an average accuracy of 98.83% across 22 testing sets from unseen generators.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes