Any-Resolution AI-Generated Image Detection by Spectral Learning
This work addresses the challenge of generalizing AI-generated image detection to unseen models, which is crucial for applications like content moderation and forensics, though it builds incrementally on spectral artifact methods.
The paper tackles the problem of detecting AI-generated images by leveraging the invariant spectral distribution of real images, achieving a 5.5% absolute improvement in AUC over the previous state-of-the-art across 13 generative models.
Recent works have established that AI models introduce spectral artifacts into generated images and propose approaches for learning to capture them using labeled data. However, the significant differences in such artifacts among different generative models hinder these approaches from generalizing to generators not seen during training. In this work, we build upon the key idea that the spectral distribution of real images constitutes both an invariant and highly discriminative pattern for AI-generated image detection. To model this under a self-supervised setup, we employ masked spectral learning using the pretext task of frequency reconstruction. Since generated images constitute out-of-distribution samples for this model, we propose spectral reconstruction similarity to capture this divergence. Moreover, we introduce spectral context attention, which enables our approach to efficiently capture subtle spectral inconsistencies in images of any resolution. Our spectral AI-generated image detection approach (SPAI) achieves a 5.5% absolute improvement in AUC over the previous state-of-the-art across 13 recent generative approaches, while exhibiting robustness against common online perturbations. Code is available on https://mever-team.github.io/spai.