CVJun 5

DRIFT: From Robustness Gaps to Invariance Manifolds for AI-Generated Image Detection

Abhishek Ameta, Sayan Banerjee, Shreyas Pandith, Harshit, Ankita Chatterjee, Akshay Janardan Bankar, Amit Satish Unde

arXiv:2606.069188.8

Originality Incremental advance

AI Analysis

For practitioners needing to detect AI-generated images in open-world settings with unseen generators, this method offers improved generalization and interpretability over existing approaches.

The paper introduces DRIFT, a method for AI-generated image detection that learns an invariance manifold of real images using a frozen vision foundation model and lightweight projection heads, achieving strong open-world generalization across unseen generators and resolutions, outperforming training-free robustness-based baselines.

The rapid evolution of generative image models challenges existing AI-generated image detectors, particularly in open-world settings with unseen generators. Recent training-free approaches measure robustness gaps in frozen vision foundation models (VFMs), detecting fakes via perturbation-induced embedding drift. However, these methods rely on fixed invariance geometry inherited from pretraining and lack principled adaptation to the detection task. We instead formulate AI-generated image detection as learning a structured invariance manifold of real images under one-class supervision. Building upon a frozen VFM, we introduce lightweight projection heads that decompose representation space into complementary robust and fragile subspaces. The robust subspace is explicitly trained to suppress variations induced by physically plausible imaging transformations, approximating tangent directions of a real-image manifold, while the fragile subspace retains sensitivity to edit-like perturbations. A structured ordering margin enforces hierarchical separation between physical invariance and edit-induced variability, enabling detection as a margin-violation test relative to the learned manifold. At inference, multi-scale patch-wise drift under both transformation families yields a dual-channel invariance signature and interpretable localization. Extensive experiments demonstrate strong open-world generalization across unseen generators and resolutions, consistently outperforming training-free robustness-based baselines while providing interpretable invariance-violation maps.

View on arXiv PDF

Similar