CVMay 30

FiSeR: Fine-Grained Source Representations for Cross-Domain AI Image Detection

arXiv:2606.0060681.2Has Code
Predicted impact top 26% in CV · last 90 daysOriginality Incremental advance
AI Analysis

Addresses poor generalization of synthetic image detectors under domain shift, a key problem for forensic and security applications.

FiSeR improves cross-domain AI image detection by learning transferable representations via hierarchical contrastive learning, achieving +10.22 AUROC gain over DIRE on four benchmarks and +10.64/+17.41 AUROC gain on AIGIBench/Chameleon with few-shot adaptation.

Real-world synthetic image detectors often generalize poorly under domain shift despite strong in-domain performance. Using unsupervised UMAP projections, we find that natural and synthetic features remain partially separable on unseen datasets, yet performance still drops, suggesting that the classification head overfits to training-domain artifacts. Therefore, the key is to learn more transferable representations so that the decision criterion is more stable and robust to domain shifts. Based on the structural fact that synthetic images are produced by diverse generators, we propose a hierarchical contrastive learning framework that improves the separability between natural and synthetic images while preserving generator identity information. It jointly optimizes (i) a coarse contrastive objective between natural and synthetic images and (ii) a fine contrastive objective among synthetic images using generator identities. Trained on WildFake, our method achieves an average AUROC gain of +10.22 on cross-domain evaluation over Chameleon, AIGIBench, Community Forensics, and GenImage under the same settings as the strong baseline DIRE. For few-shot adaptation, we freeze the backbone and fit an SVM head on 10 labeled samples per class, improving AUROC by +10.64 on AIGIBench and +17.41 on Chameleon, averaged over 12 widely used detectors. Our code is publicly available at: https://github.com/heyongxin233/FiSeR.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes