CVSep 12, 2025

GAMMA: Generalizable Alignment via Multi-task and Manipulation-Augmented Training for AI-Generated Image Detection

Haozhen Yan, Yan Hong, Suning Lang, Jiahui Zhan, Yikun Ji, Yujie Gao, Jun Lan, Huijia Zhu, Weiqiang Wang, Jianfu Zhang

arXiv:2509.10250v16.21 citationsh-index: 8

Originality Incremental advance

AI Analysis

This addresses the challenge of detecting AI-generated images across diverse and evolving generative models, which is crucial for security and media integrity, though it is an incremental improvement over existing methods.

The paper tackles the problem of poor generalization in AI-generated image detectors to unseen generative models by proposing GAMMA, a training framework that reduces domain bias and enhances semantic alignment, achieving state-of-the-art generalization with a 5.8% accuracy improvement on the GenImage benchmark and strong robustness on new models like GPT-4o.

With generative models becoming increasingly sophisticated and diverse, detecting AI-generated images has become increasingly challenging. While existing AI-genereted Image detectors achieve promising performance on in-distribution generated images, their generalization to unseen generative models remains limited. This limitation is largely attributed to their reliance on generation-specific artifacts, such as stylistic priors and compression patterns. To address these limitations, we propose GAMMA, a novel training framework designed to reduce domain bias and enhance semantic alignment. GAMMA introduces diverse manipulation strategies, such as inpainting-based manipulation and semantics-preserving perturbations, to ensure consistency between manipulated and authentic content. We employ multi-task supervision with dual segmentation heads and a classification head, enabling pixel-level source attribution across diverse generative domains. In addition, a reverse cross-attention mechanism is introduced to allow the segmentation heads to guide and correct biased representations in the classification branch. Our method achieves state-of-the-art generalization performance on the GenImage benchmark, imporving accuracy by 5.8%, but also maintains strong robustness on newly released generative model such as GPT-4o.

View on arXiv PDF

Similar