CVAILGMar 26, 2024

Fake or JPEG? Revealing Common Biases in Generated Image Detection Datasets

arXiv:2403.17608v243 citationsh-index: 17ECCV Workshops
Originality Incremental advance
AI Analysis

This work addresses biases in datasets for AI-generated image detection, which is crucial for combating misinformation, but it is incremental as it focuses on improving existing datasets rather than introducing a new detection method.

The paper identified that many AI-generated image detection datasets contain biases related to JPEG compression and image size, which affect detector performance. By removing these biases, they improved cross-generator performance by over 11 percentage points for ResNet50 and Swin-T detectors on the GenImage dataset, achieving state-of-the-art results.

The widespread adoption of generative image models has highlighted the urgent need to detect artificial content, which is a crucial step in combating widespread manipulation and misinformation. Consequently, numerous detectors and associated datasets have emerged. However, many of these datasets inadvertently introduce undesirable biases, thereby impacting the effectiveness and evaluation of detectors. In this paper, we emphasize that many datasets for AI-generated image detection contain biases related to JPEG compression and image size. Using the GenImage dataset, we demonstrate that detectors indeed learn from these undesired factors. Furthermore, we show that removing the named biases substantially increases robustness to JPEG compression and significantly alters the cross-generator performance of evaluated detectors. Specifically, it leads to more than 11 percentage points increase in cross-generator performance for ResNet50 and Swin-T detectors on the GenImage dataset, achieving state-of-the-art results. We provide the dataset and source codes of this paper on the anonymous website: https://www.unbiased-genimage.org

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes