AICRMar 21, 2024

Can ChatGPT Detect DeepFakes? A Study of Using Multimodal Large Language Models for Media Forensics

arXiv:2403.14077v496 citationsh-index: 112024 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)
Originality Synthesis-oriented
AI Analysis

This addresses the need for accessible DeepFake detection tools, but it is incremental as it applies existing LLMs to a new domain without major methodological innovation.

The study tackled the problem of detecting AI-generated media (DeepFakes) by investigating multimodal large language models (LLMs), showing they can expose AI-generated images through experimental design and prompt engineering without requiring programming.

DeepFakes, which refer to AI-generated media content, have become an increasing concern due to their use as a means for disinformation. Detecting DeepFakes is currently solved with programmed machine learning algorithms. In this work, we investigate the capabilities of multimodal large language models (LLMs) in DeepFake detection. We conducted qualitative and quantitative experiments to demonstrate multimodal LLMs and show that they can expose AI-generated images through careful experimental design and prompt engineering. This is interesting, considering that LLMs are not inherently tailored for media forensic tasks, and the process does not require programming. We discuss the limitations of multimodal LLMs for these tasks and suggest possible improvements.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes