CVCYSIAug 8, 2024

Can GPT-4 Models Detect Misleading Visualizations?

arXiv:2408.12617v116 citationsh-index: 13
Originality Synthesis-oriented
AI Analysis

This addresses the problem of visual misinformation online, particularly during critical events, but is incremental as it applies existing models to a new task with prompt engineering.

The study investigated whether GPT-4 models can detect misleading visualizations, finding they achieve moderate accuracy in naive zero-shot settings and improve with guided prompts, though effectiveness varies by misleader type.

The proliferation of misleading visualizations online, particularly during critical events like public health crises and elections, poses a significant risk. This study investigates the capability of GPT-4 models (4V, 4o, and 4o mini) to detect misleading visualizations. Utilizing a dataset of tweet-visualization pairs containing various visual misleaders, we test these models under four experimental conditions with different levels of guidance. We show that GPT-4 models can detect misleading visualizations with moderate accuracy without prior training (naive zero-shot) and that performance notably improves when provided with definitions of misleaders (guided zero-shot). However, a single prompt engineering technique does not yield the best results for all misleader types. Specifically, providing the models with misleader definitions and examples (guided few-shot) proves more effective for reasoning misleaders, while guided zero-shot performs better for design misleaders. This study underscores the feasibility of using large vision-language models to detect visual misinformation and the importance of prompt engineering for optimized detection accuracy.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes