LGCVMay 29, 2025

Vision Language Models are Biased

arXiv:2505.23941v243 citationsh-index: 10
Originality Synthesis-oriented
AI Analysis

This work identifies a failure mode in VLMs that affects their reliability for visual tasks, though it is incremental as it focuses on testing biases rather than solving them.

The study tested how prior knowledge biases vision language models (VLMs) on objective visual tasks like counting and identification, finding that state-of-the-art VLMs scored only 17.05% accuracy on average across diverse domains, with accuracy nearly doubling when image backgrounds were removed.

Large language models (LLMs) memorize a vast amount of prior knowledge from the Internet that helps them on downstream tasks but also may notoriously sway their outputs towards wrong or biased answers. In this work, we test how the knowledge about popular subjects hurt the accuracy of vision language models (VLMs) on standard, objective visual tasks of counting and identification. We find that state-of-the-art VLMs are strongly biased (e.g., unable to recognize the 4th stripe has been added to a 3-stripe Adidas logo) scoring an average of 17.05% accuracy in counting (e.g., counting stripes in an Adidas-like logo) across 7 diverse domains from animals, logos, chess, board games, optical illusions, to patterned grids. Removing image backgrounds nearly doubles accuracy (21.09 percentage points), revealing that contextual visual cues trigger these biased responses. Further analysis of VLMs' reasoning patterns shows that counting accuracy initially rises with thinking tokens, reaching ~40%, before declining with excessive reasoning. Our work presents an interesting failure mode in VLMs and a human-supervised automated framework for testing VLM biases. Code and data are available at: vlmsarebiased.github.io.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes