CLMar 11, 2024

Naming, Describing, and Quantifying Visual Objects in Humans and LLMs

arXiv:2403.06935v327 citationsh-index: 8ACL
Originality Synthesis-oriented
AI Analysis

This addresses the problem of assessing VLLMs' pragmatic language capabilities for researchers in AI and linguistics, but it is incremental as it builds on existing models and datasets.

The study evaluated Vision & Language Large Language Models (VLLMs) on their ability to mimic human variability in naming, describing, and quantifying objects in images, finding that while some models performed well for nouns and attributes, all failed at quantifiers requiring high-level reasoning.

While human speakers use a variety of different expressions when describing the same object in an image, giving rise to a distribution of plausible labels driven by pragmatic constraints, the extent to which current Vision & Language Large Language Models (VLLMs) can mimic this crucial feature of language use is an open question. This applies to common, everyday objects, but it is particularly interesting for uncommon or novel objects for which a category label may be lacking or fuzzy. Furthermore, similar patterns of variation are observed among human speakers for highly context-sensitive expressions, such as the quantifiers 'few' or 'most'. In our work, we evaluate VLLMs (FROMAGe, BLIP-2, LLaVA) on three categories (nouns, attributes, and quantifiers) where humans show great subjective variability concerning the distribution over plausible labels, using datasets and resources mostly under-explored in previous work. Our results reveal mixed evidence on the ability of VLLMs to capture human naming preferences at generation time: while some models are good at mimicking human distributions for nouns and attributes, all of them fail to assign quantifiers, a task that requires more accurate, high-level reasoning.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes