CLAICYOct 20, 2023

POSQA: Probe the World Models of LLMs with Size Comparisons

DeepMind
arXiv:2310.13394v1132 citationsh-index: 31
Originality Incremental advance
AI Analysis

This work addresses the need to verify LLMs' embodied comprehension for AI safety and alignment, though it is incremental as it builds on existing probing methods.

The researchers tackled the problem of assessing large language models' real-world understanding by creating a dataset of physical object size comparison questions, finding that even the largest models perform poorly in zero-shot settings and are vulnerable to prompt deception.

Embodied language comprehension emphasizes that language understanding is not solely a matter of mental processing in the brain but also involves interactions with the physical and social environment. With the explosive growth of Large Language Models (LLMs) and their already ubiquitous presence in our daily lives, it is becoming increasingly necessary to verify their real-world understanding. Inspired by cognitive theories, we propose POSQA: a Physical Object Size Question Answering dataset with simple size comparison questions to examine the extremity and analyze the potential mechanisms of the embodied comprehension of the latest LLMs. We show that even the largest LLMs today perform poorly under the zero-shot setting. We then push their limits with advanced prompting techniques and external knowledge augmentation. Furthermore, we investigate whether their real-world comprehension primarily derives from contextual information or internal weights and analyse the impact of prompt formats and report bias of different objects. Our results show that real-world understanding that LLMs shaped from textual data can be vulnerable to deception and confusion by the surface form of prompts, which makes it less aligned with human behaviours.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes