AICLFeb 6, 2024

Position: Stop Making Unscientific AGI Performance Claims

arXiv:2402.03962v310 citationsh-index: 8ICML
Originality Synthesis-oriented
AI Analysis

This addresses a methodological issue for AI researchers and the broader community by highlighting unscientific claims about AGI, though it is incremental as it critiques existing practices rather than introducing new techniques.

The paper argues that finding meaningful patterns in AI model latent spaces, such as those in large language models, is not evidence of artificial general intelligence (AGI), as similar patterns occur in simpler models like random projections and autoencoders, and it calls for caution in interpreting such correlations as signs of AGI.

Developments in the field of Artificial Intelligence (AI), and particularly large language models (LLMs), have created a 'perfect storm' for observing 'sparks' of Artificial General Intelligence (AGI) that are spurious. Like simpler models, LLMs distill meaningful representations in their latent embeddings that have been shown to correlate with external variables. Nonetheless, the correlation of such representations has often been linked to human-like intelligence in the latter but not the former. We probe models of varying complexity including random projections, matrix decompositions, deep autoencoders and transformers: all of them successfully distill information that can be used to predict latent or external variables and yet none of them have previously been linked to AGI. We argue and empirically demonstrate that the finding of meaningful patterns in latent spaces of models cannot be seen as evidence in favor of AGI. Additionally, we review literature from the social sciences that shows that humans are prone to seek such patterns and anthropomorphize. We conclude that both the methodological setup and common public image of AI are ideal for the misinterpretation that correlations between model representations and some variables of interest are 'caused' by the model's understanding of underlying 'ground truth' relationships. We, therefore, call for the academic community to exercise extra caution, and to be keenly aware of principles of academic integrity, in interpreting and communicating about AI research outcomes.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes