CLAICYHCSep 4, 2024

More is More: Addition Bias in Large Language Models

arXiv:2409.02569v15 citationsh-index: 3
Originality Incremental advance
AI Analysis

This identifies a cognitive bias in LLMs that could increase resource use and costs, which is an incremental finding building on known human biases.

The paper investigates additive bias in Large Language Models (LLMs), finding a significant preference for additive over subtractive changes across multiple models, with examples like Llama 3.1 adding letters 97.85% of the time in a palindrome task and GPT-3.5 Turbo adding bricks 76.38% of the time in a balancing task.

In this paper, we investigate the presence of additive bias in Large Language Models (LLMs), drawing a parallel to the cognitive bias observed in humans where individuals tend to favor additive over subtractive changes. Using a series of controlled experiments, we tested various LLMs, including GPT-3.5 Turbo, Claude 3.5 Sonnet, Mistral, Math$Σ$tral, and Llama 3.1, on tasks designed to measure their propensity for additive versus subtractive modifications. Our findings demonstrate a significant preference for additive changes across all tested models. For example, in a palindrome creation task, Llama 3.1 favored adding letters 97.85% of the time over removing them. Similarly, in a Lego tower balancing task, GPT-3.5 Turbo chose to add a brick 76.38% of the time rather than remove one. In a text summarization task, Mistral 7B produced longer summaries in 59.40% to 75.10% of cases when asked to improve its own or others' writing. These results indicate that, similar to humans, LLMs exhibit a marked additive bias, which might have implications when LLMs are used on a large scale. Addittive bias might increase resource use and environmental impact, leading to higher economic costs due to overconsumption and waste. This bias should be considered in the development and application of LLMs to ensure balanced and efficient problem-solving approaches.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes