CLJan 31, 2024

I Think, Therefore I am: Benchmarking Awareness of Large Language Models Using AwareBench

arXiv:2401.17882v218 citationsh-index: 17Has Code
Originality Synthesis-oriented
AI Analysis

This work addresses the problem of assessing AI awareness for researchers and developers, contributing incrementally to benchmarks for AI alignment and safety.

The authors tackled the problem of evaluating awareness in large language models (LLMs) by introducing AwareBench, a benchmark based on psychological and philosophical theories, and found that most of 13 tested LLMs struggle to recognize their capabilities and missions but show decent social intelligence.

Do large language models (LLMs) exhibit any forms of awareness similar to humans? In this paper, we introduce AwareBench, a benchmark designed to evaluate awareness in LLMs. Drawing from theories in psychology and philosophy, we define awareness in LLMs as the ability to understand themselves as AI models and to exhibit social intelligence. Subsequently, we categorize awareness in LLMs into five dimensions, including capability, mission, emotion, culture, and perspective. Based on this taxonomy, we create a dataset called AwareEval, which contains binary, multiple-choice, and open-ended questions to assess LLMs' understandings of specific awareness dimensions. Our experiments, conducted on 13 LLMs, reveal that the majority of them struggle to fully recognize their capabilities and missions while demonstrating decent social intelligence. We conclude by connecting awareness of LLMs with AI alignment and safety, emphasizing its significance to the trustworthy and ethical development of LLMs. Our dataset and code are available at https://github.com/HowieHwong/Awareness-in-LLM.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes