CLMay 7, 2024

Understanding the Capabilities and Limitations of Large Language Models for Cultural Commonsense

Siqi Shen, Lajanugen Logeswaran, Moontae Lee, Honglak Lee, Soujanya Poria, Rada Mihalcea

arXiv:2405.04655v124.077 citationsh-index: 77NAACL

Originality Incremental advance

AI Analysis

This study addresses cultural bias in LLMs, which is a problem for developing fair and inclusive AI systems, though it is incremental as it builds on existing commonsense evaluations.

The paper examined the capabilities and limitations of large language models (LLMs) in cultural commonsense tasks, finding significant performance discrepancies across cultures, with general commonsense affected by cultural context and query language impacting results.

Large language models (LLMs) have demonstrated substantial commonsense understanding through numerous benchmark evaluations. However, their understanding of cultural commonsense remains largely unexamined. In this paper, we conduct a comprehensive examination of the capabilities and limitations of several state-of-the-art LLMs in the context of cultural commonsense tasks. Using several general and cultural commonsense benchmarks, we find that (1) LLMs have a significant discrepancy in performance when tested on culture-specific commonsense knowledge for different cultures; (2) LLMs' general commonsense capability is affected by cultural context; and (3) The language used to query the LLMs can impact their performance on cultural-related tasks. Our study points to the inherent bias in the cultural understanding of LLMs and provides insights that can help develop culturally aware language models.

View on arXiv PDF

Similar