CLOct 9, 2025

Neuron-Level Analysis of Cultural Understanding in Large Language Models

Taisei Yamamoto, Ryoma Kumon, Danushka Bollegala, Hitomi Yanaka

arXiv:2510.08284v14.91 citationsh-index: 14Has Code

Originality Incremental advance

AI Analysis

This work addresses cultural fairness in LLMs for global deployment, offering insights into internal mechanisms and practical guidance, though it is incremental as it builds on existing neuron analysis methods.

The researchers tackled the problem of cultural bias and limited awareness in large language models by conducting a neuron-level analysis to identify neurons driving cultural behavior, finding that suppressing these neurons degrades cultural benchmark performance by up to 30% while general NLU performance remains unaffected.

As large language models (LLMs) are increasingly deployed worldwide, ensuring their fair and comprehensive cultural understanding is important. However, LLMs exhibit cultural bias and limited awareness of underrepresented cultures, while the mechanisms underlying their cultural understanding remain underexplored. To fill this gap, we conduct a neuron-level analysis to identify neurons that drive cultural behavior, introducing a gradient-based scoring method with additional filtering for precise refinement. We identify both culture-general neurons contributing to cultural understanding regardless of cultures, and culture-specific neurons tied to an individual culture. These neurons account for less than 1% of all neurons and are concentrated in shallow to middle MLP layers. We validate their role by showing that suppressing them substantially degrades performance on cultural benchmarks (by up to 30%), while performance on general natural language understanding (NLU) benchmarks remains largely unaffected. Moreover, we show that culture-specific neurons support knowledge of not only the target culture, but also related cultures. Finally, we demonstrate that training on NLU benchmarks can diminish models' cultural understanding when we update modules containing many culture-general neurons. These findings provide insights into the internal mechanisms of LLMs and offer practical guidance for model training and engineering. Our code is available at https://github.com/ynklab/CULNIG

View on arXiv PDF Code

Similar