AIMay 7

CrossCult-KIBench: A Benchmark for Cross-Cultural Knowledge Insertion in MLLMs

arXiv:2605.0611564.1
AI Analysis

For researchers developing culturally-aware MLLMs, this benchmark highlights the challenge of effective knowledge insertion without side effects, but the contribution is incremental as it primarily provides a new evaluation tool.

The paper introduces CrossCult-KIBench, a benchmark with 9,800 cases across 49 cultural scenarios in English, Chinese, and Arabic, to evaluate cross-cultural knowledge insertion in MLLMs. Experiments show current methods fail to balance cultural adaptation with preserving original behavior.

Multimodal Large Language Models (MLLMs), trained primarily on English-centric data, frequently generate culturally inappropriate or misaligned responses in cross-cultural settings. To mitigate this, we introduce the task of cross-cultural knowledge insertion, which focuses on adapting models to specific cultural contexts while preserving their original behavior in other cultures. To facilitate research in this area, we introduce CrossCult-KIBench, a comprehensive evaluation benchmark for assessing both the effectiveness of knowledge insertion and its unintended side effects on non-target cultures. The benchmark includes 9,800 image-grounded cases covering 49 culturally relevant visual scenarios across English, Chinese, and Arabic language-culture groups. It supports evaluation in both single-insert and sequential-insert settings. We also propose Memory-Conditioned Knowledge Insertion (MCKI) as a baseline method. MCKI retrieves relevant cultural knowledge from an external memory using frozen MLLM representations, prepending matched entries as conditional prompts when applicable. Extensive experiments on CrossCult-KIBench reveal that current approaches struggle to balance effective cultural adaptation with behavioral preservation, highlighting a key challenge in developing culturally-aware MLLMs. Our work thus underscores an important research direction for developing more culturally adaptive and responsible MLLMs.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes