CLFeb 25, 2025

What are Foundation Models Cooking in the Post-Soviet World?

Anton Lavrouk, Tarek Naous, Alan Ritter, Wei Xu

arXiv:2502.18583v38.32 citationsh-index: 14Has CodeEMNLP

Originality Synthesis-oriented

AI Analysis

This addresses cultural bias in AI for Post-Soviet regions, but it is incremental as it focuses on a specific domain and dataset.

The study tackled the problem of foundation models' cultural food knowledge in Post-Soviet states by creating the BORSch dataset and found that models struggle to correctly identify dish origins, over-predicting based on language, with weak correlation between QA and visual description tasks.

The culture of the Post-Soviet states is complex, shaped by a turbulent history that continues to influence current events. In this study, we investigate the Post-Soviet cultural food knowledge of foundation models by constructing BORSch, a multimodal dataset encompassing 1147 and 823 dishes in the Russian and Ukrainian languages, centered around the Post-Soviet region. We demonstrate that leading models struggle to correctly identify the origins of dishes from Post-Soviet nations in both text-only and multimodal Question Answering (QA), instead over-predicting countries linked to the language the question is asked in. Through analysis of pretraining data, we show that these results can be explained by misleading dish-origin co-occurrences, along with linguistic phenomena such as Russian-Ukrainian code mixing. Finally, to move beyond QA-based assessments, we test models' abilities to produce accurate visual descriptions of dishes. The weak correlation between this task and QA suggests that QA alone may be insufficient as an evaluation of cultural understanding. To foster further research, we will make BORSch publicly available at https://github.com/alavrouk/BORSch.

View on arXiv PDF Code

Similar