CV | AI | CL | Feb 11, 2025

RusCode: Russian Cultural Code Benchmark for Text-to-Image Generation

arXiv:2502.07455v1 | 12 citations | h-index: 7 | NAACL
Originality: Incremental advance
AI Analysis

This work addresses cultural bias in text-to-image generation for Russian-speaking users, an incremental step toward making these models more culturally aware.

The authors tackle cultural bias in text-to-image generation models by proposing a benchmark for evaluating the quality of generated images that contain elements of the Russian cultural code, built on a dataset of 1250 text prompts. The results show the limitations of popular generative models in representing Russian visual concepts.

Text-to-image generation models have gained popularity among users around the world. However, many of these models exhibit a strong bias toward English-speaking cultures, ignoring or misrepresenting the unique characteristics of other language groups, countries, and nationalities. This lack of cultural awareness can reduce generation quality and lead to undesirable consequences such as unintentional insult and the spread of prejudice. In contrast to natural language processing, cultural awareness in computer vision has not been explored as extensively. In this paper, we strive to reduce this gap. We propose RusCode, a benchmark for evaluating the quality of text-to-image generation for prompts containing elements of the Russian cultural code. To do this, we form a list of 19 categories that best represent the features of Russian visual culture. Our final dataset consists of 1250 text prompts in Russian and their translations into English. The prompts cover a wide range of topics, including complex concepts from art, popular culture, folk traditions, famous people's names, natural objects, scientific achievements, etc. We present the results of a human side-by-side evaluation of how popular generative models represent Russian visual concepts.
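As an illustration of how a side-by-side human evaluation like the one described above might be summarized, here is a minimal sketch that tallies per-category preference shares from pairwise votes. The vote format, the labels `"A"`, `"B"`, and `"tie"`, the function name `win_rates`, and the example category names are all assumptions made for this sketch; they are not taken from the paper or its repository, and the example votes are made-up placeholders, not results.

```python
from collections import Counter, defaultdict


def win_rates(votes):
    """Return per-category shares of votes for model A, model B, and ties.

    `votes` is an iterable of (category, winner) pairs, where winner is
    one of "A", "B", or "tie". This format is hypothetical.
    """
    counts = defaultdict(Counter)
    for category, winner in votes:
        counts[category][winner] += 1

    rates = {}
    for category, counter in counts.items():
        total = sum(counter.values())
        # A Counter returns 0 for labels that never occurred.
        rates[category] = {
            label: counter[label] / total for label in ("A", "B", "tie")
        }
    return rates


# Placeholder votes for illustration only.
example_votes = [
    ("folk traditions", "A"),
    ("folk traditions", "B"),
    ("folk traditions", "A"),
    ("art", "tie"),
    ("art", "B"),
]

rates = win_rates(example_votes)
for category, shares in rates.items():
    print(category, shares)
```

Reporting ties separately, rather than splitting them between the two models, keeps the share of "no preference" judgments visible in each category.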

Code Implementations: 1 repo