CVJan 16, 2024

Human vs. LMMs: Exploring the Discrepancy in Emoji Interpretation and Usage in Digital Communication

Hanjia Lyu, Weihong Qi, Zhongyu Wei, Jiebo Luo

arXiv:2401.08212v29.67 citationsHas CodeICWSM

Originality Synthesis-oriented

AI Analysis

It addresses a gap in understanding how advanced models like GPT-4V handle emojis in online interactions, which is incremental as it focuses on a specific aspect of multimodal AI.

This study investigated the discrepancy between human and GPT-4V behaviors in interpreting and using emojis in digital communication, revealing a discernible gap likely due to cultural biases and training limitations.

Leveraging Large Multimodal Models (LMMs) to simulate human behaviors when processing multimodal information, especially in the context of social media, has garnered immense interest due to its broad potential and far-reaching implications. Emojis, as one of the most unique aspects of digital communication, are pivotal in enriching and often clarifying the emotional and tonal dimensions. Yet, there is a notable gap in understanding how these advanced models, such as GPT-4V, interpret and employ emojis in the nuanced context of online interaction. This study intends to bridge this gap by examining the behavior of GPT-4V in replicating human-like use of emojis. The findings reveal a discernible discrepancy between human and GPT-4V behaviors, likely due to the subjective nature of human interpretation and the limitations of GPT-4V's English-centric training, suggesting cultural biases and inadequate representation of non-English cultures.

View on arXiv PDF Code

Similar