The Language of Interoception: Examining Embodiment and Emotion Through a Corpus of Body Part Mentions
This work addresses how body-related language reflects emotional and health states, offering insights for NLP, affective sciences, and wellbeing research, though it is incremental in applying existing methods to new data.
This paper investigated the connection between emotion, embodiment, and everyday language by analyzing body part mentions in online English text, finding that such mentions are common (5-10% of posts), emotionally charged, and correlated with poorer health outcomes.
This paper is the first investigation of the connection between emotion, embodiment, and everyday language in a large sample of natural language data. We created corpora of body part mentions (BPMs) in online English text (blog posts and tweets). This includes a subset featuring human annotations for the emotions of the person whose body part is mentioned in the text. We show that BPMs are common in personal narratives and tweets (~5% to 10% of posts include BPMs) and that their usage patterns vary markedly by time and %geographic location. Using word-emotion association lexicons and our annotated data, we show that text containing BPMs tends to be more emotionally charged, even when the BPM is not explicitly used to describe a physical reaction to the emotion in the text. Finally, we discover a strong and statistically significant correlation between body-related language and a variety of poorer health outcomes. In sum, we argue that investigating the role of body-part related words in language can open up valuable avenues of future research at the intersection of NLP, the affective sciences, and the study of human wellbeing.