Dark Personality Traits and Online Toxicity: Linking Self-Reports to Reddit Activity
For researchers studying online behavior, this work highlights a gap between personality traits and linguistic signals, emphasizing the need for validated psychological measures in computational analyses.
This study links dark personality traits to Reddit activity via a dataset of 57K comments from 114 users, finding that dark traits predict self-reported incivility but not text-derived toxicity, while self-reported experiences of toxicity are reflected in linguistic features.
Dark personality traits have long been associated with antisocial and toxic online behaviors, yet their relationship with observable online activity remains unclear. We investigate the association between validated dark personality measures, self-reported experiences of online incivility, and linguistic and behavioral features extracted from real-world user activity. To this end, we developed a Web application that securely links responses to validated psychological questionnaires collected via Amazon Mechanical Turk with participants' Reddit activity. This yielded a dataset of nearly 57K comments (2.2M tokens) from 114 users, represented through a broad set of linguistic and behavioral features. Our analyses reveal a clear distinction between self-reported and observed behavior. Dark personality traits show consistent associations with self-reported engagement in uncivil interactions. However, no validated dark personality dimension significantly predicts text-derived toxicity or linguistic features. In contrast, self-reported experiences of engaging in or being targeted by toxic behavior are robustly reflected in users' language, exhibiting consistent associations with measures of negativity, moral framing, and emotional intensity. Taken together, these findings highlight a gap between stable personality traits and their manifestation in surface-level linguistic signals. While computational features effectively capture behavioral engagement in online incivility, they do not provide reliable proxies for underlying personality constructs within the present framework. Our results underscore the importance of grounding computational approaches in validated psychological measures and point to the need for richer, context-aware representations to better understand the relationship between personality and online behavior.