HC | AI | CL | Feb 22, 2024

Is ChatGPT More Empathetic than Humans?

arXiv:2403.05572v1 | 29 citations | h-index: 11 | Has Code
Originality: Incremental advance
AI Analysis

This study addresses the problem of evaluating AI empathy for researchers and developers, offering a scalable framework, but it is incremental as it builds on existing comparisons of AI and human capabilities.

This paper investigated whether ChatGPT, specifically GPT-4, is more empathetic than humans by comparing responses to emotional scenarios, finding that ChatGPT's average empathy rating exceeded human responses by about 10% and that specific empathy prompts made its responses align 5 times more closely with high-empathy expectations.

This paper investigates the empathetic responding capabilities of ChatGPT, particularly its latest iteration, GPT-4, in comparison to human-generated responses to a wide range of emotional scenarios, both positive and negative. We employ a rigorous evaluation methodology, involving a between-groups study with 600 participants, to evaluate the level of empathy in responses generated by humans and ChatGPT. ChatGPT is prompted in two distinct ways: a standard approach and one explicitly detailing empathy's cognitive, affective, and compassionate counterparts. Our findings indicate that the average empathy rating of responses generated by ChatGPT exceeds those crafted by humans by approximately 10%. Additionally, instructing ChatGPT to incorporate a clear understanding of empathy in its responses makes the responses align approximately 5 times more closely with the expectations of individuals possessing a high degree of empathy, compared to human responses. The proposed evaluation framework serves as a scalable and adaptable framework to assess the empathetic capabilities of newer and updated versions of large language models, eliminating the need to replicate the current study's results in future research.
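The "approximately 10%" figure above reads most naturally as a relative difference between two group means. The sketch below shows that arithmetic only. The rating values are hypothetical placeholders, not data from the study, and the function name `relative_mean_difference` is invented for illustration. The abstract does not state the rating scale or the exact computation, so treating the figure as (mean ChatGPT rating - mean human rating) / mean human rating is an assumption.

```python
from statistics import mean


def relative_mean_difference(group_a, group_b):
    """Return (mean(group_a) - mean(group_b)) / mean(group_b).

    Assumption: this is one plausible reading of "exceeds by about 10%";
    the source does not spell out the calculation.
    """
    baseline = mean(group_b)
    if baseline == 0:
        raise ValueError("baseline mean is zero; relative difference is undefined")
    return (mean(group_a) - baseline) / baseline


# Hypothetical ratings for illustration only (NOT taken from the paper).
human_ratings = [2.0, 3.0, 2.5, 2.5]
chatgpt_ratings = [3.0, 2.5, 3.0, 2.5]

lift = relative_mean_difference(chatgpt_ratings, human_ratings)
print(f"Relative difference in mean rating: {lift:.1%}")
```

Because the study is between-groups, each rater contributes to only one condition, so group means are compared directly rather than through paired differences.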

Code Implementations: 1 repo
