The Illusion of Agreement with ChatGPT: Sycophancy and Beyond
This research addresses AI-induced harms to users and other stakeholders and highlights the need for coordinated interventions. Its contribution is incremental: it builds on existing concerns by examining how users experience them.
The study analyzes Reddit discussions to investigate user-reported concerns about harms arising from ChatGPT behaviors such as sycophancy and gaslighting. It identifies five distinct concerns spanning personal to societal domains and documents user-driven suggestions for mitigation.
While concerns about ChatGPT-induced harms due to sycophancy and other behaviors, including gaslighting, have grown among researchers, how users themselves experience and mitigate these harms remains largely underexplored. We analyze Reddit discussions to investigate what concerns users report and how they address them. Our findings reveal five distinct user-reported concerns that manifest across multiple life domains, ranging from personal to societal: inducing delusion, digressing narratives, implicating users for models' limitations, inducing addiction, and providing unsupervised psychological support. We document three tiers of user-driven suggestions spanning functional usage techniques, behavioral approaches, and private and institutional safeguards. Our findings show that AI-induced harms require coordinated interventions across users, developers, and policymakers. We discuss design implications and future directions for mitigating these harms and ensuring that users benefit.