CLSIMar 5, 2024

Scope of Large Language Models for Mining Emerging Opinions in Online Health Discourse

arXiv:2403.03336v13 citationsh-index: 7
AI Analysis

This work addresses the challenge of analyzing uncertain health claims in online communities, which is incremental as it applies existing LLM methods to a new domain-specific dataset.

The paper tackles the problem of mining emerging opinions in online health discourse by developing an LLM-powered framework for stance detection between post titles and comments, using a novel dataset on Long COVID, and shows that GPT-4 significantly outperforms prior works in zero-shot stance detection.

In this paper, we develop an LLM-powered framework for the curation and evaluation of emerging opinion mining in online health communities. We formulate emerging opinion mining as a pairwise stance detection problem between (title, comment) pairs sourced from Reddit, where post titles contain emerging health-related claims on a topic that is not predefined. The claims are either explicitly or implicitly expressed by the user. We detail (i) a method of claim identification -- the task of identifying if a post title contains a claim and (ii) an opinion mining-driven evaluation framework for stance detection using LLMs. We facilitate our exploration by releasing a novel test dataset, Long COVID-Stance, or LC-stance, which can be used to evaluate LLMs on the tasks of claim identification and stance detection in online health communities. Long Covid is an emerging post-COVID disorder with uncertain and complex treatment guidelines, thus making it a suitable use case for our task. LC-Stance contains long COVID treatment related discourse sourced from a Reddit community. Our evaluation shows that GPT-4 significantly outperforms prior works on zero-shot stance detection. We then perform thorough LLM model diagnostics, identifying the role of claim type (i.e. implicit vs explicit claims) and comment length as sources of model error.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes