CLEMFeb 17

Extracting Consumer Insight from Text: A Large Language Model Approach to Emotion and Evaluation Measurement

arXiv:2602.15312v12 citationsh-index: 43
Originality Incremental advance
AI Analysis

This provides a scalable method for marketing research to extract consumer insights from text, though it is incremental as it applies existing LLM fine-tuning to a specific domain.

This study tackled the challenge of measuring consumer emotions and evaluations from unstructured text by introducing LX, a fine-tuned large language model that achieved 81% macro-F1 accuracy on survey responses and over 95% accuracy on annotated reviews, and showed that emotions predict product ratings and purchase behavior.

Accurately measuring consumer emotions and evaluations from unstructured text remains a core challenge for marketing research and practice. This study introduces the Linguistic eXtractor (LX), a fine-tuned, large language model trained on consumer-authored text that also has been labeled with consumers' self-reported ratings of 16 consumption-related emotions and four evaluation constructs: trust, commitment, recommendation, and sentiment. LX consistently outperforms leading models, including GPT-4 Turbo, RoBERTa, and DeepSeek, achieving 81% macro-F1 accuracy on open-ended survey responses and greater than 95% accuracy on third-party-annotated Amazon and Yelp reviews. An application of LX to online retail data, using seemingly unrelated regression, affirms that review-expressed emotions predict product ratings, which in turn predict purchase behavior. Most emotional effects are mediated by product ratings, though some emotions, such as discontent and peacefulness, influence purchase directly, indicating that emotional tone provides meaningful signals beyond star ratings. To support its use, a no-code, cost-free, LX web application is available, enabling scalable analyses of consumer-authored text. In establishing a new methodological foundation for consumer perception measurement, this research demonstrates new methods for leveraging large language models to advance marketing research and practice, thereby achieving validated detection of marketing constructs from consumer data.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes