CLAIMay 30, 2025

Multi-Domain ABSA Conversation Dataset Generation via LLMs for Real-World Evaluation and Model Comparison

arXiv:2505.24701v13 citationsh-index: 3
Originality Synthesis-oriented
AI Analysis

This addresses the problem of limited real-world labeled data for ABSA researchers, but it is incremental as it applies an existing LLM method to a new domain-specific task.

The paper tackled the scarcity of diverse labeled datasets for Aspect-Based Sentiment Analysis (ABSA) by generating synthetic data using GPT-4o, and found that the data effectively enabled evaluation of three LLMs, with DeepSeekR1 showing higher precision, Gemini 1.5 Pro and Claude 3.5 Sonnet having strong recall, and Gemini 1.5 Pro offering significantly faster inference.

Aspect-Based Sentiment Analysis (ABSA) offers granular insights into opinions but often suffers from the scarcity of diverse, labeled datasets that reflect real-world conversational nuances. This paper presents an approach for generating synthetic ABSA data using Large Language Models (LLMs) to address this gap. We detail the generation process aimed at producing data with consistent topic and sentiment distributions across multiple domains using GPT-4o. The quality and utility of the generated data were evaluated by assessing the performance of three state-of-the-art LLMs (Gemini 1.5 Pro, Claude 3.5 Sonnet, and DeepSeek-R1) on topic and sentiment classification tasks. Our results demonstrate the effectiveness of the synthetic data, revealing distinct performance trade-offs among the models: DeepSeekR1 showed higher precision, Gemini 1.5 Pro and Claude 3.5 Sonnet exhibited strong recall, and Gemini 1.5 Pro offered significantly faster inference. We conclude that LLM-based synthetic data generation is a viable and flexible method for creating valuable ABSA resources, facilitating research and model evaluation without reliance on limited or inaccessible real-world labeled data.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes