CLAIApr 4, 2024

Concept -- An Evaluation Protocol on Conversational Recommender Systems with System-centric and User-centric Factors

arXiv:2404.03304v318 citationsh-index: 30
Originality Incremental advance
AI Analysis

This addresses the need for more comprehensive evaluation in CRS research, though it is incremental as it builds on existing protocols by adding user-centric aspects.

The authors tackled the problem of evaluating conversational recommender systems (CRS) by proposing a new protocol called Concept that integrates both system-centric and user-centric factors, resulting in a framework that identifies weaknesses in current models and highlights low usability in ChatGPT.

The conversational recommendation system (CRS) has been criticized regarding its user experience in real-world scenarios, despite recent significant progress achieved in academia. Existing evaluation protocols for CRS may prioritize system-centric factors such as effectiveness and fluency in conversation while neglecting user-centric aspects. Thus, we propose a new and inclusive evaluation protocol, Concept, which integrates both system- and user-centric factors. We conceptualise three key characteristics in representing such factors and further divide them into six primary abilities. To implement Concept, we adopt a LLM-based user simulator and evaluator with scoring rubrics that are tailored for each primary ability. Our protocol, Concept, serves a dual purpose. First, it provides an overview of the pros and cons in current CRS models. Second, it pinpoints the problem of low usability in the "omnipotent" ChatGPT and offers a comprehensive reference guide for evaluating CRS, thereby setting the foundation for CRS improvement.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes