CL AIApr 4, 2024

Concept -- An Evaluation Protocol on Conversational Recommender Systems with System-centric and User-centric Factors

Chen Huang, Peixin Qin, Yang Deng, Wenqiang Lei, Jiancheng Lv, Tat-Seng Chua

arXiv:2404.03304v38.718 citationsh-index: 30Has Code

Originality Incremental advance

AI Analysis

This addresses the need for more comprehensive evaluation in CRS research, though it is incremental as it builds on existing protocols by adding user-centric aspects.

The authors tackled the problem of evaluating conversational recommender systems (CRS) by proposing a new protocol called Concept that integrates both system-centric and user-centric factors, resulting in a framework that identifies weaknesses in current models and highlights low usability in ChatGPT.

The conversational recommendation system (CRS) has been criticized regarding its user experience in real-world scenarios, despite recent significant progress achieved in academia. Existing evaluation protocols for CRS may prioritize system-centric factors such as effectiveness and fluency in conversation while neglecting user-centric aspects. Thus, we propose a new and inclusive evaluation protocol, Concept, which integrates both system- and user-centric factors. We conceptualise three key characteristics in representing such factors and further divide them into six primary abilities. To implement Concept, we adopt a LLM-based user simulator and evaluator with scoring rubrics that are tailored for each primary ability. Our protocol, Concept, serves a dual purpose. First, it provides an overview of the pros and cons in current CRS models. Second, it pinpoints the problem of low usability in the "omnipotent" ChatGPT and offers a comprehensive reference guide for evaluating CRS, thereby setting the foundation for CRS improvement.

View on arXiv PDF Code

Similar