CLOct 25, 2023

Is ChatGPT a Good Multi-Party Conversation Solver?

Chao-Hong Tan, Jia-Chen Gu, Zhen-Hua Ling

arXiv:2310.16301v121.7137 citationsh-index: 20Has Code

Originality Synthesis-oriented

AI Analysis

This work addresses the challenge of applying LLMs to complex multi-party dialogues, which is important for developing more effective conversational AI agents, though it is incremental as it primarily evaluates existing models.

The paper investigates the ability of generative large language models like ChatGPT and GPT-4 to handle multi-party conversations, finding that ChatGPT performs poorly on several tasks while GPT-4 shows promising results.

Large Language Models (LLMs) have emerged as influential instruments within the realm of natural language processing; nevertheless, their capacity to handle multi-party conversations (MPCs) -- a scenario marked by the presence of multiple interlocutors involved in intricate information exchanges -- remains uncharted. In this paper, we delve into the potential of generative LLMs such as ChatGPT and GPT-4 within the context of MPCs. An empirical analysis is conducted to assess the zero-shot learning capabilities of ChatGPT and GPT-4 by subjecting them to evaluation across three MPC datasets that encompass five representative tasks. The findings reveal that ChatGPT's performance on a number of evaluated MPC tasks leaves much to be desired, whilst GPT-4's results portend a promising future. Additionally, we endeavor to bolster performance through the incorporation of MPC structures, encompassing both speaker and addressee architecture. This study provides an exhaustive evaluation and analysis of applying generative LLMs to MPCs, casting a light upon the conception and creation of increasingly effective and robust MPC agents. Concurrently, this work underscores the challenges implicit in the utilization of LLMs for MPCs, such as deciphering graphical information flows and generating stylistically consistent responses.

View on arXiv PDF Code

Similar