CLOct 11, 2023

Parrot: Enhancing Multi-Turn Instruction Following for Large Language Models

Yuchong Sun, Che Liu, Kun Zhou, Jinwen Huang, Ruihua Song, Wayne Xin Zhao, Fuzheng Zhang, Di Zhang, Kun Gai

arXiv:2310.07301v212.238 citationsh-index: 15Has Code

Originality Incremental advance

AI Analysis

This addresses the need for better multi-turn interaction capabilities in LLMs for users seeking complex, conversational AI assistance, though it is incremental as it builds on existing methods.

The paper tackles the problem of enhancing multi-turn instruction following in large language models (LLMs) by introducing Parrot, which includes a method for collecting multi-turn instructions and a context-aware preference optimization strategy, resulting in up to 7.2% improvement in performance.

Humans often interact with large language models (LLMs) in multi-turn interaction to obtain desired answers or more information. However, most existing studies overlook the multi-turn instruction following ability of LLMs, in terms of training dataset, training method, and evaluation benchmark. In this paper, we introduce Parrot, a solution aiming to enhance multi-turn instruction following for LLMs. First, we introduce an efficient but effective method for collecting multi-turn instructions that feature human-like queries, such as anaphora and ellipsis. Second, we propose a context-aware preference optimization strategy to further enhance LLMs for complex queries in multi-turn interaction. Moreover, to quantitatively evaluate LLMs in multi-turn instruction following, we manually build a multi-turn benchmark derived from existing ones. Extensive experiments show that Parrot improves current LLMs by up to 7.2% in multi-turn instruction following. Our dataset and codes will be open-sourced to facilitate future research.

View on arXiv PDF

Similar