CL AI CYJul 17, 2024

PersLLM: A Personified Training Approach for Large Language Models

Zheni Zeng, Jiayi Chen, Huimin Chen, Yukun Yan, Yuxuan Chen, Zhenghao Liu, Zhiyuan Liu, Maosong Sun

arXiv:2407.12393v55.56 citationsh-index: 31Has Code

Originality Incremental advance

AI Analysis

This work addresses the challenge of creating more human-like and dynamic LLMs for applications requiring personalized interactions, though it appears incremental by building on existing techniques like Chain-of-Thought prompting and DPO.

The authors tackled the problem of insufficient data usage and rigid behavior patterns in personifying large language models (LLMs) by proposing PersLLM, a framework that improves data construction and model tuning, resulting in enhanced personality capture and more natural opinion communication as validated by automated metrics and human evaluations.

Large language models (LLMs) exhibit human-like intelligence, enabling them to simulate human behavior and support various applications that require both humanized communication and extensive knowledge reserves. Efforts are made to personify LLMs with special training data or hand-crafted prompts, while correspondingly faced with challenges such as insufficient data usage or rigid behavior patterns. Consequently, personified LLMs fail to capture personified knowledge or express persistent opinion. To fully unlock the potential of LLM personification, we propose PersLLM, a framework for better data construction and model tuning. For insufficient data usage, we incorporate strategies such as Chain-of-Thought prompting and anti-induction, improving the quality of data construction and capturing the personality experiences, knowledge, and thoughts more comprehensively. For rigid behavior patterns, we design the tuning process and introduce automated DPO to enhance the specificity and dynamism of the models' personalities, which leads to a more natural opinion communication. Both automated metrics and expert human evaluations demonstrate the effectiveness of our approach. Case studies in human-machine interactions and multi-agent systems further suggest potential application scenarios and future directions for LLM personification.

View on arXiv PDF Code

Similar