CLLGApr 16, 2024

Self-playing Adversarial Language Game Enhances LLM Reasoning

arXiv:2404.10642v3102 citationsh-index: 4Has CodeNIPS
Originality Incremental advance
AI Analysis

This work addresses the challenge of improving reasoning in LLMs, which is crucial for AI applications, though it appears incremental as it builds on existing self-play and reinforcement learning techniques.

The authors tackled the problem of enhancing large language models' reasoning abilities by training them through self-play in an adversarial language game, resulting in uniform performance improvements across multiple reasoning benchmarks.

We explore the potential of self-play training for large language models (LLMs) in a two-player adversarial language game called Adversarial Taboo. In this game, an attacker and a defender communicate around a target word only visible to the attacker. The attacker aims to induce the defender to speak the target word unconsciously, while the defender tries to infer the target word from the attacker's utterances. To win the game, both players must have sufficient knowledge about the target word and high-level reasoning ability to infer and express in this information-reserved conversation. Hence, we are curious about whether LLMs' reasoning ability can be further enhanced by Self-Playing this Adversarial language Game (SPAG). With this goal, we select several open-source LLMs and let each act as the attacker and play with a copy of itself as the defender on an extensive range of target words. Through reinforcement learning on the game outcomes, we observe that the LLMs' performances uniformly improve on a broad range of reasoning benchmarks. Furthermore, iteratively adopting this self-play process can continuously promote LLMs' reasoning abilities. The code is available at https://github.com/Linear95/SPAG.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes