CL, Jan 9

GIFT: Games as Informal Training for Generalizable LLMs

Nuoyan Lyu, Bingbing Xu, Weihao Meng, Yige Yuan, Yang Zhang, Zhiyong Huang, Tat-Seng Chua, Huawei Shen

arXiv:2601.05633v1 0.6 h-index: 12

Originality: Incremental advance

AI Analysis

The paper addresses the problem of enhancing generalizable intelligence in LLMs for broader AI applications. It appears incremental, however, because it builds on existing reinforcement learning methods.

The paper tackles the gap in LLMs' practical wisdom and generalizable intelligence by proposing game-based informal learning, showing that a nested training framework prevents task interference and significantly improves generalization across ability-oriented benchmarks.

While Large Language Models (LLMs) have achieved remarkable success in formal learning tasks such as mathematics and code generation, they still struggle with the "practical wisdom" and generalizable intelligence, such as strategic creativity and social reasoning, that characterize human cognition. This gap arises from a lack of informal learning, which thrives on interactive feedback rather than goal-oriented instruction. In this paper, we propose treating Games as a primary environment for LLM informal learning, leveraging their intrinsic reward signals and abstracted complexity to cultivate diverse competencies. To address the performance degradation observed in multi-task learning, we introduce a Nested Training Framework. Unlike naive task mixing, which optimizes an implicit "OR" objective, our framework employs sequential task composition to enforce an explicit "AND" objective, compelling the model to master multiple abilities simultaneously to achieve maximal rewards. Using GRPO-based reinforcement learning across Matrix Games, TicTacToe, and Who's the Spy games, we demonstrate that integrating game-based informal learning not only prevents task interference but also significantly bolsters the model's generalization across broad ability-oriented benchmarks. The framework and implementation are publicly available.
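
The contrast between the "OR" and "AND" objectives can be illustrated with a minimal sketch. It is not the authors' implementation. The function names, the task keys, and the use of a product to combine per-stage rewards are assumptions made for illustration only; the abstract does not specify how the nested reward is computed. The last function shows the group-relative reward standardization commonly associated with GRPO, here using the population standard deviation as a simplifying choice.

```python
from statistics import mean, pstdev


def mixed_reward(task_rewards, sampled_task):
    """Naive task mixing (hypothetical): an episode is scored on one sampled task.

    A policy can earn full reward by being good at whichever task is drawn,
    which is the implicit "OR" behaviour described in the abstract.
    """
    return task_rewards[sampled_task]


def nested_reward(task_rewards):
    """Sequential composition (hypothetical): stages are chained in one episode.

    Assumption: per-stage rewards lie in [0, 1] and are multiplied, so the
    maximum is reached only when every stage succeeds ("AND" behaviour).
    """
    total = 1.0
    for reward in task_rewards.values():
        total *= reward
    return total


def group_relative_advantages(rewards, eps=1e-8):
    """Standardize rewards within one sampled group of rollouts (GRPO-style)."""
    mu = mean(rewards)
    sigma = pstdev(rewards)  # population std; a simplifying assumption
    return [(r - mu) / (sigma + eps) for r in rewards]


# One rollout that wins two games but loses the third (made-up values).
episode = {"matrix_game": 1.0, "tictactoe": 0.0, "whos_the_spy": 1.0}

or_score = mixed_reward(episode, "matrix_game")   # full reward on the sampled task
and_score = nested_reward(episode)                # zero, because one stage failed

# Advantages for a group of four rollouts scored with the nested reward.
advantages = group_relative_advantages([1.0, 0.0, 0.0, 1.0])
```

Under these assumptions, a rollout that fails any single stage receives no nested reward, so only rollouts that succeed at every stage stand out within their group.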

