LGAIJan 25, 2024

Multi-granularity Knowledge Transfer for Continual Reinforcement Learning

arXiv:2401.15098v37 citationsIJCAI
AI Analysis

This addresses a bottleneck in continual reinforcement learning for agents handling diverse tasks, representing an incremental improvement.

The paper tackles the problem of insufficient knowledge transfer across diverse tasks in continual reinforcement learning by proposing MT-Core, a framework that enhances coarse-grained knowledge transfer using multi-granularity policy learning, and experimental results show its superiority over popular baselines.

Continual reinforcement learning (CRL) empowers RL agents with the ability to learn a sequence of tasks, accumulating knowledge learned in the past and using the knowledge for problemsolving or future task learning. However, existing methods often focus on transferring fine-grained knowledge across similar tasks, which neglects the multi-granularity structure of human cognitive control, resulting in insufficient knowledge transfer across diverse tasks. To enhance coarse-grained knowledge transfer, we propose a novel framework called MT-Core (as shorthand for Multi-granularity knowledge Transfer for Continual reinforcement learning). MT-Core has a key characteristic of multi-granularity policy learning: 1) a coarsegrained policy formulation for utilizing the powerful reasoning ability of the large language model (LLM) to set goals, and 2) a fine-grained policy learning through RL which is oriented by the goals. We also construct a new policy library (knowledge base) to store policies that can be retrieved for multi-granularity knowledge transfer. Experimental results demonstrate the superiority of the proposed MT-Core in handling diverse CRL tasks versus popular baselines.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes