LGAIJan 21, 2022

Reinforcement Learning Your Way: Agent Characterization through Policy Regularization

arXiv:2201.10003v18 citations
Originality Incremental advance
AI Analysis

This addresses the problem of explainability in reinforcement learning for researchers and practitioners, offering an intrinsic characterization method that is incremental in nature.

The paper tackles the opacity of reinforcement learning algorithms by developing a method to imbue characteristic behaviors into agents' policies through regularization of their objective functions, guiding behavior during learning and connecting it with model explanation, with formal arguments and empirical evidence provided for viability.

The increased complexity of state-of-the-art reinforcement learning (RL) algorithms have resulted in an opacity that inhibits explainability and understanding. This has led to the development of several post-hoc explainability methods that aim to extract information from learned policies thus aiding explainability. These methods rely on empirical observations of the policy and thus aim to generalize a characterization of agents' behaviour. In this study, we have instead developed a method to imbue a characteristic behaviour into agents' policies through regularization of their objective functions. Our method guides the agents' behaviour during learning which results in an intrinsic characterization; it connects the learning process with model explanation. We provide a formal argument and empirical evidence for the viability of our method. In future work, we intend to employ it to develop agents that optimize individual financial customers' investment portfolios based on their spending personalities.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes