LGNCNov 23, 2023

Multi-intention Inverse Q-learning for Interpretable Behavior Representation

arXiv:2311.13870v411 citationsh-index: 29
Originality Incremental advance
AI Analysis

This work addresses the problem of interpretable behavior representation for neuroscience and cognitive science, offering an incremental improvement over existing multi-intention frameworks.

The paper tackled the challenge of inferring discrete time-varying rewards in inverse reinforcement learning for understanding natural decision-making, introducing hierarchical inverse Q-learning (HIQL) which outperformed benchmarks in behavior prediction on simulated and real animal datasets, producing interpretable reward functions.

In advancing the understanding of natural decision-making processes, inverse reinforcement learning (IRL) methods have proven instrumental in reconstructing animal's intentions underlying complex behaviors. Given the recent development of a continuous-time multi-intention IRL framework, there has been persistent inquiry into inferring discrete time-varying rewards with IRL. To address this challenge, we introduce the class of hierarchical inverse Q-learning (HIQL) algorithms. Through an unsupervised learning process, HIQL divides expert trajectories into multiple intention segments, and solves the IRL problem independently for each. Applying HIQL to simulated experiments and several real animal behavior datasets, our approach outperforms current benchmarks in behavior prediction and produces interpretable reward functions. Our results suggest that the intention transition dynamics underlying complex decision-making behavior is better modeled by a step function instead of a smoothly varying function. This advancement holds promise for neuroscience and cognitive science, contributing to a deeper understanding of decision-making and uncovering underlying brain mechanisms.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes