AIMay 23, 2022

Using Natural Language and Program Abstractions to Instill Human Inductive Biases in Machines

Sreejan Kumar, Carlos G. Correa, Ishita Dasgupta, Raja Marjieh, Michael Y. Hu, Robert D. Hawkins, Nathaniel D. Daw, Jonathan D. Cohen, Karthik Narasimhan, Thomas L. Griffiths

arXiv:2205.11558v323.434 citationsh-index: 99Has Code

Originality Incremental advance

AI Analysis

This work addresses the challenge of making AI agents learn more like humans, which is incremental as it builds on existing meta-learning and representation learning methods.

The paper tackles the problem of aligning meta-learning agents' inductive biases with human-like strategies by co-training them on natural language descriptions and program abstractions, resulting in agents that exhibit more human-like behavior compared to less abstract controls.

Strong inductive biases give humans the ability to quickly learn to perform a variety of tasks. Although meta-learning is a method to endow neural networks with useful inductive biases, agents trained by meta-learning may sometimes acquire very different strategies from humans. We show that co-training these agents on predicting representations from natural language task descriptions and programs induced to generate such tasks guides them toward more human-like inductive biases. Human-generated language descriptions and program induction models that add new learned primitives both contain abstract concepts that can compress description length. Co-training on these representations result in more human-like behavior in downstream meta-reinforcement learning agents than less abstract controls (synthetic language descriptions, program induction without learned primitives), suggesting that the abstraction supported by these representations is key.

View on arXiv PDF Code

Similar