NE · Jun 10, 2018

Deep Curiosity Loops in Social Environments

arXiv:1806.03645v1 · 1 citation
AI Analysis

This work addresses the challenge of unsupervised learning of social cues for AI agents, though it is incremental in applying curiosity-driven methods to social scenes.

The authors tackled the problem of enabling agents to learn social interaction features from visual scenes without supervision by developing a deep curiosity loop architecture that uses prediction error as intrinsic reward. Their results show that face and hand detection emerge as properties of this curiosity-based learning in social environments.

Inspired by infants' intrinsic motivation to learn, which values informative sensory channels contingent on their immediate social environment, we developed a deep curiosity loop (DCL) architecture. The DCL is composed of a learner, which attempts to learn a forward model of the agent's state-action transition, and a novel reinforcement-learning (RL) component, namely an Action-Convolution Deep Q-Network, which uses the learner's prediction error as reward. The agent's environment consists of visual social scenes taken from sitcom video streams; accordingly, both the learner and the RL component are constructed as deep convolutional neural networks. The agent's learner learns to predict the zeroth order of the dynamics of visual scenes, resulting in intrinsic rewards proportional to changes within its social environment. The sources of these socially informative changes within the sitcom are predominantly motions of faces and hands, leading to the unsupervised curiosity-based learning of social interaction features. Face and hand detection is represented by the value function, and the social interaction optical flow is represented by the policy. Our results suggest that face and hand detection are emergent properties of curiosity-based learning embedded in social environments.
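The link between a zeroth-order prediction and a reward proportional to scene change can be illustrated with a small sketch. This is not the authors' implementation: the paper uses deep convolutional networks and an Action-Convolution Deep Q-Network, whereas the toy below assumes a fixed "next frame equals current frame" predictor, treats an action as attending to one image patch, and uses mean squared error as the prediction error. All function names and the 8x8 random frames are hypothetical.

```python
import numpy as np

def zeroth_order_prediction(frame):
    """Assumed zeroth-order forward model: the next frame is predicted
    to be identical to the current one."""
    return frame.copy()

def intrinsic_reward(predicted, actual):
    """Prediction error (mean squared error), used here as the intrinsic reward."""
    return float(np.mean((predicted - actual) ** 2))

def patch(frame, top, left, size):
    """Crop a square patch; attending to a patch stands in for an action (assumption)."""
    return frame[top:top + size, left:left + size]

# Toy scene: only the top-left 4x4 region changes between the two frames.
rng = np.random.default_rng(0)
frame_t = rng.random((8, 8))
frame_t1 = frame_t.copy()
frame_t1[0:4, 0:4] += 0.5

predicted_t1 = zeroth_order_prediction(frame_t)

# Reward for each of the four non-overlapping 4x4 patches.
rewards = {
    (top, left): intrinsic_reward(
        patch(predicted_t1, top, left, 4),
        patch(frame_t1, top, left, 4),
    )
    for top in (0, 4)
    for left in (0, 4)
}

# The patch with the largest prediction error is the most "interesting" one.
best = max(rewards, key=rewards.get)
print(best, rewards[best])
```

In this toy setting the only patch with a nonzero reward is the one that changed, which mirrors the abstract's claim that rewards concentrate on moving regions such as faces and hands. A learned value function over such rewards would then act as a detector of those regions.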

