NC AI LG NE SYSep 25, 2021

Emergent behavior and neural dynamics in artificial agents tracking turbulent plumes

Satpreet Harcharan Singh, Floris van Breugel, Rajesh P. N. Rao, Bingni Wen Brunton

arXiv:2109.12434v23.34 citationsHas Code

Originality Incremental advance

AI Analysis

This work addresses plume tracking for robotics and neuroscience, offering insights into multi-sensory integration and control strategies, but it is incremental as it builds on existing reinforcement learning methods applied to a simulated domain.

The study tackled the problem of turbulent plume source tracking by training artificial agents with deep reinforcement learning, finding that emergent behaviors resembled those of flying insects and that longer memory timescales were essential for tracking plumes with changing wind direction.

Tracking a turbulent plume to locate its source is a complex control problem because it requires multi-sensory integration and must be robust to intermittent odors, changing wind direction, and variable plume statistics. This task is routinely performed by flying insects, often over long distances, in pursuit of food or mates. Several aspects of this remarkable behavior have been studied in detail in many experimental studies. Here, we take a complementary in silico approach, using artificial agents trained with reinforcement learning to develop an integrated understanding of the behaviors and neural computations that support plume tracking. Specifically, we use deep reinforcement learning (DRL) to train recurrent neural network (RNN) agents to locate the source of simulated turbulent plumes. Interestingly, the agents' emergent behaviors resemble those of flying insects, and the RNNs learn to represent task-relevant variables, such as head direction and time since last odor encounter. Our analyses suggest an intriguing experimentally testable hypothesis for tracking plumes in changing wind direction -- that agents follow local plume shape rather than the current wind direction. While reflexive short-memory behaviors are sufficient for tracking plumes in constant wind, longer timescales of memory are essential for tracking plumes that switch direction. At the level of neural dynamics, the RNNs' population activity is low-dimensional and organized into distinct dynamical structures, with some correspondence to behavioral modules. Our in silico approach provides key intuitions for turbulent plume tracking strategies and motivates future targeted experimental and theoretical developments.

View on arXiv PDF Code

Similar