AINCOct 14, 2022

Adaptive patch foraging in deep reinforcement learning agents

arXiv:2210.08085v212 citationsh-index: 92
Originality Incremental advance
AI Analysis

This work addresses a gap in artificial intelligence by applying deep reinforcement learning to an ecological optimization problem, with implications for understanding biological intelligence and developing adaptive AI agents.

The paper tackled the problem of patch foraging in deep reinforcement learning agents, showing that these agents can learn adaptive foraging patterns similar to biological foragers and approach optimal behavior with temporal discounting, achieving performance within 15% of theoretical optimum.

Patch foraging is one of the most heavily studied behavioral optimization challenges in biology. However, despite its importance to biological intelligence, this behavioral optimization problem is understudied in artificial intelligence research. Patch foraging is especially amenable to study given that it has a known optimal solution, which may be difficult to discover given current techniques in deep reinforcement learning. Here, we investigate deep reinforcement learning agents in an ecological patch foraging task. For the first time, we show that machine learning agents can learn to patch forage adaptively in patterns similar to biological foragers, and approach optimal patch foraging behavior when accounting for temporal discounting. Finally, we show emergent internal dynamics in these agents that resemble single-cell recordings from foraging non-human primates, which complements experimental and theoretical work on the neural mechanisms of biological foraging. This work suggests that agents interacting in complex environments with ecologically valid pressures arrive at common solutions, suggesting the emergence of foundational computations behind adaptive, intelligent behavior in both biological and artificial agents.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes