AI LG | Feb 17

When Remembering and Planning are Worth it: Navigating under Change

Omid Madani, J. Brian Burns, Reza Eghbali, Thomas L. Dean

arXiv:2602.15274v12.41 citations | h-index: 39

Originality: Incremental advance

AI Analysis

This work addresses navigation for agents in dynamic environments. It is an incremental advance because it builds on existing memory and planning techniques.

The paper tackles spatial navigation in changing, uncertain environments by exploring memory and planning strategies. It finds that an agent that uses non-stationary probability learning to update its memories and plans with imperfect maps becomes substantially more efficient than simpler agents as task difficulty increases, with the gains scaling with factors such as distance to goal.

We explore how different types and uses of memory can aid spatial navigation in changing, uncertain environments. In the simple foraging task we study, our agent must find its way each day from its home, through barriers, to food. Moreover, the world is non-stationary: from day to day, the locations of the barriers and food may change, and the agent's sensing, such as its location information, is uncertain and very limited. Any model construction, such as a map, and any use of it, such as planning, needs to be robust against these challenges, and if any learning is to be useful, it needs to be adequately fast. We look at a range of strategies, from simple to sophisticated, with various uses of memory and learning. We find that an architecture that can incorporate multiple strategies is required to handle (sub)tasks of different natures, in particular exploration and search, when the food location is not known, and planning a good path to a remembered (likely) food location. An agent that uses non-stationary probability learning techniques to keep updating its (episodic) memories, and that uses those memories to build maps and plan on the fly, can be increasingly and substantially more efficient than the simpler (minimal-memory) agents as task difficulties such as distance to goal are raised. These maps are imperfect, i.e. noisy and limited to the agent's experience, and the advantage holds as long as the uncertainty, from localization and change, is not too large.
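
To make the idea of "non-stationary probability learning plus planning on an imperfect map" concrete, here is a minimal sketch. It is not the paper's method: the abstract does not specify the learning rule or the planner, so an exponential moving average and Dijkstra's algorithm are used as stand-ins. All names (`DecayingBelief`, `plan`, `rate`, `prior`, `penalty`) are hypothetical and chosen only for illustration.

```python
import heapq


class DecayingBelief:
    """Per-cell barrier probability tracked with an exponential moving average.

    Assumption: an EMA stands in for the paper's unspecified
    non-stationary probability learning technique.
    """

    def __init__(self, rate=0.3, prior=0.1):
        self.rate = rate    # hypothetical learning rate; higher forgets old days faster
        self.prior = prior  # assumed barrier probability for cells never observed
        self.p = {}         # cell -> estimated probability that a barrier is there

    def observe(self, cell, blocked):
        # Move the estimate a fraction `rate` toward the newest observation.
        old = self.p.get(cell, self.prior)
        self.p[cell] = old + self.rate * (float(blocked) - old)

    def prob(self, cell):
        return self.p.get(cell, self.prior)


def plan(belief, start, goal, size, penalty=10.0):
    """Dijkstra on a size x size grid; entering a cell costs more if it is likely blocked."""
    frontier = [(0.0, start)]
    cost = {start: 0.0}
    parent = {start: None}
    while frontier:
        c, cell = heapq.heappop(frontier)
        if cell == goal:
            break
        if c > cost[cell]:
            continue  # stale queue entry
        x, y = cell
        for nxt in ((x + 1, y), (x - 1, y), (x, y + 1), (x, y - 1)):
            if not (0 <= nxt[0] < size and 0 <= nxt[1] < size):
                continue
            new = c + 1.0 + penalty * belief.prob(nxt)
            if new < cost.get(nxt, float("inf")):
                cost[nxt] = new
                parent[nxt] = cell
                heapq.heappush(frontier, (new, nxt))
    # Walk parent links back from the goal to recover the path.
    path, cell = [], goal
    while cell is not None:
        path.append(cell)
        cell = parent[cell]
    return path[::-1]
```

In this sketch, a cell seen blocked on several recent days is routed around, and the same cell becomes attractive again after it has been seen free for a few days, because old evidence decays. Soft costs rather than hard obstacles are a design choice that keeps the planner usable when the map is noisy.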
