ROCVLGDec 9, 2021

Trajectory-Constrained Deep Latent Visual Attention for Improved Local Planning in Presence of Heterogeneous Terrain

arXiv:2112.04684v37 citations
Originality Incremental advance
AI Analysis

This work addresses visual navigation challenges for autonomous vehicles in heterogeneous terrain, representing an incremental improvement over existing attention-based methods.

The paper tackled the problem of local planning in visual navigation tasks by introducing a trajectory-constrained deep latent visual attention method, which improved generalization and learning efficiency compared to no-attention and self-attention alternatives in off-road and slippery terrain settings.

We present a reward-predictive, model-based deep learning method featuring trajectory-constrained visual attention for local planning in visual navigation tasks. Our method learns to place visual attention at locations in latent image space which follow trajectories caused by vehicle control actions to enhance predictive accuracy during planning. The attention model is jointly optimized by the task-specific loss and an additional trajectory-constraint loss, allowing adaptability yet encouraging a regularized structure for improved generalization and reliability. Importantly, visual attention is applied in latent feature map space instead of raw image space to promote efficient planning. We validated our model in visual navigation tasks of planning low turbulence, collision-free trajectories in off-road settings and hill climbing with locking differentials in the presence of slippery terrain. Experiments involved randomized procedural generated simulation and real-world environments. We found our method improved generalization and learning efficiency when compared to no-attention and self-attention alternatives.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes